WO1993015763A1 - Vaccinal polypeptides - Google Patents
Vaccinal polypeptides Download PDFInfo
- Publication number
- WO1993015763A1 WO1993015763A1 PCT/US1993/001451 US9301451W WO9315763A1 WO 1993015763 A1 WO1993015763 A1 WO 1993015763A1 US 9301451 W US9301451 W US 9301451W WO 9315763 A1 WO9315763 A1 WO 9315763A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- leu
- glu
- lys
- gly
- asp
- Prior art date
Links
- 108090000765 processed proteins & peptides Proteins 0.000 title claims description 67
- 102000004196 processed proteins & peptides Human genes 0.000 title claims description 46
- 229920001184 polypeptide Polymers 0.000 title claims description 43
- 229960005486 vaccine Drugs 0.000 claims abstract description 42
- 208000037798 influenza B Diseases 0.000 claims abstract description 4
- 108090000623 proteins and genes Proteins 0.000 claims description 158
- 102000004169 proteins and genes Human genes 0.000 claims description 145
- 150000001413 amino acids Chemical class 0.000 claims description 92
- 239000012634 fragment Substances 0.000 claims description 62
- 241000700605 Viruses Species 0.000 claims description 47
- 108020004414 DNA Proteins 0.000 claims description 40
- 241000712461 unidentified influenza virus Species 0.000 claims description 34
- 206010022000 influenza Diseases 0.000 claims description 31
- 230000002163 immunogen Effects 0.000 claims description 27
- 239000013612 plasmid Substances 0.000 claims description 26
- 230000004224 protection Effects 0.000 claims description 23
- 108091026890 Coding region Proteins 0.000 claims description 17
- 241001465754 Metazoa Species 0.000 claims description 17
- 241000252870 H3N2 subtype Species 0.000 claims description 13
- 229940001442 combination vaccine Drugs 0.000 claims description 11
- 102000053602 DNA Human genes 0.000 claims description 9
- 208000015181 infectious disease Diseases 0.000 claims description 5
- 244000005700 microbiome Species 0.000 claims description 4
- 230000004936 stimulating effect Effects 0.000 claims 3
- 125000001429 N-terminal alpha-amino-acid group Chemical group 0.000 claims 2
- 239000000203 mixture Substances 0.000 abstract description 28
- 208000037797 influenza A Diseases 0.000 abstract description 8
- 230000036039 immunity Effects 0.000 abstract description 4
- 235000018102 proteins Nutrition 0.000 description 134
- 229940024606 amino acid Drugs 0.000 description 77
- 235000001014 amino acid Nutrition 0.000 description 77
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 69
- 108010047495 alanylglycine Proteins 0.000 description 34
- 108020001507 fusion proteins Proteins 0.000 description 34
- 102000037865 fusion proteins Human genes 0.000 description 34
- 150000007523 nucleic acids Chemical group 0.000 description 34
- SNAWMGHSCHKSDK-GUBZILKMSA-N Asp-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N SNAWMGHSCHKSDK-GUBZILKMSA-N 0.000 description 29
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 27
- 108091028043 Nucleic acid sequence Proteins 0.000 description 26
- 108020004707 nucleic acids Proteins 0.000 description 26
- 102000039446 nucleic acids Human genes 0.000 description 26
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 25
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 25
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 24
- VDMABHYXBULDGN-LAEOZQHASA-N Gln-Val-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VDMABHYXBULDGN-LAEOZQHASA-N 0.000 description 24
- NTRAGDHVSGKUSF-AVGNSLFASA-N Leu-Arg-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NTRAGDHVSGKUSF-AVGNSLFASA-N 0.000 description 24
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 description 24
- DNDVVILEHVMWIS-LPEHRKFASA-N Met-Asp-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DNDVVILEHVMWIS-LPEHRKFASA-N 0.000 description 24
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 24
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 24
- RESAHOSBQHMOKH-KKUMJFAQSA-N Cys-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CS)N RESAHOSBQHMOKH-KKUMJFAQSA-N 0.000 description 22
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 21
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 20
- PPMTUXJSQDNUDE-CIUDSAMLSA-N Asn-Glu-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PPMTUXJSQDNUDE-CIUDSAMLSA-N 0.000 description 19
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 19
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 19
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 19
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 18
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 18
- FHQRLHFYVZAQHU-IUCAKERBSA-N Gly-Lys-Gln Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O FHQRLHFYVZAQHU-IUCAKERBSA-N 0.000 description 18
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 18
- VVQJGYPTIYOFBR-IHRRRGAJSA-N Leu-Lys-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N VVQJGYPTIYOFBR-IHRRRGAJSA-N 0.000 description 18
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 18
- CAGTXGDOIFXLPC-KZVJFYERSA-N Thr-Arg-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N CAGTXGDOIFXLPC-KZVJFYERSA-N 0.000 description 18
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 18
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 18
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 18
- 210000004027 cell Anatomy 0.000 description 18
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 18
- 241000699670 Mus sp. Species 0.000 description 17
- WVGKPKDWYQXWLU-BZSNNMDCSA-N Tyr-His-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CCCCN)C(=O)O)N)O WVGKPKDWYQXWLU-BZSNNMDCSA-N 0.000 description 17
- IBYOLNARKHMLBG-WHOFXGATSA-N Gly-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IBYOLNARKHMLBG-WHOFXGATSA-N 0.000 description 16
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 16
- 239000002671 adjuvant Substances 0.000 description 16
- 108010017391 lysylvaline Proteins 0.000 description 15
- MCKSLROAGSDNFC-ACZMJKKPSA-N Ala-Asp-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MCKSLROAGSDNFC-ACZMJKKPSA-N 0.000 description 14
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 14
- 238000000034 method Methods 0.000 description 14
- 108010073969 valyllysine Proteins 0.000 description 14
- PLVAAIPKSGUXDV-WHFBIAKZSA-N Asn-Gly-Cys Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)C(=O)N PLVAAIPKSGUXDV-WHFBIAKZSA-N 0.000 description 13
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 13
- QLFAPXUXEBAWEK-NHCYSSNCSA-N Lys-Val-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QLFAPXUXEBAWEK-NHCYSSNCSA-N 0.000 description 13
- TZXFLDNBYYGLKA-BZSNNMDCSA-N Tyr-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 TZXFLDNBYYGLKA-BZSNNMDCSA-N 0.000 description 13
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 13
- 230000004927 fusion Effects 0.000 description 13
- 108010089804 glycyl-threonine Proteins 0.000 description 13
- LEFKSBYHUGUWLP-ACZMJKKPSA-N Asn-Ala-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LEFKSBYHUGUWLP-ACZMJKKPSA-N 0.000 description 12
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 12
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 12
- 241000197306 H1N1 subtype Species 0.000 description 12
- NJGMALCNYAMYCB-JRQIVUDYSA-N Thr-Tyr-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJGMALCNYAMYCB-JRQIVUDYSA-N 0.000 description 12
- 108010056582 methionylglutamic acid Proteins 0.000 description 12
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 11
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 11
- AWOMRHGUWFBDNU-ZPFDUUQYSA-N Met-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N AWOMRHGUWFBDNU-ZPFDUUQYSA-N 0.000 description 11
- 108010013835 arginine glutamate Proteins 0.000 description 11
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 11
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 11
- XPGVTUBABLRGHY-BIIVOSGPSA-N Asp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N XPGVTUBABLRGHY-BIIVOSGPSA-N 0.000 description 10
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 10
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 10
- SABZDFAAOJATBR-QWRGUYRKSA-N Gly-Cys-Phe Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SABZDFAAOJATBR-QWRGUYRKSA-N 0.000 description 10
- WYKXJGWSJUULSL-AVGNSLFASA-N His-Val-Arg Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](CCCNC(=N)N)C(=O)O WYKXJGWSJUULSL-AVGNSLFASA-N 0.000 description 10
- ZDNNDIJTUHQCAM-MXAVVETBSA-N Ile-Ser-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ZDNNDIJTUHQCAM-MXAVVETBSA-N 0.000 description 10
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 10
- AETNZPKUUYYYEK-CIUDSAMLSA-N Met-Glu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AETNZPKUUYYYEK-CIUDSAMLSA-N 0.000 description 10
- FXYXBEZMRACDDR-KKUMJFAQSA-N Phe-His-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O FXYXBEZMRACDDR-KKUMJFAQSA-N 0.000 description 10
- 108010062796 arginyllysine Proteins 0.000 description 10
- NSZJXSMPGXGNJX-VWCSCAALSA-N (2s)-2-[[(2s)-2-[[(2s,3s)-2-[[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]propanoyl]amino]-3-methylpentanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-3-hydroxypropanoic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 NSZJXSMPGXGNJX-VWCSCAALSA-N 0.000 description 9
- KHGPWGKPYHPOIK-QWRGUYRKSA-N Asp-Gly-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KHGPWGKPYHPOIK-QWRGUYRKSA-N 0.000 description 9
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 9
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 9
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 9
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 9
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 9
- DUTMKEAPLLUGNO-JYJNAYRXSA-N Lys-Glu-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DUTMKEAPLLUGNO-JYJNAYRXSA-N 0.000 description 9
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 9
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 9
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 9
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 9
- 239000000427 antigen Substances 0.000 description 9
- 108091007433 antigens Proteins 0.000 description 9
- 102000036639 antigens Human genes 0.000 description 9
- 108010047857 aspartylglycine Proteins 0.000 description 9
- 108010059898 glycyl-tyrosyl-lysine Proteins 0.000 description 9
- 239000013598 vector Substances 0.000 description 9
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 8
- NONSEUUPKITYQT-BQBZGAKWSA-N Arg-Asn-Gly Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N)CN=C(N)N NONSEUUPKITYQT-BQBZGAKWSA-N 0.000 description 8
- BDMIFVIWCNLDCT-CIUDSAMLSA-N Asn-Arg-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O BDMIFVIWCNLDCT-CIUDSAMLSA-N 0.000 description 8
- ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N Asn-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N 0.000 description 8
- KRQFMDNIUOVRIF-KKUMJFAQSA-N Asp-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)O)N KRQFMDNIUOVRIF-KKUMJFAQSA-N 0.000 description 8
- XXAMCEGRCZQGEM-ZLUOBGJFSA-N Asp-Ser-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O XXAMCEGRCZQGEM-ZLUOBGJFSA-N 0.000 description 8
- UWXFFVQPAMBETM-ZLUOBGJFSA-N Cys-Asp-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O UWXFFVQPAMBETM-ZLUOBGJFSA-N 0.000 description 8
- OZSBRCONEMXYOJ-AVGNSLFASA-N Cys-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N OZSBRCONEMXYOJ-AVGNSLFASA-N 0.000 description 8
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 8
- ZXLZWUQBRYGDNS-CIUDSAMLSA-N Glu-Cys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N ZXLZWUQBRYGDNS-CIUDSAMLSA-N 0.000 description 8
- GUOWMVFLAJNPDY-CIUDSAMLSA-N Glu-Ser-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O GUOWMVFLAJNPDY-CIUDSAMLSA-N 0.000 description 8
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 8
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 8
- XVYKMNXXJXQKME-XEGUGMAKSA-N Gly-Ile-Tyr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XVYKMNXXJXQKME-XEGUGMAKSA-N 0.000 description 8
- LYDKQVYYCMYNMC-SRVKXCTJSA-N His-Lys-Cys Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CS)C(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 LYDKQVYYCMYNMC-SRVKXCTJSA-N 0.000 description 8
- 241000880493 Leptailurus serval Species 0.000 description 8
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 8
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 8
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 8
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 8
- WCNVGGZRTNHOOS-ULQDDVLXSA-N Pro-Lys-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O WCNVGGZRTNHOOS-ULQDDVLXSA-N 0.000 description 8
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 8
- OQPNSDWGAMFJNU-QWRGUYRKSA-N Ser-Gly-Tyr Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OQPNSDWGAMFJNU-QWRGUYRKSA-N 0.000 description 8
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 8
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 8
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 8
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 8
- 230000001939 inductive effect Effects 0.000 description 8
- 238000002347 injection Methods 0.000 description 8
- 239000007924 injection Substances 0.000 description 8
- 108010027338 isoleucylcysteine Proteins 0.000 description 8
- 108010009298 lysylglutamic acid Proteins 0.000 description 8
- 108010054155 lysyllysine Proteins 0.000 description 8
- 239000002773 nucleotide Substances 0.000 description 8
- 125000003729 nucleotide group Chemical group 0.000 description 8
- 108010012581 phenylalanylglutamate Proteins 0.000 description 8
- 230000004083 survival effect Effects 0.000 description 8
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 7
- OGUPCHKBOKJFMA-SRVKXCTJSA-N Arg-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N OGUPCHKBOKJFMA-SRVKXCTJSA-N 0.000 description 7
- YNQIDCRRTWGHJD-ZLUOBGJFSA-N Asp-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(O)=O YNQIDCRRTWGHJD-ZLUOBGJFSA-N 0.000 description 7
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 7
- SAKCBXNPWDRWPE-BQBZGAKWSA-N Asp-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)O)N SAKCBXNPWDRWPE-BQBZGAKWSA-N 0.000 description 7
- 108091035707 Consensus sequence Proteins 0.000 description 7
- LHLSSZYQFUNWRZ-NAKRPEOUSA-N Cys-Arg-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LHLSSZYQFUNWRZ-NAKRPEOUSA-N 0.000 description 7
- ZMWOJVAXTOUHAP-ZKWXMUAHSA-N Cys-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N ZMWOJVAXTOUHAP-ZKWXMUAHSA-N 0.000 description 7
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 7
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 7
- AQNYKMCFCCZEEL-JYJNAYRXSA-N Glu-Lys-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AQNYKMCFCCZEEL-JYJNAYRXSA-N 0.000 description 7
- IROABALAWGJQGM-OALUTQOASA-N Gly-Trp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)NC(=O)CN IROABALAWGJQGM-OALUTQOASA-N 0.000 description 7
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 7
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 7
- GNXGAVNTVNOCLL-SIUGBPQLSA-N Ile-Tyr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GNXGAVNTVNOCLL-SIUGBPQLSA-N 0.000 description 7
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 7
- SQRLLZAQNOQCEG-KKUMJFAQSA-N Lys-Tyr-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 SQRLLZAQNOQCEG-KKUMJFAQSA-N 0.000 description 7
- JHDNAOVJJQSMMM-GMOBBJLQSA-N Met-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCSC)N JHDNAOVJJQSMMM-GMOBBJLQSA-N 0.000 description 7
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 7
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 7
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 7
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 7
- VXYQOFXBIXKPCX-BQBZGAKWSA-N Ser-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N VXYQOFXBIXKPCX-BQBZGAKWSA-N 0.000 description 7
- SZRNDHWMVSFPSP-XKBZYTNZSA-N Ser-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N)O SZRNDHWMVSFPSP-XKBZYTNZSA-N 0.000 description 7
- IAOHCSQDQDWRQU-GUBZILKMSA-N Ser-Val-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IAOHCSQDQDWRQU-GUBZILKMSA-N 0.000 description 7
- KYWBVMKEYAEDIX-BPUTZDHNSA-N Trp-Met-Cys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CS)C(O)=O)=CNC2=C1 KYWBVMKEYAEDIX-BPUTZDHNSA-N 0.000 description 7
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 7
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 7
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 7
- JXCOEPXCBVCTRD-JYJNAYRXSA-N Val-Tyr-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JXCOEPXCBVCTRD-JYJNAYRXSA-N 0.000 description 7
- 125000000539 amino acid group Chemical group 0.000 description 7
- 230000002708 enhancing effect Effects 0.000 description 7
- 108010048994 glycyl-tyrosyl-alanine Proteins 0.000 description 7
- 108010081551 glycylphenylalanine Proteins 0.000 description 7
- 229930027917 kanamycin Natural products 0.000 description 7
- 229960000318 kanamycin Drugs 0.000 description 7
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 7
- 229930182823 kanamycin A Natural products 0.000 description 7
- 108010034529 leucyl-lysine Proteins 0.000 description 7
- 239000000463 material Substances 0.000 description 7
- JUEUYDRZJNQZGR-UHFFFAOYSA-N 2-[[2-[[2-[(2-amino-4-methylpentanoyl)amino]-4-methylpentanoyl]amino]acetyl]amino]-3-phenylpropanoic acid Chemical compound CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JUEUYDRZJNQZGR-UHFFFAOYSA-N 0.000 description 6
- FRFDXQWNDZMREB-ACZMJKKPSA-N Ala-Cys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O FRFDXQWNDZMREB-ACZMJKKPSA-N 0.000 description 6
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 6
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 6
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 6
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 6
- SJPZTWAYTJPPBI-GUBZILKMSA-N Asn-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SJPZTWAYTJPPBI-GUBZILKMSA-N 0.000 description 6
- MECFLTFREHAZLH-ACZMJKKPSA-N Asn-Glu-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N MECFLTFREHAZLH-ACZMJKKPSA-N 0.000 description 6
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 6
- TZFQICWZWFNIKU-KKUMJFAQSA-N Asn-Leu-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 TZFQICWZWFNIKU-KKUMJFAQSA-N 0.000 description 6
- ZYPWIUFLYMQZBS-SRVKXCTJSA-N Asn-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZYPWIUFLYMQZBS-SRVKXCTJSA-N 0.000 description 6
- LMIWYCWRJVMAIQ-NHCYSSNCSA-N Asn-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N LMIWYCWRJVMAIQ-NHCYSSNCSA-N 0.000 description 6
- ZELQAFZSJOBEQS-ACZMJKKPSA-N Asp-Asn-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZELQAFZSJOBEQS-ACZMJKKPSA-N 0.000 description 6
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 6
- UZNSWMFLKVKJLI-VHWLVUOQSA-N Asp-Ile-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O UZNSWMFLKVKJLI-VHWLVUOQSA-N 0.000 description 6
- VHUKCUHLFMRHOD-MELADBBJSA-N Asp-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O VHUKCUHLFMRHOD-MELADBBJSA-N 0.000 description 6
- VBIIZCXWOZDIHS-ACZMJKKPSA-N Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CS VBIIZCXWOZDIHS-ACZMJKKPSA-N 0.000 description 6
- POSRGGKLRWCUBE-CIUDSAMLSA-N Cys-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N POSRGGKLRWCUBE-CIUDSAMLSA-N 0.000 description 6
- KVCJEMHFLGVINV-ZLUOBGJFSA-N Cys-Ser-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KVCJEMHFLGVINV-ZLUOBGJFSA-N 0.000 description 6
- 241000588724 Escherichia coli Species 0.000 description 6
- PCKOTDPDHIBGRW-CIUDSAMLSA-N Gln-Cys-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N PCKOTDPDHIBGRW-CIUDSAMLSA-N 0.000 description 6
- HXOLDXKNWKLDMM-YVNDNENWSA-N Gln-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HXOLDXKNWKLDMM-YVNDNENWSA-N 0.000 description 6
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 6
- ITBHUUMCJJQUSC-LAEOZQHASA-N Glu-Ile-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O ITBHUUMCJJQUSC-LAEOZQHASA-N 0.000 description 6
- SJJHXJDSNQJMMW-SRVKXCTJSA-N Glu-Lys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SJJHXJDSNQJMMW-SRVKXCTJSA-N 0.000 description 6
- UERORLSAFUHDGU-AVGNSLFASA-N Glu-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N UERORLSAFUHDGU-AVGNSLFASA-N 0.000 description 6
- QOXDAWODGSIDDI-GUBZILKMSA-N Glu-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N QOXDAWODGSIDDI-GUBZILKMSA-N 0.000 description 6
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 6
- JPVGHHQGKPQYIL-KBPBESRZSA-N Gly-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 JPVGHHQGKPQYIL-KBPBESRZSA-N 0.000 description 6
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 6
- FOKISINOENBSDM-WLTAIBSBSA-N Gly-Thr-Tyr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FOKISINOENBSDM-WLTAIBSBSA-N 0.000 description 6
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 6
- 101710154606 Hemagglutinin Proteins 0.000 description 6
- 241000282412 Homo Species 0.000 description 6
- SLQVFYWBGNNOTK-BYULHYEWSA-N Ile-Gly-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N SLQVFYWBGNNOTK-BYULHYEWSA-N 0.000 description 6
- VLCMCYDZJCWPQT-VKOGCVSHSA-N Ile-Met-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N VLCMCYDZJCWPQT-VKOGCVSHSA-N 0.000 description 6
- FBGXMKUWQFPHFB-JBDRJPRFSA-N Ile-Ser-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N FBGXMKUWQFPHFB-JBDRJPRFSA-N 0.000 description 6
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 6
- DXYBNWJZJVSZAE-GUBZILKMSA-N Leu-Gln-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N DXYBNWJZJVSZAE-GUBZILKMSA-N 0.000 description 6
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 6
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 6
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 6
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 6
- RMHHNLKYPOOKQN-FXQIFTODSA-N Met-Cys-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O RMHHNLKYPOOKQN-FXQIFTODSA-N 0.000 description 6
- HLQWFLJOJRFXHO-CIUDSAMLSA-N Met-Glu-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O HLQWFLJOJRFXHO-CIUDSAMLSA-N 0.000 description 6
- SLQDSYZHHOKQSR-QXEWZRGKSA-N Met-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCSC SLQDSYZHHOKQSR-QXEWZRGKSA-N 0.000 description 6
- 101710093908 Outer capsid protein VP4 Proteins 0.000 description 6
- 101710135467 Outer capsid protein sigma-1 Proteins 0.000 description 6
- KAHUBGWSIQNZQQ-KKUMJFAQSA-N Phe-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KAHUBGWSIQNZQQ-KKUMJFAQSA-N 0.000 description 6
- OYQBFWWQSVIHBN-FHWLQOOXSA-N Phe-Glu-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O OYQBFWWQSVIHBN-FHWLQOOXSA-N 0.000 description 6
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 6
- WSAPMHXTQAOAQQ-BVSLBCMMSA-N Phe-Trp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CC=CC=C3)N WSAPMHXTQAOAQQ-BVSLBCMMSA-N 0.000 description 6
- 101710176177 Protein A56 Proteins 0.000 description 6
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 6
- 241000282898 Sus scrofa Species 0.000 description 6
- DKKHULUSOSWGHS-UWJYBYFXSA-N Tyr-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N DKKHULUSOSWGHS-UWJYBYFXSA-N 0.000 description 6
- UXUFNBVCPAWACG-SIUGBPQLSA-N Tyr-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N UXUFNBVCPAWACG-SIUGBPQLSA-N 0.000 description 6
- SLCSPPCQWUHPPO-JYJNAYRXSA-N Tyr-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SLCSPPCQWUHPPO-JYJNAYRXSA-N 0.000 description 6
- QFXVAFIHVWXXBJ-AVGNSLFASA-N Tyr-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O QFXVAFIHVWXXBJ-AVGNSLFASA-N 0.000 description 6
- UUYCNAXCCDNULB-QXEWZRGKSA-N Val-Arg-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O UUYCNAXCCDNULB-QXEWZRGKSA-N 0.000 description 6
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 description 6
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 6
- 108010005233 alanylglutamic acid Proteins 0.000 description 6
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 6
- 238000003776 cleavage reaction Methods 0.000 description 6
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 6
- 239000000185 hemagglutinin Substances 0.000 description 6
- 239000008188 pellet Substances 0.000 description 6
- 230000007017 scission Effects 0.000 description 6
- 108010071207 serylmethionine Proteins 0.000 description 6
- 241000894007 species Species 0.000 description 6
- 238000002255 vaccination Methods 0.000 description 6
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 5
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 5
- VEAIMHJZTIDCIH-KKUMJFAQSA-N Arg-Phe-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VEAIMHJZTIDCIH-KKUMJFAQSA-N 0.000 description 5
- QYXNFROWLZPWPC-FXQIFTODSA-N Asn-Glu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QYXNFROWLZPWPC-FXQIFTODSA-N 0.000 description 5
- OLVIPTLKNSAYRJ-YUMQZZPRSA-N Asn-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N OLVIPTLKNSAYRJ-YUMQZZPRSA-N 0.000 description 5
- NLDNNZKUSLAYFW-NHCYSSNCSA-N Asn-Lys-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLDNNZKUSLAYFW-NHCYSSNCSA-N 0.000 description 5
- NCXTYSVDWLAQGZ-ZKWXMUAHSA-N Asn-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O NCXTYSVDWLAQGZ-ZKWXMUAHSA-N 0.000 description 5
- WNGZKSVJFDZICU-XIRDDKMYSA-N Asp-Leu-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)O)N WNGZKSVJFDZICU-XIRDDKMYSA-N 0.000 description 5
- KACWACLNYLSVCA-VHWLVUOQSA-N Asp-Trp-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KACWACLNYLSVCA-VHWLVUOQSA-N 0.000 description 5
- 241000271566 Aves Species 0.000 description 5
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 5
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 5
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 5
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 5
- MFNUFCFRAZPJFW-JYJNAYRXSA-N Glu-Lys-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MFNUFCFRAZPJFW-JYJNAYRXSA-N 0.000 description 5
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 5
- FCKPEGOCSVZPNC-WHOFXGATSA-N Gly-Ile-Phe Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FCKPEGOCSVZPNC-WHOFXGATSA-N 0.000 description 5
- FXLVSYVJDPCIHH-STQMWFEESA-N Gly-Phe-Arg Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FXLVSYVJDPCIHH-STQMWFEESA-N 0.000 description 5
- VYMGAXSNYUFVCK-GUBZILKMSA-N His-Gln-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N VYMGAXSNYUFVCK-GUBZILKMSA-N 0.000 description 5
- 241000282414 Homo sapiens Species 0.000 description 5
- UDLAWRKOVFDKFL-PEFMBERDSA-N Ile-Asp-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UDLAWRKOVFDKFL-PEFMBERDSA-N 0.000 description 5
- GECLQMBTZCPAFY-PEFMBERDSA-N Ile-Gln-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GECLQMBTZCPAFY-PEFMBERDSA-N 0.000 description 5
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 5
- ZGKVPOSSTGHJAF-HJPIBITLSA-N Ile-Tyr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CO)C(=O)O)N ZGKVPOSSTGHJAF-HJPIBITLSA-N 0.000 description 5
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 5
- DBVWMYGBVFCRBE-CIUDSAMLSA-N Leu-Asn-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DBVWMYGBVFCRBE-CIUDSAMLSA-N 0.000 description 5
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 5
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 5
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 5
- CUHGAUZONORRIC-HJGDQZAQSA-N Lys-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O CUHGAUZONORRIC-HJGDQZAQSA-N 0.000 description 5
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 5
- YKUGPVXSDOOANW-KKUMJFAQSA-N Phe-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKUGPVXSDOOANW-KKUMJFAQSA-N 0.000 description 5
- ZUQACJLOHYRVPJ-DKIMLUQUSA-N Phe-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZUQACJLOHYRVPJ-DKIMLUQUSA-N 0.000 description 5
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 5
- GRSLLFZTTLBOQX-CIUDSAMLSA-N Ser-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N GRSLLFZTTLBOQX-CIUDSAMLSA-N 0.000 description 5
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 5
- YIUWWXVTYLANCJ-NAKRPEOUSA-N Ser-Ile-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YIUWWXVTYLANCJ-NAKRPEOUSA-N 0.000 description 5
- PQEQXWRVHQAAKS-SRVKXCTJSA-N Ser-Tyr-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=C(O)C=C1 PQEQXWRVHQAAKS-SRVKXCTJSA-N 0.000 description 5
- XYEXCEPTALHNEV-RCWTZXSCSA-N Thr-Arg-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XYEXCEPTALHNEV-RCWTZXSCSA-N 0.000 description 5
- UUSQVWOVUYMLJA-PPCPHDFISA-N Thr-Lys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UUSQVWOVUYMLJA-PPCPHDFISA-N 0.000 description 5
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 5
- VMBBTANKMSRJSS-JSGCOSHPSA-N Trp-Glu-Gly Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VMBBTANKMSRJSS-JSGCOSHPSA-N 0.000 description 5
- ZSSKZJBTPJBKFT-WZUXKDABSA-N Trp-Tyr-Gly-Tyr-His-His Chemical compound N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O ZSSKZJBTPJBKFT-WZUXKDABSA-N 0.000 description 5
- VCXWRWYFJLXITF-AUTRQRHGSA-N Tyr-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VCXWRWYFJLXITF-AUTRQRHGSA-N 0.000 description 5
- QNJYPWZACBACER-KKUMJFAQSA-N Tyr-Asp-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O QNJYPWZACBACER-KKUMJFAQSA-N 0.000 description 5
- 108010078144 glutaminyl-glycine Proteins 0.000 description 5
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 5
- 108010050848 glycylleucine Proteins 0.000 description 5
- 108010084389 glycyltryptophan Proteins 0.000 description 5
- 238000004519 manufacturing process Methods 0.000 description 5
- 230000003472 neutralizing effect Effects 0.000 description 5
- 108010051242 phenylalanylserine Proteins 0.000 description 5
- 230000003612 virological effect Effects 0.000 description 5
- NAARDJBSSPUGCF-FXQIFTODSA-N Arg-Cys-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N NAARDJBSSPUGCF-FXQIFTODSA-N 0.000 description 4
- HCIUUZGFTDTEGM-NAKRPEOUSA-N Arg-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N HCIUUZGFTDTEGM-NAKRPEOUSA-N 0.000 description 4
- HAJWYALLJIATCX-FXQIFTODSA-N Asn-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N HAJWYALLJIATCX-FXQIFTODSA-N 0.000 description 4
- FVKHEKVYFTZWDX-GHCJXIJMSA-N Asn-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N FVKHEKVYFTZWDX-GHCJXIJMSA-N 0.000 description 4
- ZMUQQMGITUJQTI-CIUDSAMLSA-N Asn-Leu-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMUQQMGITUJQTI-CIUDSAMLSA-N 0.000 description 4
- GZXOUBTUAUAVHD-ACZMJKKPSA-N Asn-Ser-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GZXOUBTUAUAVHD-ACZMJKKPSA-N 0.000 description 4
- ALTQTAKGRFLRLR-GUBZILKMSA-N Cys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N ALTQTAKGRFLRLR-GUBZILKMSA-N 0.000 description 4
- INFBPLSHYFALDE-ACZMJKKPSA-N Gln-Asn-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O INFBPLSHYFALDE-ACZMJKKPSA-N 0.000 description 4
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 4
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 4
- NPMSEUWUMOSEFM-CIUDSAMLSA-N Glu-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N NPMSEUWUMOSEFM-CIUDSAMLSA-N 0.000 description 4
- MIIGESVJEBDJMP-FHWLQOOXSA-N Glu-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 MIIGESVJEBDJMP-FHWLQOOXSA-N 0.000 description 4
- FKYQEVBRZSFAMJ-QWRGUYRKSA-N Gly-Ser-Tyr Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FKYQEVBRZSFAMJ-QWRGUYRKSA-N 0.000 description 4
- IDQNVIWPPWAFSY-AVGNSLFASA-N His-His-Gln Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O IDQNVIWPPWAFSY-AVGNSLFASA-N 0.000 description 4
- BOTVMTSMOUSDRW-GMOBBJLQSA-N Ile-Arg-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O BOTVMTSMOUSDRW-GMOBBJLQSA-N 0.000 description 4
- DXUJSRIVSWEOAG-NAKRPEOUSA-N Ile-Arg-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N DXUJSRIVSWEOAG-NAKRPEOUSA-N 0.000 description 4
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 4
- WTOAPTKSZJJWKK-HTFCKZLJSA-N Ile-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N WTOAPTKSZJJWKK-HTFCKZLJSA-N 0.000 description 4
- BALLIXFZYSECCF-QEWYBTABSA-N Ile-Gln-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N BALLIXFZYSECCF-QEWYBTABSA-N 0.000 description 4
- RQQCJTLBSJMVCR-DSYPUSFNSA-N Ile-Leu-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N RQQCJTLBSJMVCR-DSYPUSFNSA-N 0.000 description 4
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 4
- CKSXSQUVEYCDIW-AVGNSLFASA-N Lys-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N CKSXSQUVEYCDIW-AVGNSLFASA-N 0.000 description 4
- PHHYNOUOUWYQRO-XIRDDKMYSA-N Lys-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N PHHYNOUOUWYQRO-XIRDDKMYSA-N 0.000 description 4
- RDIILCRAWOSDOQ-CIUDSAMLSA-N Lys-Cys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RDIILCRAWOSDOQ-CIUDSAMLSA-N 0.000 description 4
- IUWMQCZOTYRXPL-ZPFDUUQYSA-N Lys-Ile-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O IUWMQCZOTYRXPL-ZPFDUUQYSA-N 0.000 description 4
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 4
- DNWBUCHHMRQWCZ-GUBZILKMSA-N Lys-Ser-Gln Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DNWBUCHHMRQWCZ-GUBZILKMSA-N 0.000 description 4
- 241000701076 Macacine alphaherpesvirus 1 Species 0.000 description 4
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 4
- KJKQUQXDEKMPDK-FXQIFTODSA-N Ser-Met-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O KJKQUQXDEKMPDK-FXQIFTODSA-N 0.000 description 4
- OHAJHDJOCKKJLV-LKXGYXEUSA-N Thr-Asp-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OHAJHDJOCKKJLV-LKXGYXEUSA-N 0.000 description 4
- CRZNCABIJLRFKZ-IUKAMOBKSA-N Thr-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N CRZNCABIJLRFKZ-IUKAMOBKSA-N 0.000 description 4
- BZTSQFWJNJYZSX-JRQIVUDYSA-N Thr-Tyr-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O BZTSQFWJNJYZSX-JRQIVUDYSA-N 0.000 description 4
- 239000007983 Tris buffer Substances 0.000 description 4
- SEXRBCGSZRCIPE-LYSGOOTNSA-N Trp-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O SEXRBCGSZRCIPE-LYSGOOTNSA-N 0.000 description 4
- XAGFODPZIPBFFR-UHFFFAOYSA-N aluminium Chemical compound [Al] XAGFODPZIPBFFR-UHFFFAOYSA-N 0.000 description 4
- 229910052782 aluminium Inorganic materials 0.000 description 4
- 230000008859 change Effects 0.000 description 4
- 239000003814 drug Substances 0.000 description 4
- 230000000694 effects Effects 0.000 description 4
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 4
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 4
- 108010015792 glycyllysine Proteins 0.000 description 4
- 108010087823 glycyltyrosine Proteins 0.000 description 4
- 108010064235 lysylglycine Proteins 0.000 description 4
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 4
- 230000001681 protective effect Effects 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 4
- IFKQPMZRDQZSHI-GHCJXIJMSA-N Ala-Ile-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O IFKQPMZRDQZSHI-GHCJXIJMSA-N 0.000 description 3
- PHJPKNUWWHRAOC-PEFMBERDSA-N Asn-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PHJPKNUWWHRAOC-PEFMBERDSA-N 0.000 description 3
- 101100245267 Caenorhabditis elegans pas-1 gene Proteins 0.000 description 3
- 108020004705 Codon Proteins 0.000 description 3
- 241000283073 Equus caballus Species 0.000 description 3
- FGSGPLRPQCZBSQ-AVGNSLFASA-N Glu-Phe-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O FGSGPLRPQCZBSQ-AVGNSLFASA-N 0.000 description 3
- UPADCCSMVOQAGF-LBPRGKRZSA-N Gly-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)CN)C(O)=O)=CNC2=C1 UPADCCSMVOQAGF-LBPRGKRZSA-N 0.000 description 3
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 3
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 3
- PNUFMLXHOLFRLD-KBPBESRZSA-N Gly-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 PNUFMLXHOLFRLD-KBPBESRZSA-N 0.000 description 3
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Natural products NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 3
- OAQJOXZPGHTJNA-NGTWOADLSA-N Ile-Trp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N OAQJOXZPGHTJNA-NGTWOADLSA-N 0.000 description 3
- YLMIDMSLKLRNHX-HSCHXYMDSA-N Leu-Trp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YLMIDMSLKLRNHX-HSCHXYMDSA-N 0.000 description 3
- 241000124008 Mammalia Species 0.000 description 3
- OSOLWRWQADPDIQ-DCAQKATOSA-N Met-Asp-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OSOLWRWQADPDIQ-DCAQKATOSA-N 0.000 description 3
- 102000005348 Neuraminidase Human genes 0.000 description 3
- 108010006232 Neuraminidase Proteins 0.000 description 3
- 108091034117 Oligonucleotide Proteins 0.000 description 3
- GMWNQSGWWGKTSF-LFSVMHDDSA-N Phe-Thr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMWNQSGWWGKTSF-LFSVMHDDSA-N 0.000 description 3
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 3
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 3
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 3
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 3
- UBDDORVPVLEECX-FJXKBIBVSA-N Thr-Gly-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UBDDORVPVLEECX-FJXKBIBVSA-N 0.000 description 3
- WTRQBSSQBKRNKV-MNSWYVGCSA-N Trp-Thr-Tyr Chemical compound C([C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)[C@H](O)C)C(O)=O)C1=CC=C(O)C=C1 WTRQBSSQBKRNKV-MNSWYVGCSA-N 0.000 description 3
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 3
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 3
- WNROFYMDJYEPJX-UHFFFAOYSA-K aluminium hydroxide Chemical compound [OH-].[OH-].[OH-].[Al+3] WNROFYMDJYEPJX-UHFFFAOYSA-K 0.000 description 3
- 239000003795 chemical substances by application Substances 0.000 description 3
- KRKNYBCHXYNGOX-UHFFFAOYSA-N citric acid Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O KRKNYBCHXYNGOX-UHFFFAOYSA-N 0.000 description 3
- 238000010276 construction Methods 0.000 description 3
- 239000003937 drug carrier Substances 0.000 description 3
- 230000003053 immunization Effects 0.000 description 3
- 238000000338 in vitro Methods 0.000 description 3
- 238000001727 in vivo Methods 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 108091005573 modified proteins Proteins 0.000 description 3
- 102000035118 modified proteins Human genes 0.000 description 3
- 238000000746 purification Methods 0.000 description 3
- 230000004044 response Effects 0.000 description 3
- 239000000725 suspension Substances 0.000 description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 3
- VBICKXHEKHSIBG-UHFFFAOYSA-N 1-monostearoylglycerol Chemical compound CCCCCCCCCCCCCCCCCC(=O)OCC(O)CO VBICKXHEKHSIBG-UHFFFAOYSA-N 0.000 description 2
- FATXTKJILXPNJL-UHFFFAOYSA-N 2-[[2-[2-[(2-amino-3-methylpentanoyl)amino]propanoylamino]acetyl]amino]-3-phenylpropanoic acid Chemical compound CCC(C)C(N)C(=O)NC(C)C(=O)NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 FATXTKJILXPNJL-UHFFFAOYSA-N 0.000 description 2
- IZHVBANLECCAGF-UHFFFAOYSA-N 2-hydroxy-3-(octadecanoyloxy)propyl octadecanoate Chemical compound CCCCCCCCCCCCCCCCCC(=O)OCC(O)COC(=O)CCCCCCCCCCCCCCCCC IZHVBANLECCAGF-UHFFFAOYSA-N 0.000 description 2
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 2
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 2
- HQJKCXHQNUCKMY-GHCJXIJMSA-N Ala-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C)N HQJKCXHQNUCKMY-GHCJXIJMSA-N 0.000 description 2
- QCTFKEJEIMPOLW-JURCDPSOSA-N Ala-Ile-Phe Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QCTFKEJEIMPOLW-JURCDPSOSA-N 0.000 description 2
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 2
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 2
- MUXONAMCEUBVGA-DCAQKATOSA-N Arg-Arg-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O MUXONAMCEUBVGA-DCAQKATOSA-N 0.000 description 2
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 2
- RKRSYHCNPFGMTA-CIUDSAMLSA-N Arg-Glu-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O RKRSYHCNPFGMTA-CIUDSAMLSA-N 0.000 description 2
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 2
- YQGZIRIYGHNSQO-ZPFDUUQYSA-N Arg-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YQGZIRIYGHNSQO-ZPFDUUQYSA-N 0.000 description 2
- VIINVRPKMUZYOI-DCAQKATOSA-N Arg-Met-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VIINVRPKMUZYOI-DCAQKATOSA-N 0.000 description 2
- BRCVLJZIIFBSPF-ZLUOBGJFSA-N Asn-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N BRCVLJZIIFBSPF-ZLUOBGJFSA-N 0.000 description 2
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 2
- JEPNYDRDYNSFIU-QXEWZRGKSA-N Asn-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(N)=O)C(O)=O JEPNYDRDYNSFIU-QXEWZRGKSA-N 0.000 description 2
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 2
- OLGCWMNDJTWQAG-GUBZILKMSA-N Asn-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(N)=O OLGCWMNDJTWQAG-GUBZILKMSA-N 0.000 description 2
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 2
- GURLOFOJBHRPJN-AAEUAGOBSA-N Asn-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N GURLOFOJBHRPJN-AAEUAGOBSA-N 0.000 description 2
- AYOAHKWVQLNPDM-HJGDQZAQSA-N Asn-Lys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AYOAHKWVQLNPDM-HJGDQZAQSA-N 0.000 description 2
- VBVKSAFJPVXMFJ-CIUDSAMLSA-N Asp-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N VBVKSAFJPVXMFJ-CIUDSAMLSA-N 0.000 description 2
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 2
- QCLHLXDWRKOHRR-GUBZILKMSA-N Asp-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N QCLHLXDWRKOHRR-GUBZILKMSA-N 0.000 description 2
- NRIFEOUAFLTMFJ-AAEUAGOBSA-N Asp-Gly-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NRIFEOUAFLTMFJ-AAEUAGOBSA-N 0.000 description 2
- OGTCOKZFOJIZFG-CIUDSAMLSA-N Asp-His-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O OGTCOKZFOJIZFG-CIUDSAMLSA-N 0.000 description 2
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 2
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 2
- 101100505161 Caenorhabditis elegans mel-32 gene Proteins 0.000 description 2
- UISYPAHPLXGLNH-ACZMJKKPSA-N Cys-Asn-Gln Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O UISYPAHPLXGLNH-ACZMJKKPSA-N 0.000 description 2
- VKAWJBQTFCBHQY-GUBZILKMSA-N Cys-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N VKAWJBQTFCBHQY-GUBZILKMSA-N 0.000 description 2
- 101100364969 Dictyostelium discoideum scai gene Proteins 0.000 description 2
- 241000283086 Equidae Species 0.000 description 2
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 2
- PHZYLYASFWHLHJ-FXQIFTODSA-N Gln-Asn-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PHZYLYASFWHLHJ-FXQIFTODSA-N 0.000 description 2
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 2
- MCAVASRGVBVPMX-FXQIFTODSA-N Gln-Glu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MCAVASRGVBVPMX-FXQIFTODSA-N 0.000 description 2
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 2
- KHGGWBRVRPHFMH-PEFMBERDSA-N Gln-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHGGWBRVRPHFMH-PEFMBERDSA-N 0.000 description 2
- MTCXQQINVAFZKW-MNXVOIDGSA-N Gln-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MTCXQQINVAFZKW-MNXVOIDGSA-N 0.000 description 2
- DRNMNLKUUKKPIA-HTUGSXCWSA-N Gln-Phe-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)CCC(N)=O)C(O)=O DRNMNLKUUKKPIA-HTUGSXCWSA-N 0.000 description 2
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 2
- OJGLIOXAKGFFDW-SRVKXCTJSA-N Glu-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N OJGLIOXAKGFFDW-SRVKXCTJSA-N 0.000 description 2
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 2
- PXHABOCPJVTGEK-BQBZGAKWSA-N Glu-Gln-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O PXHABOCPJVTGEK-BQBZGAKWSA-N 0.000 description 2
- VOORMNJKNBGYGK-YUMQZZPRSA-N Glu-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N VOORMNJKNBGYGK-YUMQZZPRSA-N 0.000 description 2
- ZGEJRLJEAMPEDV-SRVKXCTJSA-N Glu-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N ZGEJRLJEAMPEDV-SRVKXCTJSA-N 0.000 description 2
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 2
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 description 2
- RPLLQZBOVIVGMX-QWRGUYRKSA-N Gly-Asp-Phe Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RPLLQZBOVIVGMX-QWRGUYRKSA-N 0.000 description 2
- AAHSHTLISQUZJL-QSFUFRPTSA-N Gly-Ile-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AAHSHTLISQUZJL-QSFUFRPTSA-N 0.000 description 2
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 2
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 2
- LPHQAFLNEHWKFF-QXEWZRGKSA-N Gly-Met-Ile Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LPHQAFLNEHWKFF-QXEWZRGKSA-N 0.000 description 2
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 2
- UMBDRSMLCUYIRI-DVJZZOLTSA-N Gly-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)CN)O UMBDRSMLCUYIRI-DVJZZOLTSA-N 0.000 description 2
- UIQGJYUEQDOODF-KWQFWETISA-N Gly-Tyr-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 UIQGJYUEQDOODF-KWQFWETISA-N 0.000 description 2
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 2
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 2
- 239000004471 Glycine Substances 0.000 description 2
- RXVOMIADLXPJGW-GUBZILKMSA-N His-Asp-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O RXVOMIADLXPJGW-GUBZILKMSA-N 0.000 description 2
- ZZLWLWSUIBSMNP-CIUDSAMLSA-N His-Asp-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZZLWLWSUIBSMNP-CIUDSAMLSA-N 0.000 description 2
- JFFAPRNXXLRINI-NHCYSSNCSA-N His-Asp-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JFFAPRNXXLRINI-NHCYSSNCSA-N 0.000 description 2
- YXASFUBDSDAXQD-UWVGGRQHSA-N His-Met-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O YXASFUBDSDAXQD-UWVGGRQHSA-N 0.000 description 2
- RNAYRCNHRYEBTH-IHRRRGAJSA-N His-Met-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O RNAYRCNHRYEBTH-IHRRRGAJSA-N 0.000 description 2
- FJWYJQRCVNGEAQ-ZPFDUUQYSA-N Ile-Asn-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N FJWYJQRCVNGEAQ-ZPFDUUQYSA-N 0.000 description 2
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 2
- PPSQSIDMOVPKPI-BJDJZHNGSA-N Ile-Cys-Leu Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O PPSQSIDMOVPKPI-BJDJZHNGSA-N 0.000 description 2
- MTFVYKQRLXYAQN-LAEOZQHASA-N Ile-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O MTFVYKQRLXYAQN-LAEOZQHASA-N 0.000 description 2
- LBRCLQMZAHRTLV-ZKWXMUAHSA-N Ile-Gly-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LBRCLQMZAHRTLV-ZKWXMUAHSA-N 0.000 description 2
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 2
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 2
- UAELWXJFLZBKQS-WHOFXGATSA-N Ile-Phe-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O UAELWXJFLZBKQS-WHOFXGATSA-N 0.000 description 2
- SAEWJTCJQVZQNZ-IUKAMOBKSA-N Ile-Thr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SAEWJTCJQVZQNZ-IUKAMOBKSA-N 0.000 description 2
- QGXQHJQPAPMACW-PPCPHDFISA-N Ile-Thr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QGXQHJQPAPMACW-PPCPHDFISA-N 0.000 description 2
- JSLIXOUMAOUGBN-JUKXBJQTSA-N Ile-Tyr-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N JSLIXOUMAOUGBN-JUKXBJQTSA-N 0.000 description 2
- QSXSHZIRKTUXNG-STECZYCISA-N Ile-Val-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QSXSHZIRKTUXNG-STECZYCISA-N 0.000 description 2
- 241000712431 Influenza A virus Species 0.000 description 2
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 2
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 2
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 2
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 2
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 2
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 2
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 2
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 2
- OXKYZSRZKBTVEY-ZPFDUUQYSA-N Leu-Asn-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OXKYZSRZKBTVEY-ZPFDUUQYSA-N 0.000 description 2
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 2
- JQSXWJXBASFONF-KKUMJFAQSA-N Leu-Asp-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JQSXWJXBASFONF-KKUMJFAQSA-N 0.000 description 2
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 2
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 2
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 2
- AUNMOHYWTAPQLA-XUXIUFHCSA-N Leu-Met-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AUNMOHYWTAPQLA-XUXIUFHCSA-N 0.000 description 2
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 2
- WBRJVRXEGQIDRK-XIRDDKMYSA-N Leu-Trp-Ser Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 WBRJVRXEGQIDRK-XIRDDKMYSA-N 0.000 description 2
- YIRIDPUGZKHMHT-ACRUOGEOSA-N Leu-Tyr-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YIRIDPUGZKHMHT-ACRUOGEOSA-N 0.000 description 2
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 2
- ABHIXYDMILIUKV-CIUDSAMLSA-N Lys-Asn-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ABHIXYDMILIUKV-CIUDSAMLSA-N 0.000 description 2
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 2
- FGMHXLULNHTPID-KKUMJFAQSA-N Lys-His-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CN=CN1 FGMHXLULNHTPID-KKUMJFAQSA-N 0.000 description 2
- XREQQOATSMMAJP-MGHWNKPDSA-N Lys-Ile-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XREQQOATSMMAJP-MGHWNKPDSA-N 0.000 description 2
- GOVDTWNJCBRRBJ-DCAQKATOSA-N Lys-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N GOVDTWNJCBRRBJ-DCAQKATOSA-N 0.000 description 2
- VVURYEVJJTXWNE-ULQDDVLXSA-N Lys-Tyr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O VVURYEVJJTXWNE-ULQDDVLXSA-N 0.000 description 2
- MDDUIRLQCYVRDO-NHCYSSNCSA-N Lys-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN MDDUIRLQCYVRDO-NHCYSSNCSA-N 0.000 description 2
- QXEVZBXTDTVPCP-GMOBBJLQSA-N Met-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCSC)N QXEVZBXTDTVPCP-GMOBBJLQSA-N 0.000 description 2
- DRINJBAHUGXNFC-DCAQKATOSA-N Met-Asp-His Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O DRINJBAHUGXNFC-DCAQKATOSA-N 0.000 description 2
- IUYCGMNKIZDRQI-BQBZGAKWSA-N Met-Gly-Ala Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O IUYCGMNKIZDRQI-BQBZGAKWSA-N 0.000 description 2
- WWWGMQHQSAUXBU-BQBZGAKWSA-N Met-Gly-Asn Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O WWWGMQHQSAUXBU-BQBZGAKWSA-N 0.000 description 2
- HZVXPUHLTZRQEL-UWVGGRQHSA-N Met-Leu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O HZVXPUHLTZRQEL-UWVGGRQHSA-N 0.000 description 2
- OSZTUONKUMCWEP-XUXIUFHCSA-N Met-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC OSZTUONKUMCWEP-XUXIUFHCSA-N 0.000 description 2
- TUZSWDCTCGTVDJ-PJODQICGSA-N Met-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 TUZSWDCTCGTVDJ-PJODQICGSA-N 0.000 description 2
- LBSWWNKMVPAXOI-GUBZILKMSA-N Met-Val-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O LBSWWNKMVPAXOI-GUBZILKMSA-N 0.000 description 2
- 241000699666 Mus <mouse, genus> Species 0.000 description 2
- 101100364971 Mus musculus Scai gene Proteins 0.000 description 2
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 2
- 241000702437 Parvovirus H3 Species 0.000 description 2
- MQWISMJKHOUEMW-ULQDDVLXSA-N Phe-Arg-His Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 MQWISMJKHOUEMW-ULQDDVLXSA-N 0.000 description 2
- QCHNRQQVLJYDSI-DLOVCJGASA-N Phe-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 QCHNRQQVLJYDSI-DLOVCJGASA-N 0.000 description 2
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 2
- RJYBHZVWJPUSLB-QEWYBTABSA-N Phe-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N RJYBHZVWJPUSLB-QEWYBTABSA-N 0.000 description 2
- BFYHIHGIHGROAT-HTUGSXCWSA-N Phe-Glu-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFYHIHGIHGROAT-HTUGSXCWSA-N 0.000 description 2
- JJHVFCUWLSKADD-ONGXEEELSA-N Phe-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O JJHVFCUWLSKADD-ONGXEEELSA-N 0.000 description 2
- PPHFTNABKQRAJV-JYJNAYRXSA-N Phe-His-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PPHFTNABKQRAJV-JYJNAYRXSA-N 0.000 description 2
- NHHZWPNMYQUNEH-ACRUOGEOSA-N Phe-Tyr-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N NHHZWPNMYQUNEH-ACRUOGEOSA-N 0.000 description 2
- MTHRMUXESFIAMS-DCAQKATOSA-N Pro-Asn-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O MTHRMUXESFIAMS-DCAQKATOSA-N 0.000 description 2
- WFHYFCWBLSKEMS-KKUMJFAQSA-N Pro-Glu-Phe Chemical compound N([C@@H](CCC(=O)O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 WFHYFCWBLSKEMS-KKUMJFAQSA-N 0.000 description 2
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 2
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 2
- ZXLUWXWISXIFIX-ACZMJKKPSA-N Ser-Asn-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZXLUWXWISXIFIX-ACZMJKKPSA-N 0.000 description 2
- MOVJSUIKUNCVMG-ZLUOBGJFSA-N Ser-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)O MOVJSUIKUNCVMG-ZLUOBGJFSA-N 0.000 description 2
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 2
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 2
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 2
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 2
- FOOZNBRFRWGBNU-DCAQKATOSA-N Ser-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N FOOZNBRFRWGBNU-DCAQKATOSA-N 0.000 description 2
- NQZFFLBPNDLTPO-DLOVCJGASA-N Ser-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CO)N NQZFFLBPNDLTPO-DLOVCJGASA-N 0.000 description 2
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 2
- MHVXPTAMDHLTHB-IHPCNDPISA-N Ser-Phe-Trp Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 MHVXPTAMDHLTHB-IHPCNDPISA-N 0.000 description 2
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 2
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 2
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 2
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 2
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 2
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 2
- 229930006000 Sucrose Natural products 0.000 description 2
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 2
- 230000005867 T cell response Effects 0.000 description 2
- 101150006914 TRP1 gene Proteins 0.000 description 2
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 2
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 2
- JTEICXDKGWKRRV-HJGDQZAQSA-N Thr-Asn-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JTEICXDKGWKRRV-HJGDQZAQSA-N 0.000 description 2
- ASJDFGOPDCVXTG-KATARQTJSA-N Thr-Cys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O ASJDFGOPDCVXTG-KATARQTJSA-N 0.000 description 2
- ADPHPKGWVDHWML-PPCPHDFISA-N Thr-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N ADPHPKGWVDHWML-PPCPHDFISA-N 0.000 description 2
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 2
- 239000004473 Threonine Substances 0.000 description 2
- LVTKHGUGBGNBPL-UHFFFAOYSA-N Trp-P-1 Chemical compound N1C2=CC=CC=C2C2=C1C(C)=C(N)N=C2C LVTKHGUGBGNBPL-UHFFFAOYSA-N 0.000 description 2
- ABRICLFKFRFDKS-IHPCNDPISA-N Trp-Ser-Tyr Chemical compound C([C@H](NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=C(O)C=C1 ABRICLFKFRFDKS-IHPCNDPISA-N 0.000 description 2
- ZKVANNIVSDOQMG-HKUYNNGSSA-N Trp-Tyr-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)NCC(=O)O)N ZKVANNIVSDOQMG-HKUYNNGSSA-N 0.000 description 2
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 2
- MICSYKFECRFCTJ-IHRRRGAJSA-N Tyr-Arg-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O MICSYKFECRFCTJ-IHRRRGAJSA-N 0.000 description 2
- MNWINJDPGBNOED-ULQDDVLXSA-N Tyr-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 MNWINJDPGBNOED-ULQDDVLXSA-N 0.000 description 2
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 2
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 2
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 2
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 2
- 108010041407 alanylaspartic acid Proteins 0.000 description 2
- 108010070944 alanylhistidine Proteins 0.000 description 2
- 238000000137 annealing Methods 0.000 description 2
- 230000000890 antigenic effect Effects 0.000 description 2
- 229940009098 aspartate Drugs 0.000 description 2
- 108010038633 aspartylglutamate Proteins 0.000 description 2
- 108010068265 aspartyltyrosine Proteins 0.000 description 2
- 238000003556 assay Methods 0.000 description 2
- 230000004071 biological effect Effects 0.000 description 2
- 239000000969 carrier Substances 0.000 description 2
- 238000006243 chemical reaction Methods 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 239000013599 cloning vector Substances 0.000 description 2
- 239000002299 complementary DNA Substances 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 2
- 235000013601 eggs Nutrition 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- 239000012530 fluid Substances 0.000 description 2
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 2
- 239000000499 gel Substances 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 229930195712 glutamate Natural products 0.000 description 2
- 108010010147 glycylglutamine Proteins 0.000 description 2
- 108010036413 histidylglycine Proteins 0.000 description 2
- 108010025306 histidylleucine Proteins 0.000 description 2
- 230000028993 immune response Effects 0.000 description 2
- 238000002649 immunization Methods 0.000 description 2
- 239000002198 insoluble material Substances 0.000 description 2
- 229960000310 isoleucine Drugs 0.000 description 2
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 2
- 108010060857 isoleucyl-valyl-tyrosine Proteins 0.000 description 2
- 108010087810 leucyl-seryl-glutamyl-leucine Proteins 0.000 description 2
- 108010038320 lysylphenylalanine Proteins 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 239000001788 mono and diglycerides of fatty acids Substances 0.000 description 2
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 2
- 239000002953 phosphate buffered saline Substances 0.000 description 2
- 230000028327 secretion Effects 0.000 description 2
- 239000011780 sodium chloride Substances 0.000 description 2
- 239000003381 stabilizer Substances 0.000 description 2
- 239000005720 sucrose Substances 0.000 description 2
- 201000010740 swine influenza Diseases 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 108010061238 threonyl-glycine Proteins 0.000 description 2
- 108010044292 tryptophyltyrosine Proteins 0.000 description 2
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 2
- 239000004474 valine Substances 0.000 description 2
- 238000001262 western blot Methods 0.000 description 2
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- YYGNTYWPHWGJRM-UHFFFAOYSA-N (6E,10E,14E,18E)-2,6,10,15,19,23-hexamethyltetracosa-2,6,10,14,18,22-hexaene Chemical compound CC(C)=CCCC(C)=CCCC(C)=CCCC=C(C)CCC=C(C)CCC=C(C)C YYGNTYWPHWGJRM-UHFFFAOYSA-N 0.000 description 1
- LRKPDXSVQHEAJR-PMVMPFDFSA-N 2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-phenylpropanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-3-(1h-indol-3-yl)propanoyl]amino]acetic acid Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 LRKPDXSVQHEAJR-PMVMPFDFSA-N 0.000 description 1
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- AXAVXPMQTGXXJZ-UHFFFAOYSA-N 2-aminoacetic acid;2-amino-2-(hydroxymethyl)propane-1,3-diol Chemical compound NCC(O)=O.OCC(N)(CO)CO AXAVXPMQTGXXJZ-UHFFFAOYSA-N 0.000 description 1
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 1
- 108010061559 ACTH (7-10) Proteins 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 1
- QQACQIHVWCVBBR-GVARAGBVSA-N Ala-Ile-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QQACQIHVWCVBBR-GVARAGBVSA-N 0.000 description 1
- 108700028369 Alleles Proteins 0.000 description 1
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- OLISTMZJGQUOGS-GMOBBJLQSA-N Asn-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OLISTMZJGQUOGS-GMOBBJLQSA-N 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 125000001433 C-terminal amino-acid group Chemical group 0.000 description 1
- 238000011748 CB6F1 mouse Methods 0.000 description 1
- CURLTUGMZLYLDI-UHFFFAOYSA-N Carbon dioxide Chemical compound O=C=O CURLTUGMZLYLDI-UHFFFAOYSA-N 0.000 description 1
- 102000029816 Collagenase Human genes 0.000 description 1
- 108060005980 Collagenase Proteins 0.000 description 1
- DCXGXDGGXVZVMY-GHCJXIJMSA-N Cys-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CS DCXGXDGGXVZVMY-GHCJXIJMSA-N 0.000 description 1
- XZKJEOMFLDVXJG-KATARQTJSA-N Cys-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CS)N)O XZKJEOMFLDVXJG-KATARQTJSA-N 0.000 description 1
- 239000004375 Dextrin Substances 0.000 description 1
- 229920001353 Dextrin Polymers 0.000 description 1
- 108090000204 Dipeptidase 1 Proteins 0.000 description 1
- 102000002322 Egg Proteins Human genes 0.000 description 1
- 108010000912 Egg Proteins Proteins 0.000 description 1
- 108010013369 Enteropeptidase Proteins 0.000 description 1
- 102100029727 Enteropeptidase Human genes 0.000 description 1
- 108010074860 Factor Xa Proteins 0.000 description 1
- 241000712469 Fowl plague virus Species 0.000 description 1
- 108010010803 Gelatin Proteins 0.000 description 1
- SXGMGNZEHFORAV-IUCAKERBSA-N Gln-Lys-Gly Chemical compound C(CCN)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N SXGMGNZEHFORAV-IUCAKERBSA-N 0.000 description 1
- 102000003886 Glycoproteins Human genes 0.000 description 1
- 108090000288 Glycoproteins Proteins 0.000 description 1
- 241000702620 H-1 parvovirus Species 0.000 description 1
- 206010069767 H1N1 influenza Diseases 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- 101500027527 Homo sapiens Transforming growth factor alpha Proteins 0.000 description 1
- AVXURJPOCDRRFD-UHFFFAOYSA-N Hydroxylamine Chemical compound ON AVXURJPOCDRRFD-UHFFFAOYSA-N 0.000 description 1
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 1
- 229940124873 Influenza virus vaccine Drugs 0.000 description 1
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 1
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 1
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 1
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- IFMPDNRWZZEZSL-SRVKXCTJSA-N Leu-Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O IFMPDNRWZZEZSL-SRVKXCTJSA-N 0.000 description 1
- YSPZCHGIWAQVKQ-AVGNSLFASA-N Lys-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN YSPZCHGIWAQVKQ-AVGNSLFASA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 102000012750 Membrane Glycoproteins Human genes 0.000 description 1
- 108010090054 Membrane Glycoproteins Proteins 0.000 description 1
- 102000016943 Muramidase Human genes 0.000 description 1
- 108010014251 Muramidase Proteins 0.000 description 1
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 1
- 229930193140 Neomycin Natural products 0.000 description 1
- 101710141454 Nucleoprotein Proteins 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 235000019483 Peanut oil Nutrition 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- DFEVBOYEUQJGER-JURCDPSOSA-N Phe-Ala-Ile Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O DFEVBOYEUQJGER-JURCDPSOSA-N 0.000 description 1
- IMNVAOPEMFDAQD-NHCYSSNCSA-N Pro-Val-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IMNVAOPEMFDAQD-NHCYSSNCSA-N 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 108010076504 Protein Sorting Signals Proteins 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 241000235070 Saccharomyces Species 0.000 description 1
- XSYJDGIDKRNWFX-SRVKXCTJSA-N Ser-Cys-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XSYJDGIDKRNWFX-SRVKXCTJSA-N 0.000 description 1
- NIOYDASGXWLHEZ-CIUDSAMLSA-N Ser-Met-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O NIOYDASGXWLHEZ-CIUDSAMLSA-N 0.000 description 1
- XNXRTQZTFVMJIJ-DCAQKATOSA-N Ser-Met-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNXRTQZTFVMJIJ-DCAQKATOSA-N 0.000 description 1
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 241000187747 Streptomyces Species 0.000 description 1
- 210000001744 T-lymphocyte Anatomy 0.000 description 1
- 108020005038 Terminator Codon Proteins 0.000 description 1
- BHEOSNUKNHRBNM-UHFFFAOYSA-N Tetramethylsqualene Natural products CC(=C)C(C)CCC(=C)C(C)CCC(C)=CCCC=C(C)CCC(C)C(=C)CCC(C)C(C)=C BHEOSNUKNHRBNM-UHFFFAOYSA-N 0.000 description 1
- 108090000190 Thrombin Proteins 0.000 description 1
- 102000006747 Transforming Growth Factor alpha Human genes 0.000 description 1
- 101800004564 Transforming growth factor alpha Proteins 0.000 description 1
- 229920004890 Triton X-100 Polymers 0.000 description 1
- 239000013504 Triton X-100 Substances 0.000 description 1
- GHXXDFDIDHIEIL-WFBYXXMGSA-N Trp-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N GHXXDFDIDHIEIL-WFBYXXMGSA-N 0.000 description 1
- SAKLWFSRZTZQAJ-GQGQLFGLSA-N Trp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N SAKLWFSRZTZQAJ-GQGQLFGLSA-N 0.000 description 1
- 108090000631 Trypsin Proteins 0.000 description 1
- 102000004142 Trypsin Human genes 0.000 description 1
- ADECJAKCRKPSOR-ULQDDVLXSA-N Tyr-His-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O ADECJAKCRKPSOR-ULQDDVLXSA-N 0.000 description 1
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 1
- 108010067390 Viral Proteins Proteins 0.000 description 1
- 108020000999 Viral RNA Proteins 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 230000001154 acute effect Effects 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 235000010419 agar Nutrition 0.000 description 1
- 210000004712 air sac Anatomy 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 125000003295 alanine group Chemical group N[C@@H](C)C(=O)* 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- PNEYBMLMFCGWSK-UHFFFAOYSA-N aluminium oxide Inorganic materials [O-2].[O-2].[O-2].[Al+3].[Al+3] PNEYBMLMFCGWSK-UHFFFAOYSA-N 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 230000003466 anti-cipated effect Effects 0.000 description 1
- 239000003963 antioxidant agent Substances 0.000 description 1
- 239000007864 aqueous solution Substances 0.000 description 1
- 239000007900 aqueous suspension Substances 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- -1 aromatic amino acids Chemical class 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 230000002238 attenuated effect Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 102000006635 beta-lactamase Human genes 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 235000011089 carbon dioxide Nutrition 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 239000002738 chelating agent Substances 0.000 description 1
- 229960005091 chloramphenicol Drugs 0.000 description 1
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 1
- 229960002424 collagenase Drugs 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 229910052593 corundum Inorganic materials 0.000 description 1
- 230000009260 cross reactivity Effects 0.000 description 1
- ATDGTVJJHBUTRL-UHFFFAOYSA-N cyanogen bromide Chemical compound BrC#N ATDGTVJJHBUTRL-UHFFFAOYSA-N 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- 210000001151 cytotoxic T lymphocyte Anatomy 0.000 description 1
- 235000019425 dextrin Nutrition 0.000 description 1
- 235000005911 diet Nutrition 0.000 description 1
- 230000037213 diet Effects 0.000 description 1
- 239000003085 diluting agent Substances 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- PRAKJMSDJKAYCZ-UHFFFAOYSA-N dodecahydrosqualene Natural products CC(C)CCCC(C)CCCC(C)CCCCC(C)CCCC(C)CCCC(C)C PRAKJMSDJKAYCZ-UHFFFAOYSA-N 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 210000003278 egg shell Anatomy 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 238000000855 fermentation Methods 0.000 description 1
- 230000004151 fermentation Effects 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- 239000008273 gelatin Substances 0.000 description 1
- 229920000159 gelatin Polymers 0.000 description 1
- 235000019322 gelatine Nutrition 0.000 description 1
- 235000011852 gelatine desserts Nutrition 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 229940074045 glyceryl distearate Drugs 0.000 description 1
- 229940075507 glyceryl monostearate Drugs 0.000 description 1
- 125000003630 glycyl group Chemical group [H]N([H])C([H])([H])C(*)=O 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 210000002443 helper t lymphocyte Anatomy 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 230000001900 immune effect Effects 0.000 description 1
- 108700032552 influenza virus INS1 Proteins 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 239000012139 lysis buffer Substances 0.000 description 1
- 229960000274 lysozyme Drugs 0.000 description 1
- 239000004325 lysozyme Substances 0.000 description 1
- 235000010335 lysozyme Nutrition 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 229960004927 neomycin Drugs 0.000 description 1
- 231100000252 nontoxic Toxicity 0.000 description 1
- 230000003000 nontoxic effect Effects 0.000 description 1
- 239000004006 olive oil Substances 0.000 description 1
- 235000008390 olive oil Nutrition 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 239000000312 peanut oil Substances 0.000 description 1
- 239000001814 pectin Substances 0.000 description 1
- 235000010987 pectin Nutrition 0.000 description 1
- 229920001277 pectin Polymers 0.000 description 1
- 239000008024 pharmaceutical diluent Substances 0.000 description 1
- 239000000825 pharmaceutical preparation Substances 0.000 description 1
- 108010073101 phenylalanylleucine Proteins 0.000 description 1
- 235000020004 porter Nutrition 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 239000000047 product Substances 0.000 description 1
- 230000009145 protein modification Effects 0.000 description 1
- 238000003259 recombinant expression Methods 0.000 description 1
- 238000010188 recombinant method Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 208000023504 respiratory system disease Diseases 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 239000003352 sequestering agent Substances 0.000 description 1
- 239000008159 sesame oil Substances 0.000 description 1
- 235000011803 sesame oil Nutrition 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 238000000527 sonication Methods 0.000 description 1
- 229940031439 squalene Drugs 0.000 description 1
- TUHBEKDERLKLEC-UHFFFAOYSA-N squalene Natural products CC(=CCCC(=CCCC(=CCCC=C(/C)CCC=C(/C)CC=C(C)C)C)C)C TUHBEKDERLKLEC-UHFFFAOYSA-N 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 230000008685 targeting Effects 0.000 description 1
- 230000001225 therapeutic effect Effects 0.000 description 1
- 229960004072 thrombin Drugs 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 230000014621 translational initiation Effects 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- 239000012588 trypsin Substances 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
- 210000002845 virion Anatomy 0.000 description 1
- 229910001845 yogo sapphire Inorganic materials 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/40—Fusion polypeptide containing a tag for immunodetection, or an epitope for immunisation
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2760/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses negative-sense
- C12N2760/00011—Details
- C12N2760/16011—Orthomyxoviridae
- C12N2760/16111—Influenzavirus A, i.e. influenza A virus
- C12N2760/16122—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2760/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses negative-sense
- C12N2760/00011—Details
- C12N2760/16011—Orthomyxoviridae
- C12N2760/16211—Influenzavirus B, i.e. influenza B virus
- C12N2760/16222—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
Definitions
- the present invention relates generally to a polypeptide useful in a composition for providing
- Influenza virus infection causes acute respiratory disease in man, horses, swine and fowl, sometimes of pandemic proportions. Influenza viruses are orthomyxoviruses and, as such, have envelope virions of 80 to 120 nanometers in diameter, with two different glycoprotein spikes. Three types, A, B and C, infect humans. Type A viruses have been responsible for the majority of human epidemics in modern history, although there are also sporadic outbreaks of Type B infections. Known swine, equine and avian viruses have mostly been Type A, although Type C viruses have also been isolated from swine.
- Type A viruses are divided into subtypes based on the antigenic properties of the hemagglutinin (HA) and neuraminidase (NA) surface glycoproteins.
- HA hemagglutinin
- NA neuraminidase
- subtypes H1 swine flu
- H2 asian flu
- H3 Hong Kong flu
- swine flu the predominant influenza A subtypes are H1 and H3; in horses, H3 and H7; and in avians, H5 and H7.
- avians H5 and H7.
- Type B virus Presently only one Type B virus has been identified, with no subtypes.
- the present invention provides compositions containing, and methods for use of, a protein which is capable of inducing protection in animals and avians against challenge with more than one strain of influenza type A and influenza type B.
- one aspect of the invention provides a DNA sequence encoding a modified purified recombinant protein.
- the DNA sequence of the invention encodes a modified protein sequence derived from the HA2 subunit of a selected hemagglutinin (HA) protein.
- HA hemagglutinin
- the sequence is derived from an H3N2 subtype influenza virus. These H3N2 fusion proteins are capable of inducing T cell responses in the absence of neutralizing antibodies.
- a DNA sequence of this invention encodes a modified protein sequence derived from the HA2 subunit from a type B influenza virus. Still further embodiments include DNA sequences obtained as described for the two above virus, where the sequences are derived from other Type A
- influenza strains infecting animals as well as humans include, without limitation, Type A subtypes of H1, H2, H3, H4, H5, H6 and H7.
- the invention provides a DNA sequence encoding a recombinant fusion protein, in which the desired Type A subtype HA2 subunit sequence or a portion thereof, is fused in frame to another protein or protein fragment capable of enhancing expression of the fusion protein.
- One embodiment includes the H3N2 subtype HA2 subunit sequence described above fused in frame to another protein or fragment capable of enhancing
- a fusion protein comprises a type B HA2 sequence, described above, or a portion thereof, fused in frame to another protein or protein fragment capable of enhancing expression of the fusion protein. Still other Type A subtype HA2 sequences can be similarly used. It is desirable that this fusion partner protein be an influenza protein sequence or fragment thereof.
- a protein encoded by a DNA sequence of the invention is provided.
- the protein may be a protein sequence derived from the HA2 subunit of a hemagglutinin (HA) protein from a selected Type A subtype virus. Desirably the subtype virus is an H3N2.
- the protein may be derived from the HA subunit from a type B influenza virus.
- H5 or H7 subtypes include H5 or H7 subtypes.
- preferred embodiments include fusion proteins comprising a protein sequence derived from the HA2 subunit of an HA protein from a Type A virus, e.g., an H3N2 subtype, or from a type B virus fused in frame to a selected
- influenza sequence The proteins of this invention are particularly useful in inducing protection in mammals, especially humans, against challenge by type B or an H3N2 subtype of influenza A.
- the proteins employing other Type A subtypes, e.g., H5 and H7, are useful in inducing protection in animals against influenza viruses.
- the invention provides a vaccine composition containing a purified protein of the invention, as described above.
- a vaccine composition containing a purified protein of the invention, as described above.
- composition may include a fusion protein of the
- the vaccine compositions contain an H3HA2 protein of the invention and other influenza antigens; a type B HA2 protein of the invention and other influenza antigens; or both an H3HA2 protein, a BHA2 protein and other influenza antigens.
- a combination vaccine of the invention will contain an H3HA2 and a BHA2 protein of the invention in combination with influenza antigens derived from the other type A influenza virus subtypes, H1 and H2.
- An embodiment for use in animals may contain an H5HA2 or H7HA2 protein, among others.
- a further aspect of this invention is a method for inducing in an animal protection against influenza type A, influenza type B, influenza type C, or
- Still a further aspect of this invention is a method for inducing in an animal protection against multiple strains of influenza types A and B which
- Fig. 1 illustrates the nucleic acid sequences of the HA2 portions of (a) A/Udorn [SEQ ID NO: 1], (b) A/Victoria [SEQ ID NO: 3], (c) A/PR/8/34 [SEQ ID NO: 5], and (d) a consensus sequence [SEQ ID NO: 7]. Dashes indicate the same nucleotide as the consensus sequence. Different nucleotides from that of the consensus sequence are reported in lower case letters. Dots indicate no corresponding nucleotide when compared to the consensus sequence.
- Fig. 2 illustrates the nucleic acid and amino acid sequences of NS1 (1-81) H3HA2 (1-221) fusion protein [SEQ ID NO: 9 & 10].
- Fig. 3 illustrates the nucleic acid and amino acid sequences of the NS1 (1-81) H3HA2 (77-221) fusion protein [SEQ ID NO: 11 & 12].
- Fig. 4 illustrates the nucleic acid and amino acid sequences of the type B fusion protein, NS1 1-42 HA2 41-223 . [SEQ ID NO: 13 & 14]. Detailed Description of the Invention
- the present invention provides novel proteins, DNA sequences, pharmaceutical vaccine compositions and methods of use thereof for conferring protection in vaccinated mammals against one strain, or desirably multiple strains, of influenza viruses.
- the proteins and vaccine compositions of the present invention demonstrate the ability to stimulate or produce a protective immune response which is capable of recognizing an influenza virus or influenza virus-infected cells and protecting the vaccinated mammal against disease caused thereby.
- This protective response is desirably a T cell response, produced in the substantial absence of vaccine-induced neutralizing antibody.
- H3HA2 and BHA2 sequences originating from viral strains to which humans are susceptible
- similar sequences and molecules can be prepared for veterinary applications.
- selected HA2 sequences obtained from type A viral strains e.g., H5HA2, H7HA2 and other strains of interest may be obtained following the teachings described herein for the exemplified H3HA2 and BHA2 sequences.
- H5HA2, H7HA2 and other strains of interest may be obtained following the teachings described herein for the exemplified H3HA2 and BHA2 sequences.
- this invention is not limited to the exemplified protein and DNA sequences, even though the following disclosure is limited to the two latter sequences for simplicity.
- Such additional viral HA2 subunits are expected to share the biological
- this invention provides a protein or fragment thereof characterized by an amino acid sequence derived from the HA2 subunit of a hemagglutinin (HA) protein, e.g., from a H3N2 subtype virus.
- HA hemagglutinin
- proteins of the invention are capable of inducing T helper cells, particularly cytotoxic T lymphocytes, in the absence of neutralizing antibodies.
- H3N2 subtype strains of influenza A include A/Udorn and
- influenza A may also produce HA proteins for use in vaccine compositions according to this invention.
- Fig. 1 compares the nucleic acid sequences of the HA2 portions of the A/Udorn [SEQ ID NO: 1] and A/Victoria [SEQ ID NO: 3] strains with the nucleic acid sequence of an H1N1 subtype virus, A/PR/8/34 [SEQ ID NO: 5].
- a consensus sequence [SEQ ID NO: 7] was computer generated, and may likewise be useful in producing proteins according to this invention. This consensus sequence [SEQ ID NO: 7] can be constructed by a commercially available
- Proteins according to this invention may include unfused HA2 subunits of the influenza A viruses, particularly H3N2 subtype.
- H3N2 subtype For example, in one
- a protein of the invention contains amino acids 1-221 of a selected H3HA2 subunit. In another embodiment, a protein of the invention contains amino acids 77-221 of the H3HA2 subunit. Other fragments of this HA2 amino acid sequence characterized by the ability to stimulate similar immunological activity in an
- immunized animal are also encompassed by this invention.
- Proteins of this invention also include fusion proteins comprising a protein sequence derived from the HA2 subunit of an HA protein from a Type A virus, e.g., an H3N2 subtype virus, fused in frame to another protein or protein fragment capable of enhancing expression of the fusion protein.
- this fusion "partner" protein be an influenza protein sequence or fragment thereof derived from the same or another strain of influenza virus as the HA protein or protein fragment.
- this fusion partner protein is all or a portion of the influenza virus NS1 gene or an HA2
- the NS1 portion of the fusion protein is derived from an H1N1 subtype virus, A/PR/8/34.
- H1N1 subtype virus A/PR/8/34.
- the NS1 portion may comprise amino acid residues 1 to 42 of H1NS1. In another embodiment the NS1 portion may comprise amino acid residues 1 to 81 of the selected virus.
- the HA2 fragment may alternatively be fused to a portion of the NS1 peptide derived from a selected Type A virus, e.g., an H3 subtype virus (H3HA2), or a type B (BHA2) virus.
- H3HA2 H3 subtype virus
- BHA2 type B virus
- non-influenza fusion proteins may also produce desirable fusion proteins with the H3N2, or other Type A, or type B protein or portion thereof.
- the HA2 fragment may be fused to any peptide capable of enhancing its expression in the host cell selected.
- a fusion "partner" protein or fragment taking into account the desired host cell and utilizing the teachings herein.
- the fusion proteins of the present invention are not limited by the selection of the "partner" protein or fragment to which the HA2 fragment is fused.
- the present invention provides a modified protein containing a portion of the HA2 subunit of a type B influenza virus.
- a type B influenza virus Currently, the preferred human virus strain is B/Lee/40.
- the vaccinal proteins of this invention are not limited to this type B strain, and other strains
- HA2 protein infecting other species, or other as yet unidentified type B virus strains, may be used to produce the HA2 protein.
- type B HA2 proteins may be fused, as described above for the H3HA2 proteins of this invention, or remain unfused. In the construction of a fusion protein
- a linker sequence may be inserted optionally between the two fused sequences, i.e., between the NS1 portion and the HA2 portion.
- This optional linker may provide space between the two linked sequences.
- this linker sequence may encode, if desired, a polypeptide which is selectively cleavable or digestible by conventional chemical or enzymatic methods.
- the selected cleavage site may be an enzymatic cleavage site, including sites for cleavage by a proteolytic enzyme, such as
- enterokinase factor Xa
- trypsin trypsin
- collagenase and
- the cleavage site in the linker may be a site capable of being cleaved upon exposure to a selected chemical, e.g., cyanogen bromide or
- cleavage site if inserted into a linker useful in the fusion sequences of this invention, does not limit this invention. Any desired cleavage site, of which many are known in the art, may be used for this purpose.
- a presently preferred example of a fusion protein of this invention is NS1 (1-81) H3HA2 (1-221) [SEQ ID NO: 10], which comprises the first 81 amino acids of NS1 fused to amino acid 1 to 221 of the H3HA2 subunit (amino acids 1-221).
- Another exemplary fusion protein, NS1 (1 - 81) H3HA2 (77-221) [SEQ ID NO: 12] comprises the first 81 amino acids of NS1 fused to amino acid 77 to 221 of the
- H3HA2 proteins Yet another preferred example of a fusion protein of this invention is NS1 1-42 BHA2 41-223 [SEQ ID NO: 14], which comprises the first 42 amino acids of NS1 fused to amino acids 41 to 223 of the truncated BHA2 subunit.
- SEQ ID NO: 14 comprises the first 42 amino acids of NS1 fused to amino acids 41 to 223 of the truncated BHA2 subunit.
- the NS1 (1-81) H3HA2 (1-221) protein [SEQ ID NO: 10] of the invention has a three-dimensional structure which is substantially similar to that of the NS1 (1-81) HA2 (1-222) protein [SEQ ID NO: 16] derived from the H1N1 subtype virus
- the amino acid sequence of the NS1 (1- 81) H3HA2 (1-221) protein [SEQ ID NO: 10] has only approximately 50% homology with the amino acid sequence of C13 protein [SEQ ID NO: 16].
- the nucleic acid sequence of the H3HA2 1-221 fragment derived from A/Udorn (nucleotides 25-560 from that virus) [SEQ ID NO: 1] has only approximately 60% homology with the nucleic acid sequence of the H1HA2 1-222 protein derived from strain A/PR/8/34 (nucleotides 1872-2407 from A/PR/8/34) [SEQ ID NO: 5].
- nucleic acid sequence of H3HA2 1-221 from A/Udorn (nucleotides 1-499 of A/Udorn) [SEQ ID NO: 1] has approximately 99% homology with the nucleic acid sequence of H3HA2 1-221 from A/Victoria/H3/75 (nucleotides 1226-1725 of A/Victoria) [SEQ ID NO: 3]
- Analogs of the HA2 peptides from a Type A virus, e.g., an H3, or B viruses, included within the definition of this invention, include truncated
- polypeptides including fragments
- HA2 polypeptides e.g. mutants that retain the epitopes and thus the biological activity of HA2. It is anticipated that, because the NS1 portion of the fusion peptide provides a means of expressing the protein at high levels and does not appear to play as significant a role in the
- analogs of the HA2 peptides and/or the fusion partner differ by only 1 to about 4 codon changes.
- Other examples of analogs include
- polypeptides with minor amino acid variations from the natural amino acid sequence of HA2 in particular, conservative amino acid replacements.
- Conservative replacements are those that take place within a family of amino acids that are related in their side chains.
- isoleucine or valine an aspartate with a glutamate, a threonine with a serine, or a similar conservative replacement of an amino acid with a structurally related amino acid will not have a significant effect on its activity, especially if the replacement does not involve an amino acid at an epitope of the HA2 polypeptide.
- the HA2 portion of the fusion peptide e.g., H3HA2 1-221 , H3HA2 77-221 and
- BHA2 41-223 confers the majority of the necessary epitopes for antibody binding or T cell (particularly CTL)
- the present invention also encompasses DNA sequences of this invention encoding the above-described proteins and fusion proteins, the sequences characterized by having an immunogenic determinant of a modified HA2 subunit of an HA protein, derived from a Type A virus, e.g., an H3 subtype, or type B virus.
- a Type A virus e.g., an H3 subtype, or type B virus.
- sequences of this invention encode such HA2 subunits, optionally fused to a DNA sequence encoding a protein or peptide which is capable of enhancing expression of the protein in a selected host cell.
- the consensus sequence illustrated in Fig. 1(d) may provide a source of HA2 DNA.
- the currently preferred embodiment provides a DNA sequence encoding a Type A virus, e.g., an H3 or type B HA2 protein or fragment thereof fused in frame to a DNA sequence encoding a portion of the
- N nonstructural influenza protein 1
- Coding sequences for the HA2, NS1 and other viral proteins of influenza virus can be prepared
- influenza viruses including other strains, subtypes and types, are
- DNA sequences encoding the H3HA2 or BHA2 protein sequences are also included in the present invention, as well as analogs or derivatives thereof.
- DNA sequences which code for H3 or other Type A or type B HA2 proteins of the invention but which differ in codon sequence due to the degeneracies of the genetic code or variations in the DNA sequence encoding H3HA2, other Type A or BHA2 proteins which are caused by point mutations or by induced modifications to enhance the activity, half-life or production of the peptide encoded thereby are also encompassed in the invention.
- DNA sequences which hybridize under stringent conditions with the DNA sequences encoding the HA2 subunit proteins e.g., H3HA2 or BHA2 proteins
- DNA sequences which hybridize under non-stringent conditions with the disclosed sequences, but which encode proteins or fragments retaining the biological activities of the H3HA2 or BHA2 proteins are also included in this
- the fusion proteins of the invention may be prepared by conventional genetic engineering and
- proteins may be purified from expression in host cell or vector systems by conventional means.
- microorganisms and cells including, for example, E.
- E. coli Bacillus, Streptomyces, Saccharomyces, mammalian and insect cells, are known and available from private and public laboratories and depositories and from commercial vendors.
- the preferred host is E. coli
- polypeptide employed in the presently preferred embodiment is N-(2-aminoethyl)-2-aminoethyl-N-(2-aminoethyl)-2-aminoethyl-N-(2-aminoethyl)-2-aminoethyl-N-(2-aminoethyl)-2-aminoethyl-N-(2-aminoethyl)-2-aminoethyl-N-(2-aminoethyl)-2-aminoethyl-N-(2-aminoethyl)-2-aminoethyl-N-(2-aminoethyl)-2-aminoethyl-N-(2-aminoethyl)-2-aminoethyl-N-(2-aminoethyl)-2-aminoethyl-N-(2-aminoethyl)-2-aminoethyl-N
- a preferred method of production employs an alternative expression system in which the ⁇ -lactamase coding sequence is wholly or partially replaced by a coding sequence for an alternative selectable marker such as, for example, kanamycin or chloramphenicol.
- H3 or other Type A subunit or type B HA2 peptides or fusion protein To aid in expression of the H3 or other Type A subunit or type B HA2 peptides or fusion protein
- these protein sequences or fragments thereof may also be fused to a polypeptide capable of enhancing expression of these fragments in the selected host system.
- a polypeptide capable of enhancing expression of these fragments in the selected host system.
- a peptide would contain a leader sequence fragment that provides for secretion of the Type A subunit fragment, e.g., the H3HA2 fragment, or type B HA2 fragment in the host cell.
- the leader sequence fragment that provides for secretion of the Type A subunit fragment, e.g., the H3HA2 fragment, or type B HA2 fragment in the host cell.
- sequence fragment typically encodes a signal peptide comprised of hydrophobic amino acids which direct the secretion of the protein from the cell.
- a promoter sequence may be linked directly with the DNA molecule encoding the HA2 fragment.
- Such polypeptides, promoter and leader sequences are known to those of skill in the art and may be readily selected for expression in the selected host.
- the present invention is therefore not limited to any particular expression system or vector, nor to any particular purification process from cell lysates or cell medium.
- proteins and fusion proteins of this invention may be employed in vaccine compositions.
- compositions of this invention therefore, contain an effective immunogenic amount of a selected HA2 protein, e.g., H3HA2 or BHA2 protein, of the invention in admixture with a suitable adjuvant in a nontoxic and sterile pharmaceutically acceptable carrier.
- a selected HA2 protein e.g., H3HA2 or BHA2 protein
- Suitable carriers for vaccine use are well known to those of skill in the art.
- exemplary carriers include sterile saline, lactose, sucrose, calcium phosphate, gelatin, dextrin, agar, pectin, peanut oil, olive oil, sesame oil, squalene and water.
- the carrier or diluent may include a time delay material, such as glyceryl monostearate or glyceryl distearate alone or with a wax.
- suitable chemical stabilizers may be used to improve the stability of the pharmaceutical preparation. Suitable chemical stabilizers are well known to those of skill in the art and include, for example, citric acid and other agents to adjust pH, chelating or sequestering agents, and
- Vaccine compositions of this invention may employ an immunogenic amount of a purified recombinant protein as described above.
- a preferred embodiment of the vaccine of the invention is composed of an aqueous suspension or solution containing the recombinant HA2 protein molecule, e.g., H3HA2 or BHA2, together with an adjuvant, preferably an aluminum, most preferably
- a preferred protein for use in these vaccine compositions includes a protein comprising amino acid residues 1 to 81 from NS1 fused to C-terminal amino acid residues 1-221 from the hemagglutinin subunit 2 (HA2) from influenza A, subtype H3N2.
- HA2 hemagglutinin subunit 2
- preferred vaccine composition of this invention employs a purified recombinant protein made up of amino acid residues 1 to 81 from NS1 fused to amino acid residues
- Still another preferred vaccine composition of this invention employs a purified recombinant protein made up of amino acid residues 1 to 42 fused to amino acid residues 41-223 of the HA2 from influenza B.
- Vaccine compositions of the invention may also employ an immunogenic amount of a recombinant protein of the invention in combination with other influenza
- Suitable influenza antigens for combination in a vaccine composition with the proteins of this invention may be derived from type A, H1 subtype viruses and may include the recombinant fusion proteins described in detail in copending U. S. Patent Application Ser. No.
- suitable H1 subtype immunogenic proteins include C13 (NS1 (1-81) -D-L-S-R-HA2 (1-222) ) [SEQ ID NO: 15 & 16], D (NS1 (1-81) -Q-I-P-HA2 (65-222) ) [SEQ ID NO: 17 & 18], C13 short (NS1 (1-42) -M-D-L-S-R-HA2 (1-222) ) [SEQ ID NO: 19 & 20], D short (NS1 (1-42) -M-D-H-M-L-T-S-T-R-S-HA2 (66-222) )
- H1 proteins consist of unfused polypeptides, such as H1HA2 66-222 [SEQ ID NO: 33 & 34] which is disclosed in copending U. S. Patent Application Ser. No. 07/751,898, incorporated herein by reference.
- one desirable combination vaccine to provide protection against Type A influenza contains NS1 (1-81) H3HA2 (1-221) protein [SEQ ID NO: 9 & 10] of the invention, one or more proteins derived from subtype H1N1 as described above, and an aluminum
- a combination vaccine of the invention will contain an immunogenic amount of the H3 fusion protein of the invention in combination with immunogenic amounts of influenza antigens derived from the other type A influenza virus subtypes, including among others, H1, H2, H3, H4, H5, H6 and H7 as well as a type B fusion protein of the invention.
- other preferred combination vaccines would include the NS1 (1- 81) H3HA2 (77-221) protein [SEQ ID NO: 11 & 12] in combination with one or more additional influenza antigens derived from the type or subtype influenza viruses described above.
- the combination vaccine will protect against influenza infections caused by both type A and type B influenza viruses.
- Still other combination vaccine compositions will employ other proteins described herein.
- compositions of the present invention are advantageously made up in a dose unit form adapted for the desired mode of administration.
- Each unit will contain, at a minimum, a predetermined quantity of the selected HA2 subunit protein, e.g., H3HA2 protein and/or BHA2 protein, and adjuvant calculated to produce the desired therapeutic effect in optional association with a pharmaceutical diluent, carrier, or vehicle.
- Dosage protocol can be optimized in accordance with standard vaccination practices.
- the vaccine will be administered intramuscularly, although other routes of administration may be used, such as intradermal. It is expected that an effective
- immunogenic amount of a protein, fusion protein or combination of proteins of this invention for average adult humans is in the range of 1 to 1000 micrograms.
- Another desirable immunogenic amount ranges between 50 to 500 micrograms.
- the proteins of the invention are in admixture with the same amount or more adjuvant to form a vaccine composition.
- While the proteins described herein have been particularly developed for use in humans (e.g., the H3HA2 and BHA2 sequences), it is expected that due to species cross-reactivity, these vaccines will be useful in other animals, particularly swine. Additionally, similar molecules can be prepared for equine and avian veterinary applications utilizing the HA2 proteins from other strains to which animals are susceptible. Combination vaccines for use in swine would preferably include protections against both H1 and H3 viruses. Combination vaccines for use in equine would preferably include protection against H3 and H7 viruses. Combination vaccines for use in avian species would preferably confer protection against H5 and H7 viruses. Appropriate dosages can be determined by one skilled in veterinary medicine.
- the specific effective immunogenic amount for any particular patient will depend upon a variety of factors including the age, general health, sex, and diet of the vaccinee; the species of the vaccinee; the time of administration; the route of administration; interactions with any other drugs being administered; and the degree of protection being sought.
- the vaccine can be administered initially in late summer or early fall and can be readministered two to six weeks later, if desirable, or periodically as immunity wanes, for example, every two to five years.
- the administration can be repeated at suitable intervals if necessary or desirable.
- Plasmid pFV88 contains the entire 221 amino acid length HA from A/Udorn, an H3 subtype virus [C. J. Lai et al, Proc. Natl. Acad. Sci. USA. 77:210-214
- HA nucleic acid sequence is illustrated in Fig. 1 [SEQ ID NO: 1].
- This plasmid was cut with Pst I.
- the resulting plasmid is termed pMS3 or pMS3H3HA.
- Plasmid pAPR801 is a pBR322-derived cloning vector which carries the NS1 coding region (A/PR/8/34). It is described by Young et al, in The Origin of Pandemic Influenza Viruses, ed. by W. G. Laver, Elsevier Science Publishing Co. (1983).
- Plasmid pAS1 is a pBR322-derived expression vector which contains the P L promoter, an N utilization site (to relieve transcriptional polarity effects in the presence of N protein) and the ell ribosome binding site including the ell translation initiation codon followed immediately by a BamHI site. It is described by
- Plasmid pAS1 ⁇ EH was prepared by deleting a non-essential EcoRI-HindIII region of pBR322 origin from pAS1.
- the resulting plasmid, pAS1 ⁇ EH/801 expresses authentic NS1 (230 amino acids).
- the plasmid has an NcoI site between the codons for amino acids 81 and 82 and an NruI site 3' to the NS sequences.
- the BamHI site between amino acids 1 and 2 is retained.
- Plasmid pMG27N a pAS1 derivative [ Mol . Cell. Biol., 5:1015-1024 (1985)] was cut with BamHI and SacI and ligated to a BamHI/NcoI fragment encoding the first 81 amino acids of NS1 from pAS1 ⁇ EH801 and a synthetic DNA NcoI/SacI fragment of the following sequence:
- Synthetic oligonucleotides were annealed to generate an NcoI 5' overhang sequence (at the 5' end) and a HhaI 3' overhang sequence (at the 3' end).
- SEQ ID NO: 37 5' -CATGGGCGCCCATATGGGCATATTCGGCG-3'
- SEQ ID NO: 38 3'- CCGCGGGTATACCCGTATAAGCC -5'
- the annealing reaction was performed as follows.
- the annealing mixture was made up of 2.5 ⁇ L each of 5' oligo (1.3 ⁇ g/ ⁇ L), the 3' oligo (1.2 ⁇ g/ ⁇ L), and added water (15 ⁇ L) to a final volume of 20 ⁇ L.
- the reaction tubes were then placed in 4 mL culture tubes containing water which had been heated to 65°C for 10 minutes and allowed to cool down slowly. The tubes were then put on ice and used immediately for ligation.
- This three part ligation generates pMG1H3HA2 (1-221) [SEQ ID NO: 9] which codes for the first 81 amino acids of NS1 fused to four amino acids donated from the linker and amino acids 1-221 of the HA2 subunit. This sequence is illustrated in Fig. 2 [SEQ ID NO: 9 & 10]. This molecule is also designated NS1 (1-81) H3HA2 (1-221) [SEQ ID NO: 9 & 10]. EXAMPLE 4 - NS1 (1-81) H3HA2 (77-221) [SEQ ID NO: 11 & 12]
- pMS3H3HA described in Example 1 above, was digested with EcoRI and end-filled (Klenow).
- the vector was digested with XbaI.
- a 487 bp fragment which contains the coding sequence for amino acids 77-221 of the HA2 subunit, was isolated and ligated to the HpaI and XbaI sites of pMG1.
- the resulting vector codes for a fusion polypeptide containing amino acids 1- 81 of NS1 fused to amino acids 77-221 of the HA2 subunit. This molecule has been termed NS1 (1-81) H3HA2 ⁇ 77-221) and is illustrated in Fig. 3 [SEQ ID NO: 11 & 12].
- pMG1 was digested with BamHI and NcoI and ligated to the BamHI/NcoI fragment encoding amino acids 2 to 42 of NS1 from pNS1 42 TGF ⁇ .
- pNS1 42 TGF ⁇ is derived when pASl ⁇ EH801 is cut with NcoI and SalI and ligated to a synthetic DNA encoding human TGF ⁇ as an NcoI/SalI fragment.
- pNS1 42 TGF ⁇ encodes a protein
- NS1 comprised of the first 42 amino acids of NS1 and the mature TGF ⁇ sequence.
- the NS1 portion of pNS1 42 TGF ⁇ contains an amino acid change from Cys to Ser at amino acid #13.
- pMG 42 A The resulting plasmid, termed pMG 42 A, was then modified to contain an alternative synthetic linker after the NS1 42 sequence with a different set of restriction enzyme sites within which to insert foreign DNA fragments into the three reading frames after the NS1 42 .
- This linker has the following sequence:
- pMG 42 B The resulting plasmid is called pMG 42 B.
- This vector is needed to contain the neomycin phosphotransferase-1 (NPT- 1) gene which confers kanamycin resistance.
- pOTS207 is a pAS derived cloning vector which carries the kanamycin resistance gene from Tn903 [Berg et al, Microbiology, ed. D.
- the pOTS207 was digested with EcoRI and PstI, and the 1467 bp fragment containing the kanamycin
- SEQ ID NO: 41 5' AATTCGTACCTA 3'
- pMG 42 B was digested with BglII and PstI.
- the EcoRI/PstI NPT-1 gene fragment and the synthetic oligo linker were ligated to the digested pMG 42 B.
- the resulting plasmid, pMG 47 Kn allows fusions, in three different reading frames, to the NS 1-42 gene, while allowing antibiotic selection with kanamycin.
- Plasmid pBHA is a pBR322-derived vector, containing the complete nucleotide sequence of the hemagglutinin (HA) gene of a type B influenza virus (B/Lee/40). It is described by Krystal et al, Proc.
- pBHA was digested with Rsal and a 813 bp fragment containing the HA subunit was isolated. This fragment was ligated into plasmid pMG 42 Kn (described above) that had been digested with ScaI. During the cloning, a base (T) was deleted from the ScaI recognition site shifting the gene out of the reading frame. The vector was digested with NcoI, and filled-in using Klenow, putting the gene back into the reading frame.
- the resulting construct expresses a fusion polypeptide containing amino acids 1-42 of NS1 and 41-233 of the HA2 subunit.
- This construct contains the Cys to Ser change at amino acid #13 of the NS1 portion of the fusion peptide.
- the seed virus, A/Udorn was prepared according to the procedures described in P. Palese and J. Schulman, Virol., 57:227-237 (1974). Briefly, this technique is as follows. Influenza virus strain A/Udorn was inoculated in 10-day old embryonated hen's eggs into the allantoic cavity. The eggs were incubated for 24-48 hours at 35°C then chilled at 4°C overnight. A portion of the eggshell over the airsac was removed and the allantoic fluid was aseptically removed using a 10-ml syringe. The fluid was centrifuged at low speed (3,000 ⁇ g) to remove
- Antisera was prepared as follows. 100-200 micrograms of purified virus in complete Freund's
- the plasmid pMG1H3HA2 (1-221) [SEQ ID NO: 9] was transfected into E. coli strain AR58 [SmithKline Beecham Pharmaceuticals]. Cultures were grown at 32°C to mid-log phase at which time cultures were shifted to 39.5°C for 2 hours. The E. coli cell pellets containing the
- the plasmid encoding the NS1 (1-81) H3HA2 (77-221) peptide [SEQ ID NO: 11 & 12] was expressed as described in part A above. Production of this peptide was confirmed by
- the pellet was resuspended by sonication in 50 mM glycine pH 10.0, 5% glycerol, 2 mM EDTA and then the suspension was treated with 1% Triton X-100 [J.T. Baker Chemicals Co.] at 4°C for 60 minutes and
- the resulting pellet was solubilized in 50 mM Tris, 8 M urea, pH 8.0 and centrifuged to remove any insoluble material. This solubilized material is dialyzed against 10 mM Tris, 1 mM EDTA, pH 8.0 followed, again, by centrifugation of insoluble material.
- the solubilized material is designated as "crude” material and is used in in vitro and in vivo mouse assays. At this point, the material is approximately 40 - 50% pure.
- the "crude” material was electrophoresed through an SDS-PAGE and the appropriate H3HA2 protein bands were visualized by KCl staining according to D. Hager et al, Anal. Biochem. 109:76-86 (1980). The band was cut-out and eluted electrophoretically by the "S&S Elutrap Electro-Separation System” [Schleicher &
- the electro-eluting buffer was the Tris-glycine.
- a concentrated and eluted sample was obtained and exhaustively dialyzed against 0.01 M NH 4 HCO 3 and 0.02% SDS [M. Hunkapiller et al, Method. Enzymol., 91:227-236 (1983)]. This sample was frozen quickly by dry ice and lyophilized to complete dryness. The lyophilized
- the protein is usually greater than 75% pure.
- mice (NIH/Swiss; 15 per group) were vaccinated subcutaneously with 50 or 10 ⁇ g NS1 (1-81) H3HA2 (1-221) [SEQ ID NO: 9 & 10] in aluminum hydroxide on days 0 and 21. The mice were boosted intraperitoneally on day 42 with the protein without adjuvant. On day 47, mice were challenged intranasally with 2 - 3 LD 50 doses of either A/PR/8/34 (H1N1) or A/HK/68 (H3N2) virus, and survival was monitored through day 21.
- A/PR/8/34 H1N1
- H3N2 A/HK/68 virus
- mice vaccinated with NS1 (1-81) H3HA2 (1-221) [SEQ ID NO: 10] and challenged with A/HK/68 (80-93%) was significantly higher than in control mice which were injected with adjuvant only (26% survival).
- vaccination with NS1 ⁇ 1- 81) H3HA2 (1-221) [SEQ ID NO: 10] did not confer protection against challenge with A/PR/8/34, an H1N1 strain (0-26% survival).
- protection elicited by NS1 (1-81) H3HA2 (1 . 221) [SEQ ID NO: 10] is selective for antigenically diverse virus strains within the H3 subtype.
- NS1 (1-81) HA2 (65-222) [SEQ ID NO: 18], derived from the H1N1 subtype) elicits protection from heterosubtypic challenge with H1N1, but not the H3N2 subtype [S Dillon et al,
- mice 5:A1362 (abs. 5749 and Table 1].
- mice were challenged with A/HK/68 (H3N2) on day 47, four weeks after the second injection.
- Control mice were immunized as described above for Table 1, where an ip injection was given at week 6 (5 days prior to challenge).
- the results in Table 2 show that CB6F 1 mice (15 per group) were significantly protected when challenged with the A/HK/68 heterologous H3 virus strain 5-28 days after the last injection.
- mice CB6F 1 were divided randomly into six groups, with fifteen in each group. The mice were injected subcutaneously with proteins in Al +3 (100 ⁇ g) on days 0 and 21, and then were challenged with 2-3 LD 50 doses of virus on day 49. Survival was monitored through day 21. The results of this study are illustrated in
- H3C13 NS1 1-81 H3HA2 1-221 is referred to as H3C13 in the table below.
- mice immunized with a mixture of the D protein and H3C13 protein in aluminum adjuvant were protected against challenge with either
- mice immunized with the D protein were protected against H1 but not H3 challenge. Likewise, mice immunized with the
- MOLECULE TYPE DNA (genomic)
- AGC ACT CAA GCA GCC ATC GAC CAA
- ATC 144 Gly Gln Ala Ala Asp Leu Lys Ser Thr Gln Ala Ala Ile Asp Gln Ile
- GAG CTT CTT GTC GCT CTG GAG AAC CAA CAT ACA ATT GAT CTG ACT GAC 336 Glu Leu Leu Val Ala Leu Glu Asn Gln His Thr Ile Asp Leu Thr Asp
- MOLECULE TYPE DNA (genomic)
- AGC ACT CAA GCA GCC ATC GAC CAA
- ATC 144 Gly Gln Ala Ala Asp Leu Lys Ser Thr Gln Ala Ala Ile Asp Gln Ile
- GAG CTT CTT GTC GCT CTG GAG AAC CAA CAT ACA ATT GAT CTG ACT GAC 336 Glu Leu Leu Val Ala Leu Glu Asn Gln His Thr Ile Asp Leu Thr Asp
- MOLECULE TYPE DNA (genomic)
- GGT CTA TTT GGA GCC ATT GCC
- GGG GGA TGG ACT GGA 48 Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly Gly Trp Thr Gly
- MOLECULE TYPE DNA (genomic)
- MOLECULE TYPE DNA (genomic)
- xi SEQUENCE DESCRIPTION: SEQ ID NO: 8:
- MOLECULE TYPE DNA (genomic)
- ATC AGA AAT GGG ACT TAT GAC CAT GAT GTA TAC AGA GAC GAA GCA TTA 528 Ile Arg Asn Gly Thr Tyr Asp His Asp Val Tyr Arg Asp Glu Ala Leu
- MOLECULE TYPE DNA (genomic)
- MOLECULE TYPE DNA (genomic)
- MOLECULE TYPE DNA (genomic)
- ATC TAC TCA ACT GTC GCC AGT TCA CTG GTG CTT TTG GTC TCC CTG GGG 672 Ile Tyr Ser Thr Val Ala Ser Ser Leu Val Leu Leu Val Ser Leu Gly
- MOLECULE TYPE DNA (genomic)
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Biophysics (AREA)
- Biochemistry (AREA)
- Gastroenterology & Hepatology (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Virology (AREA)
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
- Peptides Or Proteins (AREA)
Abstract
This invention provides vaccine compositions capable of conferring multi-strain immunity against influenza A and influenza B.
Description
VACCINAL POLYPEPTIDES
This is a continuation-in-part of pending
United States patent application Serial Number 751,896; which is a continuation-in-part of United States patent application Serial Number 387,558; which is a
continuation-in-part of United States patent application Serial Number 238,801, now abandoned; which is a
continuation-in-part of United States patent application Serial Number 645,732, now abandoned.
Field of the Invention
The present invention relates generally to a polypeptide useful in a composition for providing
immunity against influenza A and influenza B in an animal.
Background of the Invention
Influenza virus infection causes acute respiratory disease in man, horses, swine and fowl, sometimes of pandemic proportions. Influenza viruses are orthomyxoviruses and, as such, have envelope virions of 80 to 120 nanometers in diameter, with two different
glycoprotein spikes. Three types, A, B and C, infect humans. Type A viruses have been responsible for the majority of human epidemics in modern history, although there are also sporadic outbreaks of Type B infections. Known swine, equine and avian viruses have mostly been Type A, although Type C viruses have also been isolated from swine.
The Type A viruses are divided into subtypes based on the antigenic properties of the hemagglutinin (HA) and neuraminidase (NA) surface glycoproteins.
Within type A, subtypes H1 ("swine flu"), H2 ("asian flu") and H3 ("Hong Kong flu") are predominant in human infections. In swine, the predominant influenza A subtypes are H1 and H3; in horses, H3 and H7; and in avians, H5 and H7. Presently only one Type B virus has been identified, with no subtypes.
Genetic "drift" or "shift", i.e., rapid and unpredictable change in the antigen, occurs at
approximately yearly intervals, and affects antigenic determinants in the HA and NA proteins. Therefore, it has not been possible to prepare a "universal" influenza virus vaccine using conventional killed or attenuated viruses, that is, a vaccine which is non-strain specific.
Recently, attempts have been made to prepare such
universal, or semi-universal, vaccines from reassortant viruses prepared by crossing different strains. More recently, such attempts have involved recombinant DNA techniques focusing primarily on the HA protein.
There remains a need in the art for vaccine formulations and compositions capable of inducing
protective responses in animals against influenza
viruses. Summary of the Invention
The present invention provides compositions containing, and methods for use of, a protein which is capable of inducing protection in animals and avians against challenge with more than one strain of influenza type A and influenza type B.
Thus, one aspect of the invention provides a DNA sequence encoding a modified purified recombinant protein. The DNA sequence of the invention encodes a modified protein sequence derived from the HA2 subunit of a selected hemagglutinin (HA) protein. In one
embodiment, the sequence is derived from an H3N2 subtype influenza virus. These H3N2 fusion proteins are capable of inducing T cell responses in the absence of
neutralizing antibodies. In another embodiment, a DNA sequence of this invention encodes a modified protein sequence derived from the HA2 subunit from a type B influenza virus. Still further embodiments include DNA sequences obtained as described for the two above virus, where the sequences are derived from other Type A
influenza strains infecting animals as well as humans. Such virus include, without limitation, Type A subtypes of H1, H2, H3, H4, H5, H6 and H7.
In another aspect, the invention provides a DNA sequence encoding a recombinant fusion protein, in which the desired Type A subtype HA2 subunit sequence or a portion thereof, is fused in frame to another protein or protein fragment capable of enhancing expression of the fusion protein. One embodiment includes the H3N2 subtype HA2 subunit sequence described above fused in frame to another protein or fragment capable of enhancing
expression thereof. Another embodiment of such a fusion protein comprises a type B HA2 sequence, described above, or a portion thereof, fused in frame to another protein or protein fragment capable of enhancing expression of the fusion protein. Still other Type A subtype HA2 sequences can be similarly used. It is desirable that this fusion partner protein be an influenza protein sequence or fragment thereof.
In still another aspect a protein encoded by a DNA sequence of the invention is provided. The protein may be a protein sequence derived from the HA2 subunit of a hemagglutinin (HA) protein from a selected Type A subtype virus. Desirably the subtype virus is an H3N2. In another embodiment, the protein may be derived from the HA subunit from a type B influenza virus. Other embodiments include H5 or H7 subtypes. Additionally, preferred embodiments include fusion proteins comprising a protein sequence derived from the HA2 subunit of an HA protein from a Type A virus, e.g., an H3N2 subtype, or from a type B virus fused in frame to a selected
influenza sequence. The proteins of this invention are particularly useful in inducing protection in mammals, especially humans, against challenge by type B or an H3N2 subtype of influenza A. The proteins employing other Type A subtypes, e.g., H5 and H7, are useful in inducing protection in animals against influenza viruses.
In a further aspect the invention provides a vaccine composition containing a purified protein of the invention, as described above. Such a vaccine
composition may include a fusion protein of the
invention. In other embodiments of the invention, the vaccine compositions contain an H3HA2 protein of the invention and other influenza antigens; a type B HA2
protein of the invention and other influenza antigens; or both an H3HA2 protein, a BHA2 protein and other influenza antigens. In a preferred embodiment for human use, a combination vaccine of the invention will contain an H3HA2 and a BHA2 protein of the invention in combination with influenza antigens derived from the other type A influenza virus subtypes, H1 and H2. An embodiment for use in animals may contain an H5HA2 or H7HA2 protein, among others.
A further aspect of this invention is a method for inducing in an animal protection against influenza type A, influenza type B, influenza type C, or
combinations thereof, which comprises internally
administering to the animal an effective imraunogenic amount of a vaccine composition of the present invention.
Still a further aspect of this invention is a method for inducing in an animal protection against multiple strains of influenza types A and B which
comprises internally administering to the animal an effective immunogenic amount of a vaccine composition of the present invention.
Other aspects and advantages of the present invention are described further in the following detailed description of the preferred embodiments thereof.
Brief Description of the Drawings
Fig. 1 illustrates the nucleic acid sequences of the HA2 portions of (a) A/Udorn [SEQ ID NO: 1], (b) A/Victoria [SEQ ID NO: 3], (c) A/PR/8/34 [SEQ ID NO: 5], and (d) a consensus sequence [SEQ ID NO: 7]. Dashes indicate the same nucleotide as the consensus sequence. Different nucleotides from that of the consensus sequence are reported in lower case letters. Dots indicate no corresponding nucleotide when compared to the consensus sequence.
Fig. 2 illustrates the nucleic acid and amino acid sequences of NS1(1-81)H3HA2(1-221) fusion protein [SEQ ID NO: 9 & 10].
Fig. 3 illustrates the nucleic acid and amino acid sequences of the NS1(1-81)H3HA2(77-221) fusion protein [SEQ ID NO: 11 & 12].
Fig. 4 illustrates the nucleic acid and amino acid sequences of the type B fusion protein, NS11-42HA241-223. [SEQ ID NO: 13 & 14]. Detailed Description of the Invention
The present invention provides novel proteins, DNA sequences, pharmaceutical vaccine compositions and methods of use thereof for conferring protection in vaccinated mammals against one strain, or desirably
multiple strains, of influenza viruses. The proteins and vaccine compositions of the present invention demonstrate the ability to stimulate or produce a protective immune response which is capable of recognizing an influenza virus or influenza virus-infected cells and protecting the vaccinated mammal against disease caused thereby. This protective response is desirably a T cell response, produced in the substantial absence of vaccine-induced neutralizing antibody.
While the proteins and DNA sequences specifically described herein are directed to the H3HA2 and BHA2 sequences originating from viral strains to which humans are susceptible, it is expected that similar sequences and molecules can be prepared for veterinary applications. For example, selected HA2 sequences obtained from type A viral strains, e.g., H5HA2, H7HA2 and other strains of interest may be obtained following the teachings described herein for the exemplified H3HA2 and BHA2 sequences. One of skill in the art should understand that this invention is not limited to the exemplified protein and DNA sequences, even though the following disclosure is limited to the two latter sequences for simplicity. Such additional viral HA2 subunits are expected to share the biological
characteristics of the exemplified sequences.
Thus, this invention provides a protein or fragment thereof characterized by an amino acid sequence derived from the HA2 subunit of a hemagglutinin (HA) protein, e.g., from a H3N2 subtype virus. The H3
proteins of the invention are capable of inducing T helper cells, particularly cytotoxic T lymphocytes, in the absence of neutralizing antibodies. Among H3N2 subtype strains of influenza A include A/Udorn and
A/Victoria viruses. Other H3N2 virus strains of
influenza A may also produce HA proteins for use in vaccine compositions according to this invention. Fig. 1 compares the nucleic acid sequences of the HA2 portions of the A/Udorn [SEQ ID NO: 1] and A/Victoria [SEQ ID NO: 3] strains with the nucleic acid sequence of an H1N1 subtype virus, A/PR/8/34 [SEQ ID NO: 5]. A consensus sequence [SEQ ID NO: 7] was computer generated, and may likewise be useful in producing proteins according to this invention. This consensus sequence [SEQ ID NO: 7] can be constructed by a commercially available
computerized sequence analysis program, such as Genetics Computers Group [Univeristy of Wisconsin].
Proteins according to this invention may include unfused HA2 subunits of the influenza A viruses, particularly H3N2 subtype. For example, in one
embodiment, a protein of the invention contains amino
acids 1-221 of a selected H3HA2 subunit. In another embodiment, a protein of the invention contains amino acids 77-221 of the H3HA2 subunit. Other fragments of this HA2 amino acid sequence characterized by the ability to stimulate similar immunological activity in an
immunized animal are also encompassed by this invention.
Proteins of this invention also include fusion proteins comprising a protein sequence derived from the HA2 subunit of an HA protein from a Type A virus, e.g., an H3N2 subtype virus, fused in frame to another protein or protein fragment capable of enhancing expression of the fusion protein. It is desirable that this fusion "partner" protein be an influenza protein sequence or fragment thereof derived from the same or another strain of influenza virus as the HA protein or protein fragment. Preferably, this fusion partner protein is all or a portion of the influenza virus NS1 gene or an HA2
subunit.
In the embodiments exemplified herein, the NS1 portion of the fusion protein is derived from an H1N1 subtype virus, A/PR/8/34. For example, in one
embodiment, the NS1 portion may comprise amino acid residues 1 to 42 of H1NS1. In another embodiment the NS1 portion may comprise amino acid residues 1 to 81 of the selected virus. The HA2 fragment may alternatively be fused to a portion of the NS1 peptide derived from a
selected Type A virus, e.g., an H3 subtype virus (H3HA2), or a type B (BHA2) virus.
However, other non-influenza fusion proteins may also produce desirable fusion proteins with the H3N2, or other Type A, or type B protein or portion thereof. Thus, in still another alternative embodiment, as
discussed below, the HA2 fragment may be fused to any peptide capable of enhancing its expression in the host cell selected. One of skill in the art may readily select a fusion "partner" protein or fragment taking into account the desired host cell and utilizing the teachings herein. The fusion proteins of the present invention are not limited by the selection of the "partner" protein or fragment to which the HA2 fragment is fused.
In yet another embodiment, the present invention provides a modified protein containing a portion of the HA2 subunit of a type B influenza virus. Currently, the preferred human virus strain is B/Lee/40. However, the vaccinal proteins of this invention are not limited to this type B strain, and other strains
infecting other species, or other as yet unidentified type B virus strains, may be used to produce the HA2 protein. These type B HA2 proteins may be fused, as described above for the H3HA2 proteins of this invention, or remain unfused.
In the construction of a fusion protein
according to this invention, a linker sequence may be inserted optionally between the two fused sequences, i.e., between the NS1 portion and the HA2 portion. This optional linker may provide space between the two linked sequences. Alternatively, this linker sequence may encode, if desired, a polypeptide which is selectively cleavable or digestible by conventional chemical or enzymatic methods. For example, the selected cleavage site may be an enzymatic cleavage site, including sites for cleavage by a proteolytic enzyme, such as
enterokinase, factor Xa, trypsin, collagenase and
thrombin. Alternatively, the cleavage site in the linker may be a site capable of being cleaved upon exposure to a selected chemical, e.g., cyanogen bromide or
hydroxylamine. The cleavage site, if inserted into a linker useful in the fusion sequences of this invention, does not limit this invention. Any desired cleavage site, of which many are known in the art, may be used for this purpose.
A presently preferred example of a fusion protein of this invention is NS1(1-81)H3HA2(1-221) [SEQ ID NO: 10], which comprises the first 81 amino acids of NS1 fused to amino acid 1 to 221 of the H3HA2 subunit (amino acids 1-221). Another exemplary fusion protein, NS1(1- 81)H3HA2(77-221) [SEQ ID NO: 12], comprises the first 81 amino
acids of NS1 fused to amino acid 77 to 221 of the
truncated H3HA2 subunit. Yet another preferred example of a fusion protein of this invention is NS11-42BHA241-223 [SEQ ID NO: 14], which comprises the first 42 amino acids of NS1 fused to amino acids 41 to 223 of the truncated BHA2 subunit. These proteins, fusion proteins and similar proteins encoded by the below-described DNA sequences are referred to collectively herein as H3HA2 proteins.
The NS1(1-81)H3HA2(1-221) protein [SEQ ID NO: 10] of the invention has a three-dimensional structure which is substantially similar to that of the NS1(1-81)HA2(1-222) protein [SEQ ID NO: 16] derived from the H1N1 subtype virus
(C13). However, the amino acid sequence of the NS1(1- 81)H3HA2(1-221) protein [SEQ ID NO: 10] has only approximately 50% homology with the amino acid sequence of C13 protein [SEQ ID NO: 16]. Additionally, as illustrated in Fig. 1, the nucleic acid sequence of the H3HA21-221 fragment derived from A/Udorn (nucleotides 25-560 from that virus) [SEQ ID NO: 1] has only approximately 60% homology with the nucleic acid sequence of the H1HA21-222 protein derived from strain A/PR/8/34 (nucleotides 1872-2407 from A/PR/8/34) [SEQ ID NO: 5]. However, the nucleic acid sequence of H3HA21-221 from A/Udorn (nucleotides 1-499 of A/Udorn) [SEQ ID NO: 1] has approximately 99% homology with the nucleic acid sequence of H3HA21-221 from A/Victoria/H3/75
(nucleotides 1226-1725 of A/Victoria) [SEQ ID NO: 3]
[Fiers et al, Cell, 19:683-696 (1980)].
Analogs of the HA2 peptides from a Type A virus, e.g., an H3, or B viruses, included within the definition of this invention, include truncated
polypeptides (including fragments) and HA2 polypeptides, e.g. mutants that retain the epitopes and thus the biological activity of HA2. It is anticipated that, because the NS1 portion of the fusion peptide provides a means of expressing the protein at high levels and does not appear to play as significant a role in the
immunological responses to the HA2 fusion proteins as does the HA2 portion, any number of analogs of this fusion partner can be made.
Typically, the analogs of the HA2 peptides and/or the fusion partner differ by only 1 to about 4 codon changes. Other examples of analogs include
polypeptides with minor amino acid variations from the natural amino acid sequence of HA2; in particular, conservative amino acid replacements. Conservative replacements are those that take place within a family of amino acids that are related in their side chains.
Genetically encoded amino acids are generally divided into four families: (1) acidic = aspartate, glutamate; (2) basic = lysine, arginine, histidine; (3) non-polar = alanine, valine, leucine, isoleucine, proline,
phenylalanine, methionine, tryptophan; and (4) uncharged polar = glycine, asparagine, glutamine, cysteine, serine, threonine, tyrosine. Phenylalanine, tryptophan, and tyrosine are sometimes classified jointly as aromatic amino acids. For example, it is reasonable to expect that an isolated replacement of a leucine with an
isoleucine or valine, an aspartate with a glutamate, a threonine with a serine, or a similar conservative replacement of an amino acid with a structurally related amino acid will not have a significant effect on its activity, especially if the replacement does not involve an amino acid at an epitope of the HA2 polypeptide.
The construction of such analogs, given the description herein and conventional methods of protein modification known to one of skill in the art, are believed to be encompassed by this invention.
Currently, it is theorized that the HA2 portion of the fusion peptide (e.g., H3HA21-221, H3HA277-221 and
BHA241-223) confers the majority of the necessary epitopes for antibody binding or T cell (particularly CTL)
targeting. Once these epitope sequences are precisely identified, portions of the HA2 sequence which are not part of these epitopes may be altered without
significantly affecting the bioactivity of the fusion protein.
The present invention also encompasses DNA sequences of this invention encoding the above-described proteins and fusion proteins, the sequences characterized by having an immunogenic determinant of a modified HA2 subunit of an HA protein, derived from a Type A virus, e.g., an H3 subtype, or type B virus. Other DNA
sequences of this invention encode such HA2 subunits, optionally fused to a DNA sequence encoding a protein or peptide which is capable of enhancing expression of the protein in a selected host cell. For example, the consensus sequence illustrated in Fig. 1(d) may provide a source of HA2 DNA. The currently preferred embodiment provides a DNA sequence encoding a Type A virus, e.g., an H3 or type B HA2 protein or fragment thereof fused in frame to a DNA sequence encoding a portion of the
nonstructural influenza protein 1 (NS1).
Coding sequences for the HA2, NS1 and other viral proteins of influenza virus can be prepared
synthetically or can be derived from viral RNA or from available cDNA-containing plasmids by known techniques.
For example, in addition to the above-cited references, a DNA coding sequence for HA from the A/Japan/305/57 strain was cloned, sequenced and reported by Gething et al,
Nature, 287: 301-306 (1980). An HA coding sequence for strain A/NT/60/68 was cloned as reported by Sleigh et al, and by Both et al, in Developments in Cell Biology,
Elsevier Science Publishing Co., pages 69-79 and 81-89, respectively, (1980). An HA coding sequence for strain A/WSN/33 was cloned as reported by Davis et al, Gene.
10:205-218 (1980); and by Hiti et al, Virology. 111:113-124 (1981). An HA coding sequence for fowl plague virus was cloned as reported by Porter et al and by Emtage et al, both in Developments in Cell Biology, cited above, at pages 39-49 and 157-168. Also, influenza viruses, including other strains, subtypes and types, are
available from clinical specimens and from public
depositories, such as the American Type Culture
Collection (ATCC), Rockville, Maryland, U.S.A.
Allelic variations (naturally-occurring base changes in the species population which may or may not result in an amino acid change) of DNA sequences encoding the H3HA2 or BHA2 protein sequences are also included in the present invention, as well as analogs or derivatives thereof. Similarly, DNA sequences which code for H3 or other Type A or type B HA2 proteins of the invention but which differ in codon sequence due to the degeneracies of the genetic code or variations in the DNA sequence encoding H3HA2, other Type A or BHA2 proteins which are caused by point mutations or by induced modifications to enhance the activity, half-life or production of the peptide encoded thereby are also encompassed in the invention. Also covered by this invention are DNA
sequences which hybridize under stringent conditions with the DNA sequences encoding the HA2 subunit proteins, e.g., H3HA2 or BHA2 proteins, of this invention. DNA sequences which hybridize under non-stringent conditions with the disclosed sequences, but which encode proteins or fragments retaining the biological activities of the H3HA2 or BHA2 proteins, are also included in this
invention. Typical conditions for stringent or non-stringent hybridization are known to those of skill in the art. [See, e.g., Sambrook et al, Molecular Cloning. A Laboratory Manual, 2nd edition, Cold Spring Harbor Laboratory, NY (1989)].
The fusion proteins of the invention may be prepared by conventional genetic engineering and
recombinant techniques known to those of skill in the art. Similarly, the proteins may be purified from expression in host cell or vector systems by conventional means.
Systems for cloning and expression of the vaccinal polypeptide of this invention in various
microorganisms and cells, including, for example, E.
coli, Bacillus, Streptomyces, Saccharomyces, mammalian and insect cells, are known and available from private and public laboratories and depositories and from commercial vendors. The preferred host is E. coli
because it can be used to produce large amounts of
desired proteins safely and cheaply. The polypeptide employed in the presently preferred embodiment is
expressed in E. coli. To circumvent the requirement of ampicillin for plasmid selection in production
fermentations, a preferred method of production employs an alternative expression system in which the β-lactamase coding sequence is wholly or partially replaced by a coding sequence for an alternative selectable marker such as, for example, kanamycin or chloramphenicol.
To aid in expression of the H3 or other Type A subunit or type B HA2 peptides or fusion protein
described above, these protein sequences or fragments thereof may also be fused to a polypeptide capable of enhancing expression of these fragments in the selected host system. Ordinarily, such a peptide would contain a leader sequence fragment that provides for secretion of the Type A subunit fragment, e.g., the H3HA2 fragment, or type B HA2 fragment in the host cell. The leader
sequence fragment typically encodes a signal peptide comprised of hydrophobic amino acids which direct the secretion of the protein from the cell. There may be processing sites encoded between the leader sequence and the Type A subtype or type B HA2 fragment that can be cleaved either in vivo or in vitro. Alternatively, a promoter sequence may be linked directly with the DNA molecule encoding the HA2 fragment. Such polypeptides,
promoter and leader sequences are known to those of skill in the art and may be readily selected for expression in the selected host.
Construction of expression systems, including expression vectors and transformed host cells are thus within the art. See, generally, methods described in standard texts, such as Sambrook et al, Molecular Cloning A Laboratory Manual. 2d edit., Cold Spring Harbor
Laboratory, Cold Spring Harbor, NY (1989). The present invention is therefore not limited to any particular expression system or vector, nor to any particular purification process from cell lysates or cell medium.
The proteins and fusion proteins of this invention may be employed in vaccine compositions.
Pharmaceutical vaccine compositions of this invention, therefore, contain an effective immunogenic amount of a selected HA2 protein, e.g., H3HA2 or BHA2 protein, of the invention in admixture with a suitable adjuvant in a nontoxic and sterile pharmaceutically acceptable carrier.
Suitable carriers for vaccine use are well known to those of skill in the art. However, exemplary carriers include sterile saline, lactose, sucrose, calcium phosphate, gelatin, dextrin, agar, pectin, peanut oil, olive oil, sesame oil, squalene and water.
Additionally, the carrier or diluent may include a time delay material, such as glyceryl monostearate or glyceryl
distearate alone or with a wax. Optionally, suitable chemical stabilizers may be used to improve the stability of the pharmaceutical preparation. Suitable chemical stabilizers are well known to those of skill in the art and include, for example, citric acid and other agents to adjust pH, chelating or sequestering agents, and
antioxidants.
While any aluminum adjuvant may be used in the vaccine compositions of this invention, two desirable adjuvants are commercially marketed under the trademarks Rehsorptar [Armour Pharmaceuticals, Kankakee, IL] and Rehydragel [Reheis Chemical Co., Berkeley Heights, NJ]. These products are aluminum hydroxide gels which contain approximately 2% w/v Al2O3, which is equivalent to
approximately 10.6 mg/ml Al+3.
Vaccine compositions of this invention may employ an immunogenic amount of a purified recombinant protein as described above. A preferred embodiment of the vaccine of the invention is composed of an aqueous suspension or solution containing the recombinant HA2 protein molecule, e.g., H3HA2 or BHA2, together with an adjuvant, preferably an aluminum, most preferably
aluminum hydroxide, buffered at physiological pH, in a form ready for injection. A preferred protein for use in these vaccine compositions includes a protein comprising amino acid residues 1 to 81 from NS1 fused to C-terminal
amino acid residues 1-221 from the hemagglutinin subunit 2 (HA2) from influenza A, subtype H3N2. Another
preferred vaccine composition of this invention employs a purified recombinant protein made up of amino acid residues 1 to 81 from NS1 fused to amino acid residues
77-221 of the HA2 from influenza A, subtype H3N2. Still another preferred vaccine composition of this invention employs a purified recombinant protein made up of amino acid residues 1 to 42 fused to amino acid residues 41-223 of the HA2 from influenza B.
Vaccine compositions of the invention may also employ an immunogenic amount of a recombinant protein of the invention in combination with other influenza
antigens. Suitable influenza antigens for combination in a vaccine composition with the proteins of this invention may be derived from type A, H1 subtype viruses and may include the recombinant fusion proteins described in detail in copending U. S. Patent Application Ser. No.
07/387,200, filed July 28, 1989 and its corresponding European Patent Application No. 366, 238, published May 2, 1990; and in co-pending U. S. Patent Application Ser. No. 07/387,558, filed July 28, 1989 and its corresponding European Patent Application No. 366,239, published May 2, 1990. The C13 protein (NS1(1-81)HA2(1-222,) [SEQ ID NO: 15 & 16], D protein (NS1(1-80)HA2(65-222)) [SEQ ID NO: 17 & 18] and other fusion proteins derived from the H1N1 influenza
virus subtype and the recombinant expression and
purification thereof are disclosed in detail in these applications, and in the parent applications identified in this application, all of which are incorporated by reference herein.
More specifically, suitable H1 subtype immunogenic proteins include C13 (NS1(1-81)-D-L-S-R-HA2(1-222)) [SEQ ID NO: 15 & 16], D (NS1(1-81)-Q-I-P-HA2(65-222)) [SEQ ID NO: 17 & 18], C13 short (NS1(1-42)-M-D-L-S-R-HA2(1-222)) [SEQ ID NO: 19 & 20], D short (NS1(1-42)-M-D-H-M-L-T-S-T-R-S-HA2(66-222))
[SEQ ID NO: 21 & 22], A (NS1(1-81)-Q-I-P-HA2(69-222)) [SEQ ID NO: 23 & 24], C (NS1(1-81)-Q-I-P-HA2(81-222)) [SEQ ID NO: 25 & 26], ΔD (NS1(1-81)HA2(150-222)) [SEQ ID NO: 27], Δ13 (NS1(1-81)-D-L-S-R-HA2(1-70)-S-C-L-T-A-Y-H-R) [SEQ ID NO: 28], M (NS1(1-81)-Q-I-P-HA2(65-196)-G-G-S-Y-S-M-E-H-F-R-W-G-K-P-V) [SEQ ID NO: 29], ΔM (NS1(1-81)-Q-I-P-HA2(65-196)-G-G-S-Y-S-M-L-V-N) [SEQ ID NO: 30], ΔM+ (NS1(1-81)-Q-I-P-HA2(65-200)-L-V-L-L) [SEQ ID NO: 31 & 32], These H1N1 fusion proteins are described in published European Patent Application 366,238 and in copending U.S. Patent Application Ser. No. 07/751,896. Other suitable H1 proteins consist of unfused polypeptides, such as H1HA266-222 [SEQ ID NO: 33 & 34] which is disclosed in copending U. S. Patent Application Ser. No. 07/751,898, incorporated herein by reference. Thus, one desirable combination vaccine to provide protection against Type A
influenza contains NS1(1-81)H3HA2(1-221) protein [SEQ ID NO: 9 & 10] of the invention, one or more proteins derived from subtype H1N1 as described above, and an aluminum
adjuvant.
Preferably, a combination vaccine of the invention will contain an immunogenic amount of the H3 fusion protein of the invention in combination with immunogenic amounts of influenza antigens derived from the other type A influenza virus subtypes, including among others, H1, H2, H3, H4, H5, H6 and H7 as well as a type B fusion protein of the invention. Therefore, other preferred combination vaccines would include the NS1(1- 81)H3HA2(77-221) protein [SEQ ID NO: 11 & 12] in combination with one or more additional influenza antigens derived from the type or subtype influenza viruses described above. Thus, the combination vaccine will protect against influenza infections caused by both type A and type B influenza viruses. Still other combination vaccine compositions will employ other proteins described herein.
The compositions of the present invention are advantageously made up in a dose unit form adapted for the desired mode of administration. Each unit will contain, at a minimum, a predetermined quantity of the selected HA2 subunit protein, e.g., H3HA2 protein and/or
BHA2 protein, and adjuvant calculated to produce the desired therapeutic effect in optional association with a pharmaceutical diluent, carrier, or vehicle.
Dosage protocol can be optimized in accordance with standard vaccination practices. Typically, the vaccine will be administered intramuscularly, although other routes of administration may be used, such as intradermal. It is expected that an effective
immunogenic amount of a protein, fusion protein or combination of proteins of this invention for average adult humans is in the range of 1 to 1000 micrograms.
Another desirable immunogenic amount ranges between 50 to 500 micrograms. Most preferably, the proteins of the invention are in admixture with the same amount or more adjuvant to form a vaccine composition.
While the proteins described herein have been particularly developed for use in humans (e.g., the H3HA2 and BHA2 sequences), it is expected that due to species cross-reactivity, these vaccines will be useful in other animals, particularly swine. Additionally, similar molecules can be prepared for equine and avian veterinary applications utilizing the HA2 proteins from other strains to which animals are susceptible. Combination vaccines for use in swine would preferably include protections against both H1 and H3 viruses. Combination vaccines for use in equine would preferably include
protection against H3 and H7 viruses. Combination vaccines for use in avian species would preferably confer protection against H5 and H7 viruses. Appropriate dosages can be determined by one skilled in veterinary medicine.
It will be understood, however, that the specific effective immunogenic amount for any particular patient will depend upon a variety of factors including the age, general health, sex, and diet of the vaccinee; the species of the vaccinee; the time of administration; the route of administration; interactions with any other drugs being administered; and the degree of protection being sought.
The vaccine can be administered initially in late summer or early fall and can be readministered two to six weeks later, if desirable, or periodically as immunity wanes, for example, every two to five years.
Of course, as stated above, the administration can be repeated at suitable intervals if necessary or desirable.
The following examples illustrate methods for preparing H3HA2 and BHA2 fusion proteins of the invention and demonstrate the subtype specific protection against heterologous virus induced upon vaccination with the H3HA2 proteins. These examples are illustrative only and do not limit the scope of the invention.
EXAMPLE 1 - PLASMID PMS3H3HA
Plasmid pFV88 contains the entire 221 amino acid length HA from A/Udorn, an H3 subtype virus [C. J. Lai et al, Proc. Natl. Acad. Sci. USA. 77:210-214
(1980)], which HA nucleic acid sequence is illustrated in Fig. 1 [SEQ ID NO: 1]. This plasmid was cut with Pst I. The resulting 1900 bp fragment, which contains the entire HA (HA1 and HA2) fragment and some GC tailing, was then inserted into pUC18 [Bethesda Research Laboratories].
The resulting plasmid is termed pMS3 or pMS3H3HA.
EXAMPLE 2 - pPMG1
Plasmid pAPR801 is a pBR322-derived cloning vector which carries the NS1 coding region (A/PR/8/34). It is described by Young et al, in The Origin of Pandemic Influenza Viruses, ed. by W. G. Laver, Elsevier Science Publishing Co. (1983).
Plasmid pAS1 is a pBR322-derived expression vector which contains the PL promoter, an N utilization site (to relieve transcriptional polarity effects in the presence of N protein) and the ell ribosome binding site including the ell translation initiation codon followed immediately by a BamHI site. It is described by
Rosenberg et al, in Methods Enzymol., 101:123-138 (1983).
Plasmid pAS1ΔEH was prepared by deleting a non-essential EcoRI-HindIII region of pBR322 origin from pAS1. A 1236 base pair BamHI fragment of pAPR801, containing the NS1 coding region in 861 base pairs of viral origin and 375 base pairs of pBR322 origin, was inserted into the BamHI site of pAS1ΔEH. The resulting plasmid, pAS1ΔEH/801 expresses authentic NS1 (230 amino acids). The plasmid has an NcoI site between the codons for amino acids 81 and 82 and an NruI site 3' to the NS sequences. The BamHI site between amino acids 1 and 2 is retained.
Plasmid pMG27N, a pAS1 derivative [ Mol . Cell. Biol., 5:1015-1024 (1985)], was cut with BamHI and SacI and ligated to a BamHI/NcoI fragment encoding the first 81 amino acids of NS1 from pAS1ΔEH801 and a synthetic DNA NcoI/SacI fragment of the following sequence:
SEQ ID NO: 35:
5'-CATGGATCATATGTTAACAGATATCAAGGCCTGACTGACTGAGAGCT-3' SEQ ID NO: 36:
3'- CTAGTATACAATTGTCTATAGTTCCGGACTGACTGACTC -5'
The resulting plasmid, pMG1, allows the
insertion of DNA fragments after the first 81 amino acids of NS1 in any of the three reading frames within the synthetic linker fragment followed by termination codons in all three reading frames.
EXAMPLE 3 - PMG1H3HA
Plasmid pMG1, described above in Example 2, was digested with NcoI and XbaI, releasing a 54 bp fragment, which was discarded. pMS3H3HA, described in Example 1 above, was digested with HhaI and XbaI, and a 701 bp fragment containing the coding sequence for the HA2 subunit of influenza strain A/Udorn (H3N2) was isolated, as illustrated in Fig. 1 [SEQ ID NO: 1].
Synthetic oligonucleotides were annealed to generate an NcoI 5' overhang sequence (at the 5' end) and a HhaI 3' overhang sequence (at the 3' end). The
sequence of these oligonucleotides is as follows:
SEQ ID NO: 37: 5' -CATGGGCGCCCATATGGGCATATTCGGCG-3' SEQ ID NO: 38: 3'- CCGCGGGTATACCCGTATAAGCC -5' The annealing reaction was performed as follows. The annealing mixture was made up of 2.5μL each of 5' oligo (1.3 μg/μL), the 3' oligo (1.2 μg/μL), and added water (15 μL) to a final volume of 20 μL. The reaction tubes were then placed in 4 mL culture tubes containing water which had been heated to 65°C for 10 minutes and allowed to cool down slowly. The tubes were then put on ice and used immediately for ligation.
This three part ligation generates pMG1H3HA2(1-221) [SEQ ID NO: 9] which codes for the first 81 amino acids of NS1 fused to four amino acids donated from the linker and amino acids 1-221 of the HA2 subunit. This sequence
is illustrated in Fig. 2 [SEQ ID NO: 9 & 10]. This molecule is also designated NS1(1-81)H3HA2(1-221) [SEQ ID NO: 9 & 10]. EXAMPLE 4 - NS1(1-81)H3HA2(77-221) [SEQ ID NO: 11 & 12]
pMS3H3HA, described in Example 1 above, was digested with EcoRI and end-filled (Klenow).
Subsequently, the vector was digested with XbaI. A 487 bp fragment, which contains the coding sequence for amino acids 77-221 of the HA2 subunit, was isolated and ligated to the HpaI and XbaI sites of pMG1. The resulting vector codes for a fusion polypeptide containing amino acids 1- 81 of NS1 fused to amino acids 77-221 of the HA2 subunit. This molecule has been termed NS1(1-81)H3HA2{77-221) and is illustrated in Fig. 3 [SEQ ID NO: 11 & 12].
EXAMPLE 5 - PMG42BLHA2
To derive a vector similar to pMG1 (described in Example 2), which contains the coding region for the first 42 amino acids of NS1 father than the first 81 amino acids of NS1, pMG1 was digested with BamHI and NcoI and ligated to the BamHI/NcoI fragment encoding amino acids 2 to 42 of NS1 from pNS142TGFα. pNS142TGFα is derived when pASlΔEH801 is cut with NcoI and SalI and ligated to a synthetic DNA encoding human TGFα as an
NcoI/SalI fragment. pNS142TGFα encodes a protein
comprised of the first 42 amino acids of NS1 and the mature TGFα sequence. The NS1 portion of pNS142TGFα contains an amino acid change from Cys to Ser at amino acid #13.
The resulting plasmid, termed pMG42A, was then modified to contain an alternative synthetic linker after the NS142 sequence with a different set of restriction enzyme sites within which to insert foreign DNA fragments into the three reading frames after the NS142. This linker has the following sequence:
SEQ ID NO: 39:
5' -CATGGATCATATGTTAACAAGTACTCGATATCAATGAGTGACTGAAGCT-3 ' SEQ ID NO: 40:
3' - CTAGTATACAATTGTTCATGAGCTATAGTTACTCACTGACT -5'
The resulting plasmid is called pMG42B. This vector is needed to contain the neomycin phosphotransferase-1 (NPT- 1) gene which confers kanamycin resistance.
As described in Shatzman and Rosenberg, Met. Enzymol., 152:661-673 (1987), pOTS207 is a pAS derived cloning vector which carries the kanamycin resistance gene from Tn903 [Berg et al, Microbiology, ed. D.
Schlessinger, pp. 13-15, American Society for
Microbiology (Washington, DC 1978); Nomura et al, The Single-Stranded DNA Phages. ed. D. Denhardt et al,
pp.467-472, Cold Spring Harbor Laboratory (New York
1978); Castellazzi et al, Molecul. Gen. Genet., 117:211-218 (1982)]. It was constructed by digesting plasmid pUC8 [Yanisch-Perron et al, Gene. 33:103-119 (1985)], with BamHI and ligated to a BcII fragment containing the kanamycin gene from Tn903. The resulting plasmid, pUC8-Kan, was digested with EcoRI and PstI, and the fragment containing the kanamycin gene was inserted between the EcoRI and PstI sites of pOTSV [Shatzman and Rosenberg, cited above]. The resulting plasmid is pOTS207.
The pOTS207 was digested with EcoRI and PstI, and the 1467 bp fragment containing the kanamycin
resistance gene was isolated. Synthetic
oligonucleotides:
SEQ ID NO: 41: 5' AATTCGTACCTA 3'
SEQ ID NO: 42: 3' GCATGGATCTAG 5'
were made to link the NPT-1 gene to pMG42B vector. pMG42B was digested with BglII and PstI. The EcoRI/PstI NPT-1 gene fragment and the synthetic oligo linker were ligated to the digested pMG42B. The resulting plasmid, pMG47Kn allows fusions, in three different reading frames, to the NS1-42 gene, while allowing antibiotic selection with kanamycin.
Plasmid pBHA is a pBR322-derived vector, containing the complete nucleotide sequence of the hemagglutinin (HA) gene of a type B influenza virus
(B/Lee/40). It is described by Krystal et al, Proc.
Natl. Acad. Sci. USA. 79: 4900-4804 (1982). pBHA was digested with Rsal and a 813 bp fragment containing the HA subunit was isolated. This fragment was ligated into plasmid pMG42Kn (described above) that had been digested with ScaI. During the cloning, a base (T) was deleted from the ScaI recognition site shifting the gene out of the reading frame. The vector was digested with NcoI, and filled-in using Klenow, putting the gene back into the reading frame.
The resulting construct, pMG42BLHA2 [SEQ ID NO: 14], expresses a fusion polypeptide containing amino acids 1-42 of NS1 and 41-233 of the HA2 subunit. This construct contains the Cys to Ser change at amino acid #13 of the NS1 portion of the fusion peptide.
In preliminary studies with this construct, vaccinated laboratory mice demonstrated protection from challenge with type B influenza in the absence of
neutralizing antibody for the virus. EXAMPLE 6 - PREPARING SEED VIRUS AND RAISING ANTISERA
The seed virus, A/Udorn, was prepared according to the procedures described in P. Palese and J. Schulman, Virol., 57:227-237 (1974). Briefly, this technique is as follows.
Influenza virus strain A/Udorn was inoculated in 10-day old embryonated hen's eggs into the allantoic cavity. The eggs were incubated for 24-48 hours at 35°C then chilled at 4°C overnight. A portion of the eggshell over the airsac was removed and the allantoic fluid was aseptically removed using a 10-ml syringe. The fluid was centrifuged at low speed (3,000 × g) to remove
particulates. This clarified supernatant was centrifuged at high speed using an SW28 Beckman rotor at 27,000 rpm (4°C for 90 minutes), resulting in the virus pellet. The virus was resuspended in 10 mM Tris (pH 7.5) containing 100 mM NaCl, 1 mM EDTA and repelleted as before. The virus was layered on 30-60% sucrose gradient in 1 mM EDTA (NTE) and spun for 3-5 hours at 25,000 rpm. The band in the middle of the tube was withdrawn, diluted in NTE and centrifuged at 27,000 rpm for 90 minutes. The pellet was suspended in phosphate-buffered saline (PBS). These viral particles were used as immunogens for preparation of antisera.
Antisera was prepared as follows. 100-200 micrograms of purified virus in complete Freund's
adjuvant was injected into the subscapula of a New
Zealand White rabbit. A second injection in incomplete Freund's adjuvant was done 4 weeks later, and the animals were bled 7-10 days later.
EXAMPLE 7 - EXPRESSION OF H3HA2 FUSION PROTEINS
A. NS1(1-81)H3HA2(1-221) [SEQ ID NO: 9 & 10]
The plasmid pMG1H3HA2(1-221) [SEQ ID NO: 9] was transfected into E. coli strain AR58 [SmithKline Beecham Pharmaceuticals]. Cultures were grown at 32°C to mid-log phase at which time cultures were shifted to 39.5°C for 2 hours. The E. coli cell pellets containing the
recombinant polypeptide were then stored at -70°C until used.
Production of the NS1(1-81)H3HA2(1-221) protein [SEQ ID
NO: 10] was confirmed by Western blot analysis [Towbin et al, Proc. Natl. Acad. Sci. U.S.A.. 76:4350 (1979)] using antisera prepared against A/Udorn virus, as described in Example 5. A major immunoreactive species was found at a molecular weight of 35,050 daltons.
B. NS1(1-81)H3HA2(77-221) [SEQ ID NO: 11 & 12]
The plasmid encoding the NS1(1-81)H3HA2(77-221) peptide [SEQ ID NO: 11 & 12] was expressed as described in part A above. Production of this peptide was confirmed by
Western blot analysis, as described above. A major immunoreactive species was found at a molecular weight of 26,697 daltons.
EXAMPLE 8 - PARTIAL PURIFICATION OF H3HA2 FUSION PROTEINS E. coli cell pellets containing the recombinant polypeptides, prepared as described in Example 6, were stored at -70°C until used. E. coli cells were thawed and resuspended in lysis buffer A (50 mM Tris-HCl, 5% glycerol, 2 mM EDTA and 0.1 mM DTT, pH 8.0) at 10
mL/gram. The stirred suspension was then treated with lysozyme (0.2 mg/mL) for 45 minutes at room temperature and sonicated 2× for 2-3 minutes each time by a
Sonicator. The resultant suspension was treated with 0.1% DOC for 60 minutes at 4°C, then centrifuged at
25,000 × g. The pellet was resuspended by sonication in 50 mM glycine pH 10.0, 5% glycerol, 2 mM EDTA and then the suspension was treated with 1% Triton X-100 [J.T. Baker Chemicals Co.] at 4°C for 60 minutes and
centrifuged as above.
The resulting pellet was solubilized in 50 mM Tris, 8 M urea, pH 8.0 and centrifuged to remove any insoluble material. This solubilized material is dialyzed against 10 mM Tris, 1 mM EDTA, pH 8.0 followed, again, by centrifugation of insoluble material. The solubilized material is designated as "crude" material and is used in in vitro and in vivo mouse assays. At this point, the material is approximately 40 - 50% pure.
The "crude" material was electrophoresed through an SDS-PAGE and the appropriate H3HA2 protein bands were visualized by KCl staining according to D. Hager et al, Anal. Biochem. 109:76-86 (1980). The band was cut-out and eluted electrophoretically by the "S&S Elutrap Electro-Separation System" [Schleicher &
Schuell]. The electro-eluting buffer was the Tris-glycine. A concentrated and eluted sample was obtained and exhaustively dialyzed against 0.01 M NH4HCO3 and 0.02% SDS [M. Hunkapiller et al, Method. Enzymol., 91:227-236 (1983)]. This sample was frozen quickly by dry ice and lyophilized to complete dryness. The lyophilized
material was brought back into solution using 50 mM Tris pH 8.0 and used for in vitro and in vivo mouse assays.
Following this gel elution step, the protein is usually greater than 75% pure.
EXAMPLE 9 - H3 SUBTYPE HETEROLOGOUS PROTECTION ELICITED BY VACCINATION WITH NS1(1-81)H3HA2(1-221) [SEQ ID NO: 10]
Mice (NIH/Swiss; 15 per group) were vaccinated subcutaneously with 50 or 10 μg NS1(1-81)H3HA2(1-221) [SEQ ID NO: 9 & 10] in aluminum hydroxide on days 0 and 21. The mice were boosted intraperitoneally on day 42 with the protein without adjuvant. On day 47, mice were challenged intranasally with 2 - 3 LD50 doses of either A/PR/8/34 (H1N1) or A/HK/68 (H3N2) virus, and survival was
monitored through day 21. This represents a heterologous challenge (A/PR/8/34) and an H3 heterosubtypic challenge, since the NS1(1-81)H3HA2(1-221) construct [SEQ ID NO: 9 & 10] was derived from A/Udorn/72 cDNA. The control group received adjuvant (CFA) only.
The results in Table 1 below show that survival in mice vaccinated with NS1(1-81)H3HA2(1-221) [SEQ ID NO: 10] and challenged with A/HK/68 (80-93%) was significantly higher than in control mice which were injected with adjuvant only (26% survival). In contrast, vaccination with NS1{1- 81)H3HA2(1-221) [SEQ ID NO: 10] did not confer protection against challenge with A/PR/8/34, an H1N1 strain (0-26% survival). Thus protection elicited by NS1(1-81)H3HA2(1.221) [SEQ ID NO: 10] is selective for antigenically diverse virus strains within the H3 subtype.
Likewise, vaccination with the D protein
(NS1(1-81)HA2(65-222) [SEQ ID NO: 18], derived from the H1N1 subtype) elicits protection from heterosubtypic challenge with H1N1, but not the H3N2 subtype [S Dillon et al,
Nature, in press (1992); Mbawuike et al, Faseb. J.,
5:A1362 (abs. 5749 and Table 1]. These results in outbred mice also suggest that the response to the H1 and H3 proteins will not be restricted to a limited number of individuals with certain major histocompatibility
alleles, and therefore the vaccine will be effective in a majority of individuals.
Table 1
Percent Survival After Challenge:
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
Immunization HA A/PR/8/34 A/HK/68
Subtype (H1N1) (H3N2)
50 μg NS11-81H3HA21-221 H3 26 80*
10 μg NS11-81H3HA21-221 H3 0 93*
10 μg NS11-81HA244-222 H1 67* 13
A/HK/68 Virus H3 60* 100*
Control (Al+3) - 0 26
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - p ≤ 0.05 vs. control in Fishers exact probability test
Vaccination of mice with live homologous
(A/HK/68) virus provided complete or partial protection, reflecting protection mediated by neutralizing antibody
(homologous H3N2 challenge) and/or CTL (heterologous H1N1 challenge), respectively.
Duration of protective immunity was tested by immunizing mice subcutaneously with the recombinant influenza protein plus adjuvant on days 0 and 21. Some mice were also given an ip injection of the protein
(without adjuvant) on day 42. Mice were challenged with A/HK/68 (H3N2) on day 47, four weeks after the second injection. Control mice were immunized as described above for Table 1, where an ip injection was given at week 6 (5 days prior to challenge). The results in Table 2 show that CB6F1 mice (15 per group) were significantly protected when challenged with the A/HK/68 heterologous H3 virus strain 5-28 days after the last injection.
Table 2
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
Dose (μg per injection) Injection Percent
of NS11-81H3HA21-221 Adjuvant Schedule Survival
50 μg CFA 0,21 86*
50 μg CFA 0,21,42 100*
0 μg CFA 0,21 6
50 μg Al+3 0,21 93*
50 μg Al+3 0,21,42 93*
0 μg Al+3 0,21 0
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
*p ≤ 0.05 v. control in Fisher's exact probability test
EXAMPLE 10 - TYPE A CROSS-PROTECTION WITH D AND H3C13 PROTEIN
Mice (CB6F1) were divided randomly into six groups, with fifteen in each group. The mice were injected subcutaneously with proteins in Al+3 (100 μg) on days 0 and 21, and then were challenged with 2-3 LD50 doses of virus on day 49. Survival was monitored through day 21. The results of this study are illustrated in
Table 3 below. For convenience, NS11-81H3HA21-221 is referred to as H3C13 in the table below.
Table 3
Percent Survival After Challenge with:
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
HA A/PR/8/34 A/HK/68
Immunization Subtype (H1N1) (H3N2}
1. 50 μg H3C13 H3 73* 73*
50 μg D H1
2. 10 μg H3C13 H3 67* 100*
10 μg D H1
3. 1 μg H3C13 H3 86* 73*
1 μg D H1
4. 50 μg H3C13 H3 7 73*
5. 50 μg D H1 47** 7
6. Al+3 control - 7 0
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
* p ≤ 0.001 vs. control group
** p ≤ 0.03 vs. control group
This data demonstrates that mice immunized with a mixture of the D protein and H3C13 protein in aluminum adjuvant were protected against challenge with either
A/PR/8/34 (H1) or A/HK/68 (H3) virus. In contrast, mice immunized with the D protein were protected against H1 but not H3 challenge. Likewise, mice immunized with the
H3C13 protein were protected against the H3 but not the H1 challenge. Therefore, the combination of the D protein and the H3C13 proteins elicited protection against the currently circulating subtypes of influenza A virus. Thus, this combination represents a subtype cross-protective vaccine.
Numerous modifications and variations of the present invention are included in the above-identified specification and are expected to be obvious to one of skill in the art. Such modifications and alterations to the compositions and processes of the present invention are believed to be encompassed in the scope of the claims appended hereto.
SEQUENCE LISTING
(1) GENERAL INFORMATION:
(i) APPLICANT: Shatzman, Allan
Scott, Miller
Dillon, Susan B.
(ii) TITLE OF INVENTION: Vaccinal Polypeptides
(iii) NUMBER OF SEQUENCES: 42
(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: SmithKline Beecham Corporation - Corporate
Patents
(B) STREET: U.S. Mailcode VW2220 - 709 Swedeland Road
(C) CITY: King of Prussia
(D ) STATE: Pennsylvania
(E) COUNTRY: USA
(F) ZIP: 19406-2799
(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.25
(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: US
(B) FILING DATE:
(C) CLASSIFICATION:
(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Canter, Carol G.
(B) REGISTRATION NUMBER: 31,151
(C) REFERENCE/DOCKET NUMBER: SBC14224-8
(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: 215-270-5013
(B) TELEFAX: 215-270-5090
(2) INFORMATION FOR SEQ ID NO:1:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 666 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS : double
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..663
( xi ) SEQUENCE DESCRIPTION : SEQ ID NO: 1 :
GGC ATA TTC GGC GCA ATA GCA GGT TTC ATA GAA AAT GGT TGG GAG GGA 48
Gly Ile Phe Gly Ala Ile Ala Gly Phe Ile Glu Asn Gly Trp Glu Gly
1 5 10 15
ATG ATA GAC GGT TGG TAC GGT TTC AGG CAT CAA AAT TCT GAG GGC ACA 96 Met Ile Asp Gly Trp Tyr Gly Phe Arg His Gln Asn Ser Glu Gly Thr
20 25 30
GGA CAA GCA GCA GAT CTT AAA AGC ACT CAA GCA GCC ATC GAC CAA ATC 144 Gly Gln Ala Ala Asp Leu Lys Ser Thr Gln Ala Ala Ile Asp Gln Ile
35 40 45
AAT GGG AAA CTG AAT AGG GTA ATC GAG AAG ACG AAC GAG AAA TTC CAT 192 Asn Gly Lys Leu Asn Arg Val Ile Glu Lys Thr Asn Glu Lys Phe His
50 55 60
CAA ATC GAA AAG GAA TTC TCA GAA GTA GAA GGG AGA ATT CAG GAC CTC 240 Gln Ile Glu Lys Glu Phe Ser Glu Val Glu Gly Arg Ile Gln Asp Leu
65 70 75 80
GAG AAA TAC GTT GAA GAC ACT AAA ATA GAT CTC TGG TCT TAC AAT GCG 288 Glu Lys Tyr Val Glu Asp Thr Lys Ile Asp Leu Trp Ser Tyr Asn Ala
85 90 95
GAG CTT CTT GTC GCT CTG GAG AAC CAA CAT ACA ATT GAT CTG ACT GAC 336 Glu Leu Leu Val Ala Leu Glu Asn Gln His Thr Ile Asp Leu Thr Asp
100 105 110
TCG GAA ATG AAC AAA CTG TTT GAA AAA ACA AGG AGG CAA CTG AGG GAA 384 Ser Glu Met Asn Lys Leu Phe Glu Lys Thr Arg Arg Gln Leu Arg Glu
115 120 125
AAT GCT GAG GAC ATG GGC AAT GGT TGC TTC AAA ATA TAC CAC AAA TGT 432 Asn Ala Glu Asp Met Gly Asn Gly Cys Phe Lys Ile Tyr His Lys Cys
130 135 140
GAC AAT GCT TGC ATA GGG TCA ATC AGA AAT GGG ACT TAT GAC CAT GAT 480 Asp Asn Ala Cys Ile Gly Ser Ile Arg Asn Gly Thr Tyr Asp His Asp
145 150 155 160
GTA TAC AGA GAC GAA GCA TTA AAC AAC CGG TTT CAG ATC AAA GGT GTT 528 Val Tyr Arg Asp Glu Ala Leu Asn Asn Arg Phe Gln Ile Lys Gly Val
165 170 175
GAA CTG AAG TCA GGA TAC AAA GAC TGG ATC CTG TGG ATT TCC TTT GCC 576 Glu Leu Lys Ser Gly Tyr Lys Asp Trp Ile Leu Trp Ile Ser Phe Ala
180 185 190
ATA TCA TGC TTT TTG CTT TGT GTT GTT TTG CTG GGG TTC ATC ATG TGG 624 Ile Ser Cys Phe Leu Leu Cys Val Val Leu Leu Gly Phe Ile Met Trp
195 200 205
GCC TGC CAG AAA GGC AAC ATT AGG TGC AAC ATT TGC ATT TGA 666
Ala Cys Gln Lys Gly Asn Ile Arg Cys Asn Ile Cys Ile
210 215 220
(2) INFORMATION FOR SEQ ID NO:2:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 221 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:
Gly Ile Phe Gly Ala Ile Ala Gly Phe Ile Glu Asn Gly Trp Glu Gly 1 5 10 15
Met Ile Asp Gly Trp Tyr Gly Phe Arg His Gln Asn Ser Glu Gly Thr
20 25 30
Gly Gln Ala Ala Asp Leu Lys Ser Thr Gln Ala Ala Ile Asp Gln Ile
35 40 45
Asn Gly Lys Leu Asn Arg Val Ile Glu Lys Thr Asn Glu Lys Phe His 50 55 60
Gln Ile Glu Lys Glu Phe Ser Glu Val Glu Gly Arg Ile Gln Asp Leu 65 70 75 80
Glu Lys Tyr Val Glu Asp Thr Lys Ile Asp Leu Trp Ser Tyr Asn Ala
85 90 95
Glu Leu Leu Val Ala Leu Glu Asn Gln His Thr Ile Asp Leu Thr Asp
100 105 110
Ser Glu Met Asn Lys Leu Phe Glu Lys Thr Arg Arg Gln Leu Arg Glu
115 120 125
Asn Ala Glu Asp Met Gly Asn Gly Cys Phe Lys Ile Tyr His Lys Cys 130 135 140
Asp Asn Ala Cys Ile Gly Ser Ile Arg Asn Gly Thr Tyr Asp His Asp 145 150 155 160
Val Tyr Arg Asp Glu Ala Leu Asn Asn Arg Phe Gln Ile Lys Gly Val
165 170 175
Glu Leu Lys Ser Gly Tyr Lys Asp Trp Ile Leu Trp Ile Ser Phe Ala
180 185 190
Ile Ser Cys Phe Leu Leu Cys Val Val Leu Leu Gly Phe Ile Met Trp
195 200 205
Ala Cys Gln Lys Gly Asn Ile Arg Cys Asn Ile Cys Ile
210 215 220
(2) INFORMATION FOR SEQ ID NO: 3:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 666 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS : double
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..663
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
GGC ATA TTC GGC GCA ATA GCA GGT TTC ATA GAA AAT GGT TGG GAG GGA 48 Gly Ile Phe Gly Ala Ile Ala Gly Phe Ile Glu Asn Gly Trp Glu Gly
1 5 10 15
ATG ATA GAC GGT TGG TAC GGT TTC AGG CAT CAA AAT TCC GAG GGC ACA 96 Met Ile Asp Gly Trp Tyr Gly Phe Arg His Gln Asn Ser Glu Gly Thr
20 25 30
GGA CAA GCA GCA GAT CTT AAA AGC ACT CAA GCA GCC ATC GAC CAA ATC 144 Gly Gln Ala Ala Asp Leu Lys Ser Thr Gln Ala Ala Ile Asp Gln Ile
35 40 45
AAT GGG AAA CTG AAT AGG GTA ATC GAG AAG ACG AAC GAG AAA TTC CAT 192 Asn Gly Lys Leu Asn Arg Val Ile Glu Lys Thr Asn Glu Lys Phe His
50 55 60
CAA ATC GAA AAG GAA TTC TCA GAA GTA GAA GGG AGA ATT CAG GAC CTC 240 Gln Ile Glu Lys Glu Phe Ser Glu Val Glu Gly Arg Ile Gln Asp Leu
65 70 75 80
GAG AAA TAC GTT GAA GAC ACT AAA ATA GAT CTC TGG TCT TAC AAT GCG 288 Glu Lys Tyr Val Glu Asp Thr Lys Ile Asp Leu Trp Ser Tyr Asn Ala
85 90 95
GAG CTT CTT GTC GCT CTG GAG AAC CAA CAT ACA ATT GAT CTG ACT GAC 336 Glu Leu Leu Val Ala Leu Glu Asn Gln His Thr Ile Asp Leu Thr Asp
100 105 110
TCG GAA ATG AAC AAA CTG TTT GAA AAA ACA AGG AGG CAA CTG AGG GAA 384 Ser Glu Met Asn Lys Leu Phe Glu Lys Thr Arg Arg Gln Leu Arg Glu
115 120 125
AAT GCT GAG GAC ATG GGC AAT GGT TGC TTC AAA ATA TAC CAC AAA TGT 432 Asn Ala Glu Asp Met Gly Asn Gly Cys Phe Lys Ile Tyr His Lys Cys
130 135 140
GAC AAT GCT TGC ATA GGG TCA ATC AGA AAT GGG ACT TAT GAC CAT GAT 480 Asp Asn Ala Cys Ile Gly Ser Ile Arg Asn Gly Thr Tyr Asp His Asp
145 150 155 160
GTA TAC AGA GAC GAA GCA TTA AAC AAC CGG TTT CAG ATC AAA GGT GTT 528 Val Tyr Arg Asp Glu Ala Leu Asn Asn Arg Phe Gln Ile Lys Gly Val
165 170 175
GAA CTG AAG TCA GGA TAC AAA GAC TGG ATC CTG TGG ATT TCC TTT GCC 576 Glu Leu Lys Ser Gly Tyr Lys Asp Trp Ile Leu Trp Ile Ser Phe Ala
180 185 190
ATA TCA TGC TTT TTG CTT TGT GTT GTT TTG CTG GGG TTC ATC ATG TGG 624 Ile Ser Cys Phe Leu Leu Cys Val Val Leu Leu Gly Phe Ile Met Trp
195 200 205
GCC TGC CAA AAA GGC AAC ATT AGG TGC AAC ATT TGC ATT TGA 666
Ala Cys Gln Lys Gly Asn Ile Arg Cys Asn Ile Cys Ile
210 215 220
(2) INFORMATION FOR SEQ ID NO: 4:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 221 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:
Gly Ile Phe Gly Ala Ile Ala Gly Phe Ile Glu Asn Gly Trp Glu Gly
1 5 10 15
Met Ile Asp Gly Trp Tyr Gly Phe Arg His Gln Asn Ser Glu Gly Thr
20 25 30
Gly Gln Ala Ala Asp Leu Lys Ser Thr Gln Ala Ala Ile Asp Gln Ile
35 40 45
Asn Gly Lys Leu Asn Arg Val Ile Glu Lys Thr Asn Glu Lys Phe His
50 55 60
Gln Ile Glu Lys Glu Phe Ser Glu Val Glu Gly Arg Ile Gln Asp Leu
65 70 75 80
Glu Lys Tyr Val Glu Asp Thr Lys Ile Asp Leu Trp Ser Tyr Asn Ala
85 90 95
Glu Leu Leu Val Ala Leu Glu Asn Gln His Thr Ile Asp Leu Thr Asp
100 105 110
Ser Glu Met Asn Lys Leu Phe Glu Lys Thr Arg Arg Gln Leu Arg Glu
115 120 125
Asn Ala Glu Asp Met Gly Asn Gly Cys Phe Lys Ile Tyr His Lys Cys
130 135 140
Asp Asn Ala Cys Ile Gly Ser Ile Arg Asn Gly Thr Tyr Asp His Asp
145 150 155 160
Val Tyr Arg Asp Glu Ala Leu Asn Asn Arg Phe Gln Ile Lys Gly Val
165 170 175
Glu Leu Lys Ser Gly Tyr Lys Asp Trp Ile Leu Trp Ile Ser Phe Ala
180 185 190
Ile Ser Cys Phe Leu Leu Cys Val Val Leu Leu Gly Phe Ile Met Trp
195 200 205
Ala Cys Gln Lys Gly Asn Ile Arg Cys Asn Ile Cys Ile
210 215 220
(2) INFORMATION FOR SEQ ID NO: 5:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 670 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE :
(A) NAME/KEY: CDS
(B) LOCATION: 1..666
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:
GGT CTA TTT GGA GCC ATT GCC GGT TTT ATT GAA GGG GGA TGG ACT GGA 48 Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly Gly Trp Thr Gly
1 5 10 15
ATG ATA GAT GGA TGG TAC GGT TAT CAT CAT CAG AAT GAA CAG GGA TCA 96 Met Ile Asp Gly Trp Tyr Gly Tyr His His Gln Asn Glu Gln Gly Ser
20 25 30
GGC TAT GCA GCG GAT CAA AAA AGC ACA CAA AAT GCC ATT AAC GGG ATT 144 Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala Ile Asn Gly lie
35 40 45
ACA AAC AAG GTG AAC TCT GTT ATC GAG AAA ATG AAC ATT CAA TTC ACA 192 Thr Asn Lys Val Asn Ser Val Ile Glu Lys Met Asn Ile Gln Phe Thr
50 55 60
GCT GTG GGT AAA GAA TTC AAC AAA TTA GAA AAA AGG ATG GAA AAT TTA 240 Ala Val Gly Lys Glu Phe Asn Lys Leu Glu Lys Arg Met Glu Asn Leu
65 70 75 80
AAT AAA AAA GTT GAT GAT GGA TTT CTG GAC ATT TGG ACA TAT AAT GCA 288 Asn Lys Lys Val Asp Asp Gly Phe Leu Asp Ile Trp Thr Tyr Asn Ala
85 90 95
GAA TTG TTA GTT CTA CTG GAA AAT GAA AGG ACT CTG GAT TTC CAT GAC 336 Glu Leu Leu Val Leu Leu Glu Asn Glu Arg Thr Leu Asp Phe His Asp
100 105 110
TCA AAT GTG AAG AAT CTG TAT GAG AAA GTA AAA AGC CAA TTA AAG AAT 384 Ser Asn Val Lys Asn Leu Tyr Glu Lys Val Lys Ser Gln Leu Lys Asn
115 120 125
AAT GCC AAA GAA ATC GGA AAT GGA TGT TTT GAG TTC TAC CAC AAG TGT 432 Asn Ala Lys Glu Ile Gly Asn Gly Cys Phe Glu Phe Tyr His Lys Cys
130 135 140
GAC AAT GAA TGC ATG GAA AGT GTA AGA AAT GGG ACT TAT GAT TAT CCC 480 Asp Asn Glu Cys Met Glu Ser Val Arg Asn Gly Thr Tyr Asp Tyr Pro
145 150 155 160
AAA TAT TCA GAA GAG TCA AAG TTG AAC AGG GAA AAG GTA GAT GGA GTG 528 Lys Tyr Ser Glu Glu Ser Lys Leu Asn Arg Glu Lys Val Asp Gly Val
165 170 175
AAA TTG GAA TCA ATG GGG ATC TAT CAG ATT CTG GCG ATC TAC TCA ACT 576 Lys Leu Glu Ser Met Gly Ile Tyr Gln Ile Leu Ala Ile Tyr Ser Thr
180 185 190
GTC GCC AGT TCA CTG GTG CTT TTG GTC TCC CTG GGG GCA ATC AGT TTC 624 Val Ala Ser Ser Leu Val Leu Leu Val Ser Leu Gly Ala Ile Ser Phe
195 200 205
TGG ATG TGT TCT AAT GGA TCT TTG CAG TGC AGA ATA TGC ATC 666
Trp Met Cys Ser Asn Gly Ser Leu Gln Cys Arg Ile Cys Ile
210 215 220
TGAG 670
(2) INFORMATION FOR SEQ ID NO: 6:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 222 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:
Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly Gly Trp Thr Gly
1 5 10 15
Met Ile Asp Gly Trp Tyr Gly Tyr His His Gln Asn Glu Gln Gly Ser
20 25 30
Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala Ile Asn Gly Ile
35 40 45
Thr Asn Lys Val Asn Ser Val Ile Glu Lys Met Asn Ile Gln Phe Thr
50 55 60
Ala Val Gly Lys Glu Phe Asn Lys Leu Glu Lys Arg Met Glu Asn Leu
65 70 75 80
Asn Lys Lys Val Asp Asp Gly Phe Leu Asp Ile Trp Thr Tyr Asn Ala
85 90 95
Glu Leu Leu Val Leu Leu Glu Asn Glu Arg Thr Leu Asp Phe His Asp
100 105 110
Ser Asn Val Lys Asn Leu Tyr Glu Lys Val Lys Ser Gln Leu Lys Asn
115 120 125
Asn Ala Lys Glu Ile Gly Asn Gly Cys Phe Glu Phe Tyr His Lys Cys
130 135 140
Asp Asn Glu Cys Met Glu Ser Val Arg Asn Gly Thr Tyr Asp Tyr Pro
145 150 155 160
Lys Tyr Ser Glu Glu Ser Lys Leu Asn Arg Glu Lys Val Asp Gly Val
165 170 175
Lys Leu Glu Ser Met Gly Ile Tyr Gln Ile Leu Ala Ile Tyr Ser Thr
180 185 190
Val Ala Ser Ser Leu Val Leu Leu Val Ser Leu Gly Ala Ile Ser Phe
195 200 205
Trp Met Cys Ser Asn Gly Ser Leu Gln Cys Arg Ile Cys Ile
210 215 220
(2) INFORMATION FOR SEQ ID NO: 7:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 670 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS : double
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..670
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:
GGCATATTCG GCGCAATAGC AGGTTTCATA GAAAATGGTT GGGAGGGAAT GATAGACGGT 60
TGGTACGGTT TCAGGCATCA AAATTCNGAG GGCACAGGAC AAGCAGCAGA TCTTAAAAGC 120
ACTCAAGCAG CCATCGACCA AATCAATGGG AAACTGAATA GGGTAATCGA GAAGACGAAC 180
GAGAAATTCC ATCAAATCGA AAAGGAATTC TCAGAAGTAG AAGGGAGAAT TCAGGACCTC 240
GAGAAATACG TTGAAGACAC TAAAATAGAT CTCTGGTCTT ACAATGCGGA GCTTCTTGTC 300
GCTCTGGAGA ACCAACATAC AATTGATCTG ACTGACTCGG AAATGAACAA ACTGTTTGAA 360
AAAACAAGGA GGCAACTGAG GGAAAATGCT GAGGACATGG GCAATGGTTG CTTCAAAATA 420
TACCACAAAT GTGACAATGC TTGCATAGGG TCAATCAGAA ATGGGACTTA TGACCATGAT 480
GTATACAGAG ACGAAGCATT AAACAACCGG TTTCAGATCA AAGGTGTTGA ACTGAAGTCA 540
GGATACAAAG ACTGGATCCT GTGGATTTCC TTTGCCATAT CATGCTTTTT GCTTTGTGTT 600
GTTTTGCTGG GGTTCATCAN NNTGTGGGCC TGCCANAAAG GCAACATTAG GTGCAACATT 660
TGCATTTGAN 670
(2) INFORMATION FOR SEQ ID NO: 8:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 222 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:
Gly Ile Phe Gly Ala Ile Ala Gly Phe Ile Glu Asn Gly Trp Glu Gly 1 5 10 15
Met Ile Asp Gly Trp Tyr Gly Phe Arg His Gln Asn Ser Glu Gly Thr
20 25 30
Gly Gln Ala Ala Asp Leu Lys Ser Thr Gln Ala Ala Ile Asp Gln Ile
35 40 45
Asn Gly Lys Leu Asn Arg Val Ile Glu Lys Thr Asn Glu Lys Phe His 50 55 60
Gln Ile Glu Lys Glu Phe Ser Glu Val Glu Gly Arg Ile Gln Asp Leu 65 70 75 80
Glu Lys Tyr Val Glu Asp Thr Lys Ile Asp Leu Trp Ser Tyr Asn Ala
85 90 95
Glu Leu Leu Val Ala Leu Glu Asn Gln His Thr Ile Asp Leu Thr Asp
100 105 110
Ser Glu Met Asn Lys Leu Phe Glu Lys Thr Arg Arg Gln Leu Arg Glu
115 120 125
Asn Ala Glu Asp Met Gly Asn Gly Cys Phe Lys Ile Tyr His Lys Cys 130 135 140
Asp Asn Ala Cys Ile Gly Ser Ile Arg Asn Gly Thr Tyr Asp His Asp 145 150 155 160
Val Tyr Arg Asp Glu Ala Leu Asn Asn Arg Phe Gln Ile Lys Gly Val
165 170 175
Glu Leu Lys Ser Xaa Gly Tyr Lys Asp Trp Ile Leu Trp Ile Ser Phe
180 185 190
Ala Ile Ser Cys Phe Leu Leu Cys Val Val Leu Leu Gly Phe Ile Met
195 200 205
Trp Ala Cys Gln Lys Gly Asn Ile Arg Cys Asn Ile Cys Ile
210 215 220
(2) INFORMATION FOR SEQ ID NO: 9:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 918 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic]
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..918
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:
ATG GAT CCA AAC ACT GTG TCA AGC TTT CAG GTA GAT TGC TTT CTT TGG 48 Met Asp Pro Asn Thr Val Ser Ser Phe Gln Val Asp Cys Phe Leu Trp
1 5 10 15
CAT GTC CGC AAA CGA GTT GCA GAC CAA GAA CTA GGT GAT GCC CCA TTC 96 His Val Arg Lys Arg Val Ala Asp Gln Glu Leu Gly Asp Ala Pro Phe
20 25 30
CTT GAT CGG CTT CGC CGA GAT CAG AAA TCC CTA AGA GGA AGG GGC AGC 144 Leu Asp Arg Leu Arg Arg Asp Gln Lys Ser Leu Arg Gly Arg Gly Ser
35 40 45
ACT CTT GGT CTG GAC ATC GAG ACA GCC ACA CGT GCT GGA AAG CAG ATA 192 Thr Leu Gly Leu Asp Ile Glu Thr Ala Thr Arg Ala Gly Lys Gln Ile
50 55 60
GTG GAG CGG ATT CTG AAA GAA GAA TCC GAT GAG GCA CTT AAA ATG ACC 240 Val Glu Arg Ile Leu Lys Glu Glu Ser Asp Glu Ala Leu Lys Met Thr
65 70 75 80
ATG GGC GCC CAT ATG GGC ATA TTC GGC GCA ATA GCA GGT TTC ATA GAA 288 Met Gly Ala His Met Gly Ile Phe Gly Ala Ile Ala Gly Phe Ile Glu
85 90 95
AAT GGT TGG GAG GGA ATG ATA GAC GGT TGG TAC GGT TTC AGG CAT CAA 336 Asn Gly Trp Glu Gly Met Ile Asp Gly Trp Tyr Gly Phe Arg His Gln
100 105 110
AAT TCT GAG GGC ACA GGA CAA GCA GCA GAT CTT AAA AGC ACT CAA GCA 384 Asn Ser Glu Gly Thr Gly Gln Ala Ala Asp Leu Lys Ser Thr Gln Ala
115 120 125
GCC ATC GAC CAA ATC AAT GGG AAA CTG AAT AGG GTA ATC GAG AAG ACG 432 Ala Ile Asp Gln Ile Asn Gly Lys Leu Asn Arg Val Ile Glu Lys Thr
130 135 140
AAC GAG AAA TTC CAT CAA ATC GAA AAG GAA TTC TCA GAA GTA GAA GGG 480 Asn Glu Lys Phe His Gln Ile Glu Lys Glu Phe Ser Glu Val Glu Gly
145 150 155 160
AGA ATT CAG GAC CTC GAG AAA TAC GTT GAA GAC ACT AAA ATA GAT CTC 528 Arg Ile Gln Asp Leu Glu Lys Tyr Val Glu Asp Thr Lys Ile Asp Leu
165 170 175
TGG TCT TAC AAT GCG GAG CTT CTT GTC GCT CTG GAG AAC CAA CAT ACA 576 Trp Ser Tyr Asn Ala Glu Leu Leu Val Ala Leu Glu Asn Gln His Thr
180 185 190
ATT GAT CTG ACT GAC TCG GAA ATG AAC AAA CTG TTT GAA AAA ACA AGG 624 Ile Asp Leu Thr Asp Ser Glu Met Asn Lys Leu Phe Glu Lys Thr Arg
195 200 205
AGG CAA CTG AGG GAA AAT GCT GAG GAC ATG GGC AAT GGT TGC TTC AAA 672 Arg Gln Leu Arg Glu Asn Ala Glu Asp Met Gly Asn Gly Cys Phe Lys
210 215 220
ATA TAC CAC AAA TGT GAC AAT GCT TGC ATA GGG TCA ATC AGA AAT GGG 720 Ile Tyr His Lys Cys Asp Asn Ala Cys Ile Gly Ser Ile Arg Asn Gly
225 230 235 240
ACT TAT GAC CAT GAT GTA TAC AGA GAC GAA GCA TTA AAC AAC CGG TTT 768 Thr Tyr Asp His Asp Val Tyr Arg Asp Glu Ala Leu Asn Asn Arg Phe
245 250 255
CAG ATC AAA GGT GTT GAA CTG AAG TCA GGA TAC AAA GAC TGG ATC CTG 816 Gln Ile Lys Gly Val Glu Leu Lys Ser Gly Tyr Lys Asp Trp Ile Leu
260 265 270
TGG ATT TCC TTT GCC ATA TCA TGC TTT TTG CTT TGT GTT GTT TTG CTG 864 Trp Ile Ser Phe Ala Ile Ser Cys Phe Leu Leu Cys Val Val Leu Leu
275 280 285
GGG TTC ATC ATG TGG GCC TGC CAA AAA GGC AAC ATT AGG TGC AAC ATT 912 Gly Phe Ile Met Trp Ala Cys Gln Lys Gly Asn Ile Arg Cys Asn Ile
290 295 300
TGC ATT 918
Cys Ile
305
(2) INFORMATION FOR SEQ ID NO: 10:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 306 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:
Met Asp Pro Asn Thr Val Ser Ser Phe Gln Val Asp Cys Phe Leu Trp
1 5 10 15
His Val Arg Lys Arg Val Ala Asp Gln Glu Leu Gly Asp Ala Pro Phe
20 25 30
Leu Asp Arg Leu Arg Arg Asp Gln Lys Ser Leu Arg Gly Arg Gly Ser
35 40 45
Thr Leu Gly Leu Asp Ile Glu Thr Ala Thr Arg Ala Gly Lys Gln Ile
50 55 60
Val Glu Arg Ile Leu Lys Glu Glu Ser Asp Glu Ala Leu Lys Met Thr
65 70 75 80
Met Gly Ala His Met Gly Ile Phe Gly Ala Ile Ala Gly Phe Ile Glu
85 90 95
Asn Gly Trp Glu Gly Met Ile Asp Gly Trp Tyr Gly Phe Arg His Gln
100 105 110
Asn Ser Glu Gly Thr Gly Gln Ala Ala Asp Leu Lys Ser Thr Gln Ala
115 120 125
Ala Ile Asp Gln Ile Asn Gly Lys Leu Asn Arg Val Ile Glu Lys Thr
130 135 140
Asn Glu Lys Phe His Gln Ile Glu Lys Glu Phe Ser Glu Val Glu Gly
145 150 155 160
Arg Ile Gln Asp Leu Glu Lys Tyr Val Glu Asp Thr Lys Ile Asp Leu
165 170 175
Trp Ser Tyr Asn Ala Glu Leu Leu Val Ala Leu Glu Asn Gln His Thr
180 185 190
Ile Asp Leu Thr Asp Ser Glu Met Asn Lys Leu Phe Glu Lys Thr Arg
195 200 205
Arg Gln Leu Arg Glu Asn Ala Glu Asp Met Gly Asn Gly Cys Phe Lys
210 215 220
Ile Tyr His Lys Cys Asp Asn Ala Cys Ile Gly Ser Ile Arg Asn Gly
225 230 235 240
Thr Tyr Asp His Asp Val Tyr Arg Asp Glu Ala Leu Asn Asn Arg Phe
245 250 255 Gln Ile Lys Gly Val Glu Leu Lys Ser Gly Tyr Lys Asp Trp Ile Leu
260 265 270
Trp Ile Ser Phe Ala Ile Ser Cys Phe Leu Leu Cys Val Val Leu Leu
275 280 285
Gly Phe Ile Met Trp Ala Cys Gln Lys Gly Asn Ile Arg Cys Asn Ile
290 295 300
Cys Ile
305
(2) INFORMATION FOR SEQ ID NO: 11:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 690 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
( ix ) FEATURE :
(A) NAME/KEY: CDS
(B) LOCATION: 1..690
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:
ATG GAT CCA AAC ACT GTG TCA AGC TTT CAG GTA GAT TGC TTT CTT TGG 48 Met Asp Pro Asn Thr Val Ser Ser Phe Gln Val Asp Cys Phe Leu Trp
1 5 10 15
CAT GTC CGC AAA CGA GTT GCA GAC CAA GAA CTA GGT GAT GCC CCA TTC 96 His Val Arg Lys Arg Val Ala Asp Gln Glu Leu Gly Asp Ala Pro Phe
20 25 30
CTT GAT CGG CTT CGC CGA GAT CAG AAA TCC CTA AGA GGA AGG GGC AGC 144 Leu Asp Arg Leu Arg Arg Asp Gln Lys Ser Leu Arg Gly Arg Gly Ser
35 40 45
ACT CTT GGT CTG GAC ATC GAG ACA GCC ACA CGT GCT GGA AAG CAG ATA 192 Thr Leu Gly Leu Asp Ile Glu Thr Ala Thr Arg Ala Gly Lys Gln Ile
50 55 60
GTG GAG CGG ATT CTG AAA GAA GAA TCC GAT GAG GCA CTT AAA ATG ACC 240 Val Glu Arg Ile Leu Lys Glu Glu Ser Asp Glu Ala Leu Lys Met Thr
65 70 75 80
ATG GAT CAT ATG TTA ATT CAG GAC CTC GAG AAA TAC GTT GAA GAC ACT 288 Met Asp His Met Leu Ile Gln Asp Leu Glu Lys Tyr Val Glu Asp Thr
85 90 95
AAA ATA GAT CTC TGG TCT TAC AAT GCG GAG CTT CTT GTC GCT CTG GAG 336 Lys Ile Asp Leu Trp Ser Tyr Asn Ala Glu Leu Leu Val Ala Leu Glu
100 105 110
AAC CAA CAT ACA ATT GAT CTG ACT GAC TCG GAA ATG AAC AAA CTG TTT 384 Asn Gln His Thr Ile Asp Leu Thr Asp Ser Glu Met Asn Lys Leu Phe
115 120 125
GAA AAA ACA AGG AGG CAA CTG AGG GAA AAT GCT GAG GAC ATG GGC AAT 432 Glu Lys Thr Arg Arg Gln Leu Arg Glu Asn Ala Glu Asp Met Gly Asn
130 135 140
GGT TGC TTC AAA ATA TAC CAC AAA TGT GAC AAT GCT TGC ATA GGG TCA 480 Gly Cys Phe Lys Ile Tyr His Lys Cys Asp Asn Ala Cys Ile Gly Ser
145 150 155 160
ATC AGA AAT GGG ACT TAT GAC CAT GAT GTA TAC AGA GAC GAA GCA TTA 528 Ile Arg Asn Gly Thr Tyr Asp His Asp Val Tyr Arg Asp Glu Ala Leu
165 170 175
AAC AAC CGG TTT CAG ATC AAA GGT GTT GAA CTG AAG TCA GGA TAC AAA 576 Asn Asn Arg Phe Gln Ile Lys Gly Val Glu Leu Lys Ser Gly Tyr Lys
180 185 190
GAC TGG ATC CTG TGG ATT TCC TTT GCC ATA TCA TGC TTT TTG CTT TGT 624 Asp Trp Ile Leu Trp Ile Ser Phe Ala Ile Ser Cys Phe Leu Leu Cys
195 200 205
GTT GTT TTG CTG GGG TTC ATC ATG TGG GCC TGC CAA AAA GGC AAC ATT 672 Val Val Leu Leu Gly Phe Ile Met Trp Ala Cys Gln Lys Gly Asn Ile
210 215 220
AGG TGC AAC ATT TGC ATT 690
Arg Cys Asn Ile Cys Ile
225 230
(2) INFORMATION FOR SEQ ID NO: 12:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 230 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:
Met Asp Pro Asn Thr Val Ser Ser Phe Gln Val Asp Cys Phe Leu Trp
1 5 10 15
His Val Arg Lys Arg Val Ala Asp Gln Glu Leu Gly Asp Ala Pro Phe
20 25 30
Leu Asp Arg Leu Arg Arg Asp Gln Lys Ser Leu Arg Gly Arg Gly Ser 35 40 45
Thr Leu Gly Leu Asp Ile Glu Thr Ala Thr Arg Ala Gly Lys Gln Ile 50 55 60
Val Glu Arg Ile Leu Lys Glu Glu Ser Asp Glu Ala Leu Lys Met Thr 65 70 75 80
Met Asp His Met Leu Ile Gln Asp Leu Glu Lys Tyr Val Glu Asp Thr
85 90 95 Lys Ile Asp Leu Trp Ser Tyr Asn Ala Glu Leu Leu Val Ala Leu Glu
100 105 110
Asn Gln His Thr Ile Asp Leu Thr Asp Ser Glu Met Asn Lys Leu Phe
115 120 125
Glu Lys Thr Arg Arg Gln Leu Arg Glu Asn Ala Glu Asp Met Gly Asn 130 135 140
Gly Cys Phe Lys Ile Tyr His Lys Cys Asp Asn Ala Cys Ile Gly Ser
145 150 155 160 Ile Arg Asn Gly Thr Tyr Asp His Asp Val Tyr Arg Asp Glu Ala Leu
165 170 175
Asn Asn Arg Phe Gln Ile Lys Gly Val Glu Leu Lys Ser Gly Tyr Lys
180 185 190
Asp Trp Ile Leu Trp Ile Ser Phe Ala Ile Ser Cys Phe Leu Leu Cys
195 200 205
Val Val Leu Leu Gly Phe Ile Met Trp Ala Cys Gln Lys Gly Asn Ile 210 215 220
Arg Cys Asn Ile Cys Ile
225 230
(2) INFORMATION FOR SEQ ID NO: 13:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 699 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..699
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:
ATG GAT CCA AAC ACT GTG TCA AGC TTT CAG GTA GAT TCC TTT CTT TGG 48 Met Asp Pro Asn Thr Val Ser Ser Phe Gln Val Asp Ser Phe Leu Trp
1 5 10 15
CAT GTC CGC AAA CGA GTT GCA GAC CAA GAA CTA GGT GAT GCC CCA TTC 96 His Val Arg Lys Arg Val Ala Asp Gln Glu Leu Gly Asp Ala Pro Phe
20 25 30
CTT GAT CGG CTT CGC CGA GAT CAG AAA TCC ATG CAT GGA TCA TAT GTT 144 Leu Asp Arg Leu Arg Arg Asp Gln Lys Ser Met His Gly Ser Tyr Val
35 40 45
AAC AAG ACA CAA GAA GCT ATA AAC AAG ATA ACA AAA AAT CTC AAC TAT 192 Asn Lys Thr Gln Glu Ala Ile Asn Lys Ile Thr Lys Asn Leu Asn Tyr
50 55 60
TTA AGT GAG CTA GAA GTA AAA AAC CTT CAA AGA CTA AGC GGA GCA ATG 240 Leu Ser Glu Leu Glu Val Lys Asn Leu Gln Arg Leu Ser Gly Ala Met
65 70 75 80
AAT GAG CTT CAC GAC GAA ATA CTC GAG CTA GAC GAA AAA GTG GAT GAT 288 Asn Glu Leu His Asp Glu Ile Leu Glu Leu Asp Glu Lys Val Asp Asp
85 90 95
CTA AGA GCT GAT ACA ATA AGC TCA CAA ATA GAG CTT GCA GTC TTG CTT 336 Leu Arg Ala Asp Thr Ile Ser Ser Gln Ile Glu Leu Ala Val Leu Leu
100 105 110
TCC AAC GAA GGG ATA ATA AAC AGT GAA GAT GAG CAT CTC TTG GCA CTT 384 Ser Asn Glu Gly Ile Ile Asn Ser Glu Asp Glu His Leu Leu Ala Leu
115 120 125
GAA AGA AAA CTG AAG AAA ATG CTT GGC CCC TCT GCT GTA GAA ATA GGG 432 Glu Arg Lys Leu Lys Lys Met Leu Gly Pro Ser Ala Val Glu Ile Gly
130 135 140
AAT GGG TGC TTT GAA ACC AAA CAC AAA TGC AAC CAG ACT TGC CTA GAC 480 Asn Gly Cys Phe Glu Thr Lys His Lys Cys Asn Gln Thr Cys Leu Asp
145 150 155 160
AGG ATA GCT GCT GGC ACC TTT AAT GCA GGA GAT TTT TCT CTT CCC ACT 528 Arg Ile Ala Ala Gly Thr Phe Asn Ala Gly Asp Phe Ser Leu Pro Thr
165 170 . 175
TTT GAT TCA TTA AAC ATT ACT GCT GCA TCT TTA AAT GAT GAT GGC TTG 576 Phe Asp Ser Leu Asn Ile Thr Ala Ala Ser Leu Asn Asp Asp Gly Leu
180 185 190
GAT AAT CAT ACT ATA CTG CTC TAC TAC TCA ACT GCT GCT TCT AGC TTG 624 Asp Asn His Thr Ile Leu Leu Tyr Tyr Ser Thr Ala Ala Ser Ser Leu
195 200 205
GCT GTA ACA TTA ATG ATA GCT ATC TTC ATT GTC TAC ATG GTC TCC AGA 672 Ala Val Thr Leu Met Ile Ala Ile Phe Ile Val Tyr Met Val Ser Arg
210 215 220
GAC AAT GTT TCT TGT TCC ATC TGT CTG 699
Asp Asn Val Ser Cys Ser Ile Cys Leu
225 230
(2) INFORMATION FOR SEQ ID NO: 14 :
( i ) SEQUENCE CHARACTERISTICS :
(A) LENGTH: 233 amino acids
( B ) TYPE : amino acid
(D ) TOPOLOGY: linear
( ii ) MOLECULE TYPE : protein
( xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 :
Met Asp Pro Asn Thr Val Ser Ser Phe Gln Val Asp Ser Phe Leu Trp
1 5 10 15
His Val Arg Lys Arg Val Ala Asp Gln Glu Leu Gly Asp Ala Pro Phe
20 25 30
Leu Asp Arg Leu Arg Arg Asp Gln Lys Ser Met His Gly Ser Tyr Val
35 40 45
Asn Lys Thr Gln Glu Ala Ile Asn Lys Ile Thr Lys Asn Leu Asn Tyr 50 55 60
Leu Ser Glu Leu Glu Val Lys Asn Leu Gln Arg Leu Ser Gly Ala Met 65 70 75 80
Asn Glu Leu His Asp Glu Ile Leu Glu Leu Asp Glu Lys Val Asp Asp
85 90 95
Leu Arg Ala Asp Thr Ile Ser Ser Gln Ile Glu Leu Ala Val Leu Leu
100 105 110
Ser Asn Glu Gly Ile Ile Asn Ser Glu Asp Glu His Leu Leu Ala Leu
115 120 125
Glu Arg Lys Leu Lys Lys Met Leu Gly Pro Ser Ala Val Glu Ile Gly 130 135 140
Asn Gly Cys Phe Glu Thr Lys His Lys Cys Asn Gln Thr Cys Leu Asp 145 150 155 160
Arg Ile Ala Ala Gly Thr Phe Asn Ala Gly Asp Phe Ser Leu Pro Thr
165 170 175
Phe Asp Ser Leu Asn Ile Thr Ala Ala Ser Leu Asn Asp Asp Gly Leu
180 185 190
Asp Asn His Thr Ile Leu Leu Tyr Tyr Ser Thr Ala Ala Ser Ser Leu
195 200 205
Ala Val Thr Leu Met Ile Ala Ile Phe Ile Val Tyr Met Val Ser Arg 210 215 220
Asp Asn Val Ser Cys Ser Ile Cys Leu
225 230
(2) INFORMATION FOR SEQ ID NO: 15:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 924 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
( ix ) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..921
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:
ATG GAT CCA AAC ACT GTG TCA AGC TTT CAG GTA GAT TGC TTT CTT TGG 48 Met Asp Pro Asn Thr Val Ser Ser Phe Gln Val Asp Cys Phe Leu Trp
1 5 10 15
CAT GTC CGC AAA CGA GTT GCA GAC CAA GAA CTA GGT GAT GCC CCA TTC 96 His Val Arg Lys Arg Val Ala Asp Gln Glu Leu Gly Asp Ala Pro Phe
20 25 30
CTT GAT CGG CTT CGC CGA GAT CAG AAA TCC CTA AGA GGA AGG GGC AGC 144 Leu Asp Arg Leu Arg Arg Asp Gln Lys Ser Leu Arg Gly Arg Gly Ser
35 40 45
ACT CTT GGT CTG GAC ATC GAG ACA GCC ACA CGT GCT GGA AAG CAG ATA 192 Thr Leu Gly Leu Asp Ile Glu Thr Ala Thr Arg Ala Gly Lys Gln Ile
50 55 60
GTG GAG CGG ATT CTG AAA GAA GAA TCC GAT GAG GCA CTT AAA ATG ACC 240 Val Glu Arg Ile Leu Lys Glu Glu Ser Asp Glu Ala Leu Lys Met Thr
65 70 75 80
ATG GAT CTG TCC AGA GGT CTA TTT GGA GCC ATT GCC GGT TTT ATT GAA 288 Met Asp Leu Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu
85 90 95
GGG GGA TGG ACT GGA ATG ATA GAT GGA TGG TAC GGT TAT CAT CAT CAG 336 Gly Gly Trp Thr Gly Met Ile Asp Gly Trp Tyr Gly Tyr His His Gln
100 105 110
AAT GAA CAG GGA TCA GGC TAT GCA GCG GAT CAA AAA AGC ACA CAA AAT 384 Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn
115 120 125
GCC ATT AAC GGG ATT ACA AAC AAG GTG AAC TCT GTT ATC GAG AAA ATG 432 Ala Ile Asn Gly Ile Thr Asn Lys Val Asn Ser Val Ile Glu Lys Met
130 135 140
AAC ATT CAA TTC ACA GCT GTG GGT AAA GAA TTC AAC AAA TTA GAA AAA 480 Asn Ile Gln Phe Thr Ala Val Gly Lys Glu Phe Asn Lys Leu Glu Lys
145 150 155 160
AGG ATG GAA AAT TTA AAT AAA AAA GTT GAT GAT GGA TTT CTG GAC ATT 528 Arg Met Glu Asn Leu Asn Lys Lys Val Asp Asp Gly Phe Leu Asp Ile
165 170 175
TGG ACA TAT AAT GCA GAA TTG TTA GTT CTA CTG GAA AAT GAA AGG ACT 576 Trp Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Glu Asn Glu Arg Thr
180 185 190
CTG GAT TTC CAT GAC TCA AAT GTG AAG AAT CTG TAT GAG AAA GTA AAA 624 Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr Glu Lys Val Lys
195 200 205
AGC CAA TTA AAG AAT AAT GCC AAA GAA ATC GGA AAT GGA TGT TTT GAG 672 Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly Cys Phe Glu
210 215 220
TTC TAC CAC AAG TGT GAC AAT GAA TGC ATG GAA AGT GTA AGA AAT GGG 720 Phe Tyr His Lys Cys Asp Asn Glu Cys Met Glu Ser Val Arg Asn Gly
225 230 235 240
ACT TAT GAT TAT CCC AAA TAT TCA GAA GAG TCA AAG TTG AAC AGG GAA 768 Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys Leu Asn Arg Glu
245 250 255
AAG GTA GAT GGA GTG AAA TTG GAA TCA ATG GGG ATC TAT CAG ATT CTG 816 Lys Val Asp Gly Val Lys Leu Glu Ser Met Gly Ile Tyr Gln Ile Leu
260 265 270
GCG ATC TAC TCA ACT GTC GCC AGT TCA CTG GTG CTT TTG GTC TCC CTG 864 Ala Ile Tyr Ser Thr Val Ala Ser Ser Leu Val Leu Leu Val Ser Leu
275 280 285
GGG GCA ATC AGT TTC TGG ATG TGT TCT AAT GGA TCT TTG CAG TGC AGA 912 Gly Ala Ile Ser Phe Trp Met Cys Ser Asn Gly Ser Leu Gln Cys Arg
290 295 300
ATA TGC ATC TGA 924 Ile Cys Ile
305
(2) INFORMATION FOR SEQ ID NO: 16:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 307 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:
Met Asp Pro Asn Thr Val Ser Ser Phe Gln Val Asp Cys Phe Leu Trp
1 5 10 15
His Val Arg Lys Arg Val Ala Asp Gln Glu Leu Gly Asp Ala Pro Phe
20 25 30
Leu Asp Arg Leu Arg Arg Asp Gln Lys Ser Leu Arg Gly Arg Gly Ser
35 40 45
Thr Leu Gly Leu Asp Ile Glu Thr Ala Thr Arg Ala Gly Lys Gln Ile
50 55 60
Val Glu Arg Ile Leu Lys Glu Glu Ser Asp Glu Ala Leu Lys Met Thr
65 70 75 80
Met Asp Leu Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu 85 90 95
Gly Gly Trp Thr Gly Met Ile Asp Gly Trp Tyr Gly Tyr His His Gln
100 105 110
Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn
115 120 125
Ala Ile Asn Gly Ile Thr Asn Lys Val Asn Ser Val Ile Glu Lys Met 130 135 140
Asn Ile Gln Phe Thr Ala Val Gly Lys Glu Phe Asn Lys Leu Glu Lys 145 150 155 160
Arg Met Glu Asn Leu Asn Lys Lys Val Asp Asp Gly Phe Leu Asp Ile
165 170 175
Trp Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Glu Asn Glu Arg Thr
180 185 190
Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr Glu Lys Val Lys
195 200 205
Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly Cys Phe Glu 210 215 220
Phe Tyr His Lys Cys Asp Asn Glu Cys Met Glu Ser Val Arg Asn Gly 225 230 235 240
Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys Leu Asn Arg Glu
245 250 255
Lys Val Asp Gly Val Lys Leu Glu Ser Met Gly Ile Tyr Gln Ile Leu
260 265 270
Ala Ile Tyr Ser Thr Val Ala Ser Ser Leu Val Leu Leu Val Ser Leu
275 280 285
Gly Ala Ile Ser Phe Trp Met Cys Ser Asn Gly Ser Leu Gln Cys Arg 290 295 300
Ile Cys Ile
305
(2) INFORMATION FOR SEQ ID NO: 17:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 729 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..726
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:
ATG GAT CCA AAC ACT GTG TCA AGC TTT CAG GTA GAT TGC TTT CTT TGG 48 Met Asp Pro Asn Thr Val Ser Ser Phe Gln Val Asp Cys Phe Leu Trp
1 5 10 15
CAT GTC CGC AAA CGA GTT GCA GAC CAA GAA CTA GGT GAT GCC CCA TTC 96 His Val Arg Lys Arg Val Ala Asp Gln Glu Leu Gly Asp Ala Pro Phe
20 25 30
CTT GAT CGG CTT CGC CGA GAT CAG AAA TCC CTA AGA GGA AGG GGC AGC 144 Leu Asp Arg Leu Arg Arg Asp Gln Lys Ser Leu Arg Gly Arg Gly Ser
35 40 45
ACT CTT GGT CTG GAC ATC GAG ACA GCC ACA CGT GCT GGA AAG CAG ATA 192 Thr Leu Gly Leu Asp Ile Glu Thr Ala Thr Arg Ala Gly Lys Gln Ile
50 55 60
GTG GAG CGG ATT CTG AAA GAA GAA TCC GAT GAG GCA CTT AAA ATG ACC 240 Val Glu Arg Ile Leu Lys Glu Glu Ser Asp Glu Ala Leu Lys Met Thr
65 70 75 80
ATG CAG ATC CCG GCT GTG GGT AAA GAA TTC AAC AAA TTA GAA AAA AGG 288 Met Gln Ile Pro Ala Val Gly Lys Glu Phe Asn Lys Leu Glu Lys Arg
85 90 95
ATG GAA AAT TTA AAT AAA AAA GTT GAT GAT GGA TTT CTG GAC ATT TGG 336 Met Glu Asn Leu Asn Lys Lys Val Asp Asp Gly Phe Leu Asp Ile Trp
100 105 110
ACA TAT AAT GCA GAA TTG TTA GTT CTA CTG GAA AAT GAA AGG ACT CTG 384 Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Glu Asn Glu Arg Thr Leu
115 120 125
GAT TTC CAT GAC TCA AAT GTG AAG AAT CTG TAT GAG AAA GTA AAA AGC 432 Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr Glu Lys Val Lys Ser
130 135 140
CAA TTA AAG AAT AAT GCC AAA GAA ATC GGA AAT GGA TGT TTT GAG TTC 480 Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly Cys Phe Glu Phe
145 150 155 160
TAC CAC AAG TGT GAC AAT GAA TGC ATG GAA AGT GTA AGA AAT GGG ACT 528 Tyr His Lys Cys Asp Asn Glu Cys Met Glu Ser Val Arg Asn Gly Thr
165 170 175
TAT GAT TAT CCC AAA TAT TCA GAA GAG TCA AAG TTG AAC AGG GAA AAG 576 Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys Leu Asn Arg Glu Lys
180 185 190
GTA GAT GGA GTG AAA TTG GAA TCA ATG GGG ATC TAT CAG ATT CTG GCG 624 Val Asp Gly Val Lys Leu Glu Ser Met Gly Ile Tyr Gln Ile Leu Ala
195 200 205
ATC TAC TCA ACT GTC GCC AGT TCA CTG GTG CTT TTG GTC TCC CTG GGG 672 Ile Tyr Ser Thr Val Ala Ser Ser Leu Val Leu Leu Val Ser Leu Gly
210 215 220
GCA ATC AGT TTC TGG ATG TGT TCT AAT GGA TCT TTG CAG TGC AGA ATA 720 Ala Ile Ser Phe Trp Met Cys Ser Asn Gly Ser Leu Gln Cys Arg Ile
225 230 235 240
TGC ATC TGA 729
Cys Ile
(2) INFORMATION FOR SEQ ID NO: 18:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 242 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:
Met Asp Pro Asn Thr Val Ser Ser Phe Gln Val Asp Cys Phe Leu Trp
1 5 10 15
His Val Arg Lys Arg Val Ala Asp Gln Glu Leu Gly Asp Ala Pro Phe
20 25 30
Leu Asp Arg Leu Arg Arg Asp Gln Lys Ser Leu Arg Gly Arg Gly Ser
35 40 45
Thr Leu Gly Leu Asp Ile Glu Thr Ala Thr Arg Ala Gly Lys Gln Ile
50 55 60
Val Glu Arg Ile Leu Lys Glu Glu Ser Asp Glu Ala Leu Lys Met Thr
65 70 75 80
Met Gln Ile Pro Ala Val Gly Lys Glu Phe Asn Lys Leu Glu Lys Arg
85 90 95
Met Glu Asn Leu Asn Lys Lys Val Asp Asp Gly Phe Leu Asp Ile Trp
100 105 110
Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Glu Asn Glu Arg Thr Leu
115 120 125
Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr Glu Lys Val Lys Ser
130 135 140
Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly Cys Phe Glu Phe
145 150 155 160
Tyr His Lys Cys Asp Asn Glu Cys Met Glu Ser Val Arg Asn Gly Thr
165 170 175
Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys Leu Asn Arg Glu Lys
180 185 190
Val Asp Gly Val Lys Leu Glu Ser Met Gly Ile Tyr Gln Ile Leu Ala
195 200 205
Ile Tyr Ser Thr Val Ala Ser Ser Leu Val Leu Leu Val Ser Leu Gly
210 215 220
Ala Ile Ser Phe Trp Met Cys Ser Asn Gly Ser Leu Gln Cys Arg Ile 225 230 235 240
Cys Ile
(2) INFORMATION FOR SEQ ID NO: 19:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 810 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..807
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:
ATG GAT CCA AAC ACT GTG TCA AGC TTT CAG GTA GAT TGC TTT CTT TGG 48 Met Asp Pro Asn Thr Val Ser Ser Phe Gln Val Asp Cys Phe Leu Trp
1 5 10 15
CAT GTC CGC AAA CGA GTT GCA GAC CAA GAA CTA GGT GAT GCC CCA TTC 96 His Val Arg Lys Arg Val Ala Asp Gln Glu Leu Gly Asp Ala Pro Phe
20 25 30
CTT GAT CGG CTT CGC CGA GAT CAG AAA TCC ATG GAT CTG TCC AGA GGT 144 Leu Asp Arg Leu Arg Arg Asp Gln Lys Ser Met Asp Leu Ser Arg Gly
35 40 45
CTA TTT GGA GCC ATT GCC GGT TTT ATT GAA GGG GGA TGG ACT GGA ATG 192 Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly Gly Trp Thr Gly Met
50 55 60
ATA GAT GGA TGG TAC GGT TAT CAT CAT CAG AAT GAA CAG GGA TCA GGC 240 Ile Asp Gly Trp Tyr Gly Tyr His His Gln Asn Glu Gln Gly Ser Gly
65 70 75 80
TAT GCA GCG GAT CAA AAA AGC ACA CAA AAT GCC ATT AAC GGG ATT ACA 288 Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala Ile Asn Gly Ile Thr
85 90 95
AAC AAG GTG AAC TCT GTT ATC GAG AAA ATG AAC ATT CAA TTC ACA GCT 336 Asn Lys Val Asn Ser Val Ile Glu Lys Met Asn Ile Gln Phe Thr Ala
100 105 110
GTG GGT AAA GAA TTC AAC AAA TTA GAA AAA AGG ATG GAA AAT TTA AAT 384 Val Gly Lys Glu Phe Asn Lys Leu Glu Lys Arg Met Glu Asn Leu Asn
115 120 125
AAA AAA GTT GAT GAT GGA TTT CTG GAC ATT TGG ACA TAT AAT GCA GAA 432 Lvs Lys Val Asp Asp Gly Phe Leu Asp Ile Trp Thr Tyr Asn Ala Glu
130 135 140
TTG TTA GTT CTA CTG GAA AAT GAA AGG ACT CTG GAT TTC CAT GAC TCA 480 Leu Leu Val Leu Leu Glu Asn Glu Arg Thr Leu Asp Phe His Asp Ser
145 150 155 160
AAT GTG AAG AAT CTG TAT GAG AAA GTA AAA AGC CAA TTA AAG AAT AAT 528 Asn Val Lys Asn Leu Tyr Glu Lys Val Lys Ser Gln Leu Lys Asn Asn
165 170 175
GCC AAA GAA ATC GGA AAT GGA TGT TTT GAG TTC TAC CAC AAG TGT GAC 576 Ala Lys Glu Ile Gly Asn Gly Cys Phe Glu Phe Tyr His Lys Cys Asp
180 185 190
AAT GAA TGC ATG GAA AGT GTA AGA AAT GGG ACT TAT GAT TAT CCC AAA 624 Asn Glu Cys Met Glu Ser Val Arg Asn Gly Thr Tyr Asp Tyr Pro Lys
195 200 205
TAT TCA GAA GAG TCA AAG TTG AAC AGG GAA AAG GTA GAT GGA GTG AAA 672 Tyr Ser Glu Glu Ser Lys Leu Asn Arg Glu Lys Val Asp Gly Val Lys
210 215 220
TTG GAA TCA ATG GGG ATC TAT CAG ATT CTG GCG ATC TAC TCA ACT GTC 720 Leu Glu Ser Met Gly Ile Tyr Gln Ile Leu Ala Ile Tyr Ser Thr Val
225 230 235 240
GCC AGT TCA CTG GTG CTT TTG GTC TCC CTG GGG GCA ATC AGT TTC TGG 768 Ala Ser Ser Leu Val Leu Leu Val Ser Leu Gly Ala Ile Ser Phe Trp
245 250 255
ATG TGT TCT AAT GGA TCT TTG CAG TGC AGA ATA TGC ATC TGA 810
Met Cys Ser Asn Gly Ser Leu Gln Cys Arg Ile Cys Ile
260 265
(2) INFORMATION FOR SEQ ID NO: 20:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 269 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:
Met Asp Pro Asn Thr Val Ser Ser Phe Gln Val Asp Cys Phe Leu Trp
1 5 10 15
His Val Arg Lys Arg Val Ala Asp Gln Glu Leu Gly Asp Ala Pro Phe
20 25 30
Leu Asp Arg Leu Arg Arg Asp Gln Lys Ser Met Asp Leu Ser Arg Gly
35 40 45
Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly Gly Trp Thr Gly Met
50 55 60
Ile Asp Gly Trp Tyr Gly Tyr His His Gln Asn Glu Gln Gly Ser Gly
65 70 75 80
Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala Ile Asn Gly Ile Thr
85 90 95
Asn Lys Val Asn Ser Val Ile Glu Lys Met Asn Ile Gln Phe Thr Ala
100 105 110
Val Gly Lys Glu Phe Asn Lys Leu Glu Lys Arg Met Glu Asn Leu Asn 115 120 125
Lys Lys Val Asp Asp Gly Phe Leu Asp Ile Trp Thr Tyr Asn Ala Glu
130 135 140
Leu Leu Val Leu Leu Glu Asn Glu Arg Thr Leu Asp Phe His Asp Ser
145 150 155 160
Asn Val Lys Asn Leu Tyr Glu Lys Val Lys Ser Gln Leu Lys Asn Asn
165 170 175
Ala Lys Glu Ile Gly Asn Gly Cys Phe Glu Phe Tyr His Lys Cys Asp
180 185 190
Asn Glu Cys Met Glu Ser Val Arg Asn Gly Thr Tyr Asp Tyr Pro Lys
195 200 205
Tyr Ser Glu Glu Ser Lys Leu Asn Arg Glu Lys Val Asp Gly Val Lys
210 215 220
Leu Glu Ser Met Gly Ile Tyr Gln Ile Leu Ala Ile Tyr Ser Thr Val
225 230 235 240
Ala Ser Ser Leu Val Leu Leu Val Ser Leu Gly Ala Ile Ser Phe Trp
245 250 255
Met Cys Ser Asn Gly Ser Leu Gln Cys Arg Ile Cys Ile
260 265
(2) INFORMATION FOR SEQ ID NO: 21:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 630 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE :
(A) NAME/KEY: CDS
(B) LOCATION: 1..627
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:
ATG GAT CCA AAC ACT GTG TCA AGC TTT CAG GTA GAT TGC TTT CTT TGG 48 Met Asp Pro Asn Thr Val Ser Ser Phe Gln Val Asp Cys Phe Leu Trp
1 5 10 15
CAT GTC CGC AAA CGA GTT GCA GAC CAA GAA CTA GGT GAT GCC CCA TTC 96 His Val Arg Lys Arg Val Ala Asp Gln Glu Leu Gly Asp Ala Pro Phe
20 25 30
CTT GAT CGG CTT CGC CGA GAT CAG AAA TCC ATG GAT CAT ATG TTA ACA 144 Leu Asp Arg Leu Arg Arg Asp Gln Lys Ser Met Asp His Met Leu Thr
35 40 45
AGT ACT CGA TCT GTG GGT AAA GAA TTC AAC AAA TTA GAA AAA AGG ATG 192 Ser Thr Arg Ser Val Gly Lys Glu Phe Asn Lys Leu Glu Lys Arg Met
50 55 60
GAA AAT TTA AAT AAA AAA GTT GAT GAT GGA TTT CTG GAC ATT TGG ACA 240 Glu Asn Leu Asn Lys Lys Val Asp Asp Gly Phe Leu Asp Ile Trp Thr
65 70 75 80
TAT AAT GCA GAA TTG TTA GTT CTA CTG GAA AAT GAA AGG ACT CTG GAT 288 Tyr Asn Ala Glu Leu Leu Val Leu Leu Glu Asn Glu Arg Thr Leu Asp
85 90 95
TTC CAT GAC TCA AAT GTG AAG AAT CTG TAT GAG AAA GTA AAA AGC CAA 336 Phe His Asp Ser Asn Val Lys Asn Leu Tyr Glu Lys Val Lys Ser Gln
100 105 110
TTA AAG AAT AAT GCC AAA GAA ATC GGA AAT GGA TGT TTT GAG TTC TAC 384 Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly Cys Phe Glu Phe Tyr
115 120 125
CAC AAG TGT GAC AAT GAA TGC ATG GAA AGT GTA AGA AAT GGG ACT TAT 432 His Lys Cys Asp Asn Glu Cys Met Glu Ser Val Arg Asn Gly Thr Tyr
130 135 140
GAT TAT CCC AAA TAT TCA GAA GAG TCA AAG TTG AAC AGG GAA AAG GTA 480 Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys Leu Asn Arg Glu Lys Val
145 150 155 160
GAT GGA GTG AAA TTG GAA TCA ATG GGG ATC TAT CAG ATT CTG GCG ATC 528 Asp Gly Val Lys Leu Glu Ser Met Gly Ile Tyr Gln Ile Leu Ala Ile
165 170 175
TAC TCA ACT GTC GCC AGT TCA CTG GTG CTT TTG GTC TCC CTG GGG GCA 576 Tyr Ser Thr Val Ala Ser Ser Leu Val Leu Leu Val Ser Leu Gly Ala
180 185 190
ATC AGT TTC TGG ATG TGT TCT AAT GGA TCT TTG CAG TGC AGA ATA TGC 624 Ile Ser Phe Trp Met Cys Ser Asn Gly Ser Leu Gln Cys Arg Ile Cys
195 200 205
ATC TGA 630 Ile
(2) INFORMATION FOR SEQ ID NO:22:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 209 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:
Met Asp Pro Asn Thr Val Ser Ser Phe Gln Val Asp Cys Phe Leu Trp
1 5 10 15
His Val Arg Lys Arg Val Ala Asp Gln Glu Leu Gly Asp Ala Pro Phe
20 25 30
Leu Asp Arg Leu Arg Arg Asp Gln Lys Ser Met Asp His Met Leu Thr
35 40 45
Ser Thr Arg Ser Val Gly Lys Glu Phe Asn Lys Leu Glu Lys Arg Met
50 55 60
Glu Asn Leu Asn Lys Lys Val Asp Asp Gly Phe Leu Asp Ile Trp Thr 65 70 75 80
Tyr Asn Ala Glu Leu Leu Val Leu Leu Glu Asn Glu Arg Thr Leu Asp
85 90 95
Phe His Asp Ser Asn Val Lys Asn Leu Tyr Glu Lys Val Lys Ser Gln
100 105 110
Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly Cys Phe Glu Phe Tyr
115 120 125
His Lys Cys Asp Asn Glu Cys Met Glu Ser Val Arg Asn Gly Thr Tyr
130 135 140
Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys Leu Asn Arg Glu Lys Val
145 150 155 160
Asp Gly Val Lys Leu Glu Ser Met Gly Ile Tyr Gln Ile Leu Ala Ile
165 170 175
Tyr Ser Thr Val Ala Ser Ser Leu Val Leu Leu Val Ser Leu Gly Ala
180 185 190
Ile Ser Phe Trp Met Cys Ser Asn Gly Ser Leu Gln Cys Arg Ile Cys
195 200 205
Ile
(2) INFORMATION FOR SEQ ID NO: 23:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 717 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..714
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:
ATG GAT CCA AAC ACT GTG TCA AGC TTT CAG GTA GAT TGC TTT CTT TGG 48 Met Asp Pro Asn Thr Val Ser Ser Phe Gln Val Asp Cys Phe Leu Trp
1 5 10 15
CAT GTC CGC AAA CGA GTT GCA GAC CAA GAA CTA GGT GAT GCC CCA TTC 96 His Val Arg Lys Arg Val Ala Asp Gln Glu Leu Gly Asp Ala Pro Phe
20 25 30
CTT GAT CGG CTT CGC CGA GAT CAG AAA TCC CTA AGA GGA AGG GGC AGC 144 Leu Asp Arg Leu Arg Arg Asp Gln Lys Ser Leu Arg Gly Arg Gly Ser
35 40 45
ACT CTT GGT CTG GAC ATC GAG ACA GCC ACA CGT GCT GGA AAG CAG ATA 192 Thr Leu Gly Leu Asp Ile Glu Thr Ala Thr Arg Ala Gly Lys Gln Ile
50 55 60
GTG GAG CGG ATT CTG AAA GAA GAA TCC GAT GAG GCA CTT AAA ATG ACC 240 Val Glu Arg Ile Leu Lys Glu Glu Ser Asp Glu Ala Leu Lys Met Thr
65 70 75 80
ATG CAG ATC CCG GAA TTC AAC AAA TTA GAA AAA AGG ATG GAA AAT TTA 288 Met Gln Ile Pro Glu Phe Asn Lys Leu Glu Lys Arg Met Glu Asn Leu
85 90 95
AAT AAA AAA GTT GAT GAT GGA TTT CTG GAC ATT TGG ACA TAT AAT GCA 336 Asn Lys Lys Val Asp Asp Gly Phe Leu Asp Ile Trp Thr Tyr Asn Ala
100 105 110
GAA TTG TTA GTT CTA CTG GAA AAT GAA AGG ACT CTG GAT TTC CAT GAC 384 Glu Leu Leu Val Leu Leu Glu Asn Glu Arg Thr Leu Asp Phe His Asp
115 120 125
TCA AAT GTG AAG AAT CTG TAT GAG AAA GTA AAA AGC CAA TTA AAG AAT 432 Ser Asn Val Lys Asn Leu Tyr Glu Lys Val Lys Ser Gln Leu Lys Asn
130 135 140
AAT GCC AAA GAA ATC GGA AAT GGA TGT TTT GAG TTC TAC CAC AAG TGT 480 Asn Ala Lys Glu Ile Gly Asn Gly Cys Phe Glu Phe Tyr His Lys Cys
145 150 155 160
GAC AAT GAA TGC ATG GAA AGT GTA AGA AAT GGG ACT TAT GAT TAT CCC 528 Asp Asn Glu Cys Met Glu Ser Val Arg Asn Gly Thr Tyr Asp Tyr Pro
165 170 175
AAA TAT TCA GAA GAG TCA AAG TTG AAC AGG GAA AAG GTA GAT GGA GTG 576 Lys Tyr Ser Glu Glu Ser Lys Leu Asn Arg Glu Lys Val Asp Gly Val
180 185 190
AAA TTG GAA TCA ATG GGG ATC TAT CAG ATT CTG GCG ATC TAC TCA ACT 624 Lys Leu Glu Ser Met Gly Ile Tyr Gln Ile Leu Ala Ile Tyr Ser Thr
195 200 205
GTC GCC AGT TCA CTG GTG CTT TTG GTC TCC CTG GGG GCA ATC AGT TTC 672 Val Ala Ser Ser Leu Val Leu Leu Val Ser Leu Gly Ala Ile Ser Phe
210 215 220
TGG ATG TGT TCT AAT GGA TCT TTG CAG TGC AGA ATA TGC ATC 714
Trp Met Cys Ser Asn Gly Ser Leu Gln Cys Arg Ile Cys Ile
225 230 235
TGA 717
(2) INFORMATION FOR SEQ ID NO: 24:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 238 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:
Met Asp Pro Asn Thr Val Ser Ser Phe Gln Val Asp Cys Phe Leu Trp
1 5 10 15
His Val Arg Lys Arg Val Ala Asp Gln Glu Leu Gly Asp Ala Pro Phe
20 25 30
Leu Asp Arg Leu Arg Arg Asp Gln Lys Ser Leu Arg Gly Arg Gly Ser 35 40 45
Thr Leu Gly Leu Asp Ile Glu Thr Ala Thr Arg Ala Gly Lys Gln Ile 50 55 60
Val Glu Arg Ile Leu Lys Glu Glu Ser Asp Glu Ala Leu Lys Met Thr
65 70 75 80
Met Gln Ile Pro Glu Phe Asn Lys Leu Glu Lys Arg Met Glu Asn Leu
85 90 95
Asn Lys Lys Val Asp Asp Gly Phe Leu Asp Ile Trp Thr Tyr Asn Ala
100 105 110
Glu Leu Leu Val Leu Leu Glu Asn Glu Arg Thr Leu Asp Phe His Asp
115 120 125
Ser Asn Val Lys Asn Leu Tyr Glu Lys Val Lys Ser Gln Leu Lys Asn
130 135 140
Asn Ala Lys Glu Ile Gly Asn Gly Cys Phe Glu Phe Tyr His Lys Cys 145 150 155 160
Asp Asn Glu Cys Met Glu Ser Val Arg Asn Gly Thr Tyr Asp Tyr Pro
165 170 175
Lys Tyr Ser Glu Glu Ser Lys Leu Asn Arg Glu Lys Val Asp Gly Val
180 185 190
Lys Leu Glu Ser Met Gly Ile Tyr Gln Ile Leu Ala Ile Tyr Ser Thr
195 200 205
Val Ala Ser Ser Leu Val Leu Leu Val Ser Leu Gly Ala Ile Ser Phe 210 215 220
Trp Met Cys Ser Asn Gly Ser Leu Gln Cys Arg Ile Cys Ile
225 230 235
(2) INFORMATION FOR SEQ ID NO: 25:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 681 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..678
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:
ATG GAT CCA AAC ACT GTG TCA AGC TTT CAG GTA GAT TGC TTT CTT TGG 48 Met Asp Pro Asn Thr Val Ser Ser Phe Gln Val Asp Cys Phe Leu Trp
1 5 10 15
CAT GTC CGC AAA CGA GTT GCA GAC CAA GAA CTA GGT GAT GCC CCA TTC 96 His Val Arg Lys Arg Val Ala Asp Gln Glu Leu Gly Asp Ala Pro Phe
20 25 30
CTT GAT CGG CTT CGC CGA GAT CAG AAA TCC CTA AGA GGA AGG GGC AGC 144 Leu Asp Arg Leu Arg Arg Asp Gln Lys Ser Leu Arg Gly Arg Gly Ser
35 40 45
ACT CTT GGT CTG GAC ATC GAG ACA GCC ACA CGT GCT GGA AAG CAG ATA 192 Thr Leu Gly Leu Asp Ile Glu Thr Ala Thr Arg Ala Gly Lys Gln Ile
50 55 60
GTG GAG CGG ATT CTG AAA GAA GAA TCC GAT GAG GCA CTT AAA ATG ACC 240 Val Glu Arg Ile Leu Lys Glu Glu Ser Asp Glu Ala Leu Lys Met Thr
65 70 75 80
ATG CAG ATC CCG AAT AAA AAA GTT GAT GAT GGA TTT CTG GAC ATT TGG 288 Met Gln Ile Pro Asn Lys Lys Val Asp Asp Gly Phe Leu Asp Ile Trp
85 90 95
ACA TAT AAT GCA GAA TTG TTA GTT CTA CTG GAA AAT GAA AGG ACT CTG 336 Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Glu Asn Glu Arg Thr Leu
100 105 110
GAT TTC CAT GAC TCA AAT GTG AAG AAT CTG TAT GAG AAA GTA AAA AGC 384 Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr Glu Lys Val Lys Ser
115 120 125
CAA TTA AAG AAT AAT GCC AAA GAA ATC GGA AAT GGA TGT TTT GAG TTC 432 Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly Cys Phe Glu Phe
130 135 140
TAC CAC AAG TGT GAC AAT GAA TGC ATG GAA AGT GTA AGA AAT GGG ACT 480 Tyr His Lys Cys Asp Asn Glu Cys Met Glu Ser Val Arg Asn Gly Thr
145 150 155 160
TAT GAT TAT CCC AAA TAT TCA GAA GAG TCA AAG TTG AAC AGG GAA AAG 528 Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys Leu Asn Arg Glu Lys
165 170 175
GTA GAT GGA GTG AAA TTG GAA TCA ATG GGG ATC TAT CAG ATT CTG GCG 576 Val Asp Gly Val Lys Leu Glu Ser Met Gly Ile Tyr Gln Ile Leu Ala
180 185 190
ATC TAC TCA ACT GTC GCC AGT TCA CTG GTG CTT TTG GTC TCC CTG GGG 624 Ile Tyr Ser Thr Val Ala Ser Ser Leu Val Leu Leu Val Ser Leu Gly
195 200 205
GCA ATC AGT TTC TGG ATG TGT TCT AAT GGA TCT TTG CAG TGC AGA ATA 672 Ala Ile Ser Phe Trp Met Cys Ser Asn Gly Ser Leu Gln Cys Arg Ile
210 215 220
TGC ATC TGA 681
Cys Ile
225
(2) INFORMATION FOR SEQ ID NO: 26:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 226 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26:
Met Asp Pro Asn Thr Val Ser Ser Phe Gln Val Asp Cys Phe Leu Trp
1 5 10 15
His Val Arg Lys Arg Val Ala Asp Gln Glu Leu Gly Asp Ala Pro Phe
20 25 30
Leu Asp Arg Leu Arg Arg Asp Gln Lys Ser Leu Arg Gly Arg Gly Ser
35 40 45
Thr Leu Gly Leu Asp Ile Glu Thr Ala Thr Arg Ala Gly Lys Gln Ile 50 55 60
Val Glu Arg Ile Leu Lys Glu Glu Ser Asp Glu Ala Leu Lys Met Thr 65 70 75 80
Met Gln Ile Pro Asn Lys Lys Val Asp Asp Gly Phe Leu Asp Ile Trp
85 90 95
Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Glu Asn Glu Arg Thr Leu
100 105 110
Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr Glu Lys Val Lys Ser
115 120 125
Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly Cys Phe Glu Phe 130 135 140
Tyr His Lys Cys Asp Asn Glu Cys Met Glu Ser Val Arg Asn Gly Thr
145 150 155 160
Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys Leu Asn Arg Glu Lys
165 170 175
Val Asp Gly Val Lys Leu Glu Ser Met Gly Ile Tyr Gln Ile Leu Ala
180 185 190
Ile Tyr Ser Thr Val Ala Ser Ser Leu Val Leu Leu Val Ser Leu Gly
195 200 205
Ala Ile Ser Phe Trp Met Cys Ser Asn Gly Ser Leu Gln Cys Arg Ile 210 215 220
Cys Ile
225
(2) INFORMATION FOR SEQ ID NO: 27:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 158 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:
Met Asp Pro Asn Thr Val Ser Ser Phe Gln Val Asp Cys Phe Leu Trp 1 5 10 15
His Val Arg Lys Arg Val Ala Asp Gln Glu Leu Gly Asp Ala Pro Phe
20 25 30
Leu Asp Arg Leu Arg Arg Asp Gln Lys Ser Leu Arg Gly Arg Gly Ser
35 40 45
Thr Leu Gly Leu Asp Ile Glu Thr Ala Thr Arg Ala Gly Lys Gln Ile 50 55 60
Val Glu Arg Ile Leu Lys Glu Glu Ser Asp Glu Ala Leu Lys Met Thr 65 70 75 80
Met Gln Ile Pro Val Glu Ser Val Arg Asn Gly Thr Tyr Asp Tyr Pro
85 90 95
Lys Tyr Ser Glu Glu Ser Lys Leu Asn Arg Glu Lys Val Asp Gly Val
100 105 110
Lys Leu Glu Ser Met Gly Ile Tyr Gln Ile Leu Ala Ile Tyr Ser Thr
115 120 125
Val Ala Ser Ser Leu Val Leu Leu Val Ser Leu Gly Ala Ile Ser Phe 130 135 140
Trp Met Cys Ser Asn Gly Ser Leu Gln Cys Arg Ile Cys Ile
145 150 155
(2) INFORMATION FOR SEQ ID NO: 28:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 163 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:
Met Asp Pro Asn Thr Val Ser Ser Phe Gln Val Asp Cys Phe Leu Trp
1 5 10 15
His Val Arg Lys Arg Val Ala Asp Gln Glu Leu Gly Asp Ala Pro Phe
20 25 30
Leu Asp Arg Leu Arg Arg Asp Gln Lys Ser Leu Arg Gly Arg Gly Ser
35 40 45
Thr Leu Gly Leu Asp Ile Glu Thr Ala Thr Arg Ala Gly Lys Gln Ile 50 55 60
Val Glu Arg Ile Leu Lys Glu Glu Ser Asp Glu Ala Leu Lys Met Thr 65 70 75 80
Met Asp Leu Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu
85 90 95
Gly Gly Trp Thr Gly Met Ile Asp Gly Trp Tyr Gly Tyr His His Gln
100 105 110
Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn
115 120 125
Ala Ile Asn Gly Ile Thr Asn Lys Val Asn Ser Val Ile Glu Lys Met 130 135 140
Asn Ile Gln Phe Thr Ala Val Gly Lys Glu Phe Ser Cys Leu Thr Ala 145 150 155 160
Tyr His Arg
(2) INFORMATION FOR SEQ ID NO:29:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 231 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:
Met Asp Pro Asn Thr Val Ser Ser Phe Gln Val Asp Cys Phe Leu Trp
1 5 10 15
His Val Arg Lys Arg Val Ala Asp Gln Glu Leu Gly Asp Ala Pro Phe
20 25 30
Leu Asp Arg Leu Arg Arg Asp Gln Lys Ser Leu Arg Gly Arg Gly Ser
35 40 45
Thr Leu Gly Leu Asp Ile Glu Thr Ala Thr Arg Ala Gly Lys Gln Ile 50 55 60
Val Glu Arg Ile Leu Lys Glu Glu Ser Asp Glu Ala Leu Lys Met Thr 65 70 75 80
Met Gln Ile Pro Ala Val Gly Lys Glu Phe Asn Lys Leu Glu Lys Arg
85 90 95
Met Glu Asn Leu Asn Lys Lys Val Asp Asp Gly Phe Leu Asp Ile Trp
100 105 110
Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Glu Asn Glu Arg Thr Leu
115 120 125
Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr Glu Lys Val Lys Ser 130 135 140
Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly Cys Phe Glu Phe 145 150 155 160
Tyr His Lys Cys Asp Asn Glu Cys Met Glu Ser Val Arg Asn Gly Thr
165 170 175
Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys Leu Asn Arg Glu Lys
180 185 190
Val Asp Gly Val Lys Leu Glu Ser Met Gly Ile Tyr Gln Ile Leu Ala
195 200 205 Ile Tyr Ser Thr Val Ala Ser Ser Gly Gly Ser Tyr Ser Met Glu His 210 215 220
Phe Arg Trp Gly Lys Pro Val
225 230
(2) INFORMATION FOR SEQ ID NO: 30:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 225 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30:
Met Asp Pro Asn Thr Val Ser Ser Phe Gln Val Asp Cys Phe Leu Trp 1 5 10 15
His Val Arg Lys Arg Val Ala Asp Gln Glu Leu Gly Asp Ala Pro Phe
20 25 30
Leu Asp Arg Leu Arg Arg Asp Gln Lys Ser Leu Arg Gly Arg Gly Ser
35 40 45
Thr Leu Gly Leu Asp Ile Glu Thr Ala Thr Arg Ala Gly Lys Gln Ile 50 55 60
Val Glu Arg Ile Leu Lys Glu Glu Ser Asp Glu Ala Leu Lys Met Thr 65 70 75 80
Met Gln Ile Pro Ala Val Gly Lys Glu Phe Asn Lys Leu Glu Lys Arg
85 90 95
Met Glu Asn Leu Asn Lys Lys Val Asp Asp Gly Phe Leu Asp Ile Trp
100 105 110
Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Glu Asn Glu Arg Thr Leu
115 120 125
Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr Glu Lys Val Lys Ser 130 135 140
Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly Cys Phe Glu Phe 145 150 155 160
Tyr His Lys Cys Asp Asn Glu Cys Met Glu Ser Val Arg Asn Gly Thr
165 170 175
Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys Leu Asn Arg Glu Lys
180 185 190
Val Asp Gly Val Lys Leu Glu Ser Met Gly Ile Tyr Gln Ile Leu Ala
195 200 205
Ile Tyr Ser Thr Val Ala Ser Ser Gly Gly Ser Tyr Ser Met Leu Val
210 215 220
Asn
225
(2) INFORMATION FOR SEQ ID NO: 31:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 912 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE :
(A) NAME/KEY: CDS
(B) LOCATION: 1..912
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:
ATG GAT CCA AAC ACT GTG TCA AGC TTT CAG GTA GAT TGC TTT CTT TGG 48 Met Asp Pro Asn Thr Val Ser Ser Phe Gln Val Asp Cys Phe Leu Trp
1 5 10 15
CAT GTC CGC AAA CGA GTT GCA GAC CAA GAA CTA GGT GAT GCC CCA TTC 96 His Val Arg Lys Arg Val Ala Asp Gln Glu Leu Gly Asp Ala Pro Phe
20 25 30
CTT GAT CGG CTT CGC CGA GAT CAG AAA TCC CTA AGA GGA AGG GGC AGC 144 Leu Asp Arg Leu Arg Arg Asp Gln Lys Ser Leu Arg Gly Arg Gly Ser
35 40 45
ACT CTT GGT CTG GAC ATC GAG ACA GCC ACA CGT GCT GGA AAG CAG ATA 192 Thr Leu Gly Leu Asp Ile Glu Thr Ala Thr Arg Ala Gly Lys Gln Ile
50 55 60
GTG GAG CGG ATT CTG AAA GAA GAA TCC GAT GAG GCA CTT AAA ATG ACC 240 Val Glu Arg Ile Leu Lys Glu Glu Ser Asp Glu Ala Leu Lys Met Thr
65 70 75 80
ATG CAG ATC CCG GGT CTA TTT GGA GCC ATT GCC GGT TTT ATT GAA GGG 288 Met Gln Ile Pro Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
85 90 95
GGA TGG ACT GGA ATG ATA GAT GGA TGG TAC GGT TAT CAT CAT CAG AAT 336 Gly Trp Thr Gly Met Ile Asp Gly Trp Tyr Gly Tyr His His Gln Asn
100 105 110
GAA CAG GGA TCA GGC TAT GCA GCG GAT CAA AAA AGC ACA CAA AAT GCC 384 Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
115 120 125
ATT AAC GGG ATT ACA AAC AAG GTG AAC TCT GTT ATC GAG AAA ATG AAC 432 Ile Asn Gly Ile Thr Asn Lys Val Asn Ser Val Ile Glu Lys Met Asn
130 135 140
ATT CAA TTC ACA GCT GTG GGT AAA GAA TTC AAC AAA TTA GAA AAA AGG 480 Ile Gln Phe Thr Ala Val Gly Lys Glu Phe Asn Lys Leu Glu Lys Arg
145 150 155 160
ATG GAA AAT TTA AAT AAA AAA GTT GAT GAT GGA TTT CTG GAC ATT TGG 528 Met Glu Asn Leu Asn Lys Lys Val Asp Asp Gly Phe Leu Asp Ile Trp
165 170 175
ACA TAT AAT GCA GAA TTG TTA GTT CTA CTG GAA AAT GAA AGG ACT CTG 576 Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Glu Asn Glu Arg Thr Leu
180 185 190
GAT TTC CAT GAC TCA AAT GTG AAG AAT CTG TAT GAG AAA GTA AAA AGC 624 Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr Glu Lys Val Lys Ser
195 200 205
CAA TTA AAG AAT AAT GCC AAA GAA ATC GGA AAT GGA TGT TTT GAG TTC 672 Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly Cys Phe Glu Phe
210 215 220
TAC CAC AAG TGT GAC AAT GAA TGC ATG GAA AGT GTA AGA AAT GGG ACT 720 Tyr His Lys Cys Asp Asn Glu Cys Met Glu Ser Val Arg Asn Gly Thr
225 230 235 240
TAT GAT TAT CCC AAA TAT TCA GAA GAG TCA AAG TTG AAC AGG GAA AAG 768 Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys Leu Asn Arg Glu Lys
245 250 255
GTA GAT GGA GTG AAA TTG GAA TCA ATG GGG ATC TAT CAG ATT CTG GCG 816 Val Asp Gly Val Lys Leu Glu Ser Met Gly Ile Tyr Gln Ile Leu Ala
260 265 270
ATC TAC TCA ACT GTC GCC AGT TCA CTG GTG CTT TTG GTC TCC CTG GGG 864 Ile Tyr Ser Thr Val Ala Ser Ser Leu Val Leu Leu Val Ser Leu Gly
275 280 285
GCA ATC AGT TTC TGG ATG TGT TCT AAT GGA TCT TTG CAG TGC AGA ATA 912 Ala Ile Ser Phe Trp Met Cys Ser Asn Gly Ser Leu Gln Cys Arg Ile
290 295 300
(2) INFORMATION FOR SEQ ID NO: 32:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 304 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:
Met Asp Pro Asn Thr Val Ser Ser Phe Gln Val Asp Cys Phe Leu Trp
1 5 10 15
His Val Arg Lys Arg Val Ala Asp Gln Glu Leu Gly Asp Ala Pro Phe
20 25 30
Leu Asp Arg Leu Arg Arg Asp Gln Lys Ser Leu Arg Gly Arg Gly Ser
35 40 45
Thr Leu Gly Leu Asp Ile Glu Thr Ala Thr Arg Ala Gly Lys Gln Ile 50 55 60
Val Glu Arg Ile Leu Lys Glu Glu Ser Asp Glu Ala Leu Lys Met Thr 65 70 75 80
Met Gln Ile Pro Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
85 90 95
Gly Trp Thr Gly Met Ile Asp Gly Trp Tyr Gly Tyr His His Gln Asn
100 105 110
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
115 120 125
Ile Asn Gly Ile Thr Asn Lys Val Asn Ser Val Ile Glu Lys Met Asn 130 135 140
Ile Gln Phe Thr Ala Val Gly Lys Glu Phe Asn Lys Leu Glu Lys Arg 145 150 155 160
Met Glu Asn Leu Asn Lys Lys Val Asp Asp Gly Phe Leu Asp Ile Trp
165 170 175
Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Glu Asn Glu Arg Thr Leu
180 185 190
Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr Glu Lys Val Lys Ser
195 200 205
Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly Cys Phe Glu Phe 210 215 220
Tyr His Lys Cys Asp Asn Glu Cys Met Glu Ser Val Arg Asn Gly Thr 225 230 235 240
Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys Leu Asn Arg Glu Lys
245 250 255
Val Asp Gly Val Lys Leu Glu Ser Met Gly Ile Tyr Gln Ile Leu Ala
260 265 270
Ile Tyr Ser Thr Val Ala Ser Ser Leu Val Leu Leu Val Ser Leu Gly
275 280 285
Ala Ile Ser Phe Trp Met Cys Ser Asn Gly Ser Leu Gln Cys Arg Ile 290 295 300
(2) INFORMATION FOR SEQ ID NO: 33:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 474 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D ) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
( ix ) FEATURE :
(A) NAME/KEY: CDS
(B) LOCATION: 1..471
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:
GTG GGT AAA GAA TTC AAC AAA TTA GAA AAA AGG ATG GAA AAT TTA AAT 48 Val Gly Lys Glu Phe Asn Lys Leu Glu Lys Arg Met Glu Asn Leu Asn
1 5 10 15
AAA AAA GTT GAT GAT GGA TTT CTG GAC ATT TGG ACA TAT AAT GCA GAA 96 Lys Lys Val Asp Asp Gly Phe Leu Asp Ile Trp Thr Tyr Asn Ala Glu
20 25 30
TTG TTA GTT CTA CTG GAA AAT GAA AGG ACT CTG GAT TTC CAT GAC TCA 144 Leu Leu Val Leu Leu Glu Asn Glu Arg Thr Leu Asp Phe His Asp Ser
35 40 45
AAT GTG AAG AAT CTG TAT GAG AAA GTA AAA AGC CAA TTA AAG AAT AAT 192 Asn Val Lys Asn Leu Tyr Glu Lys Val Lys Ser Gln Leu Lys Asn Asn
50 55 60
GCC AAA GAA ATC GGA AAT GGA TGT TTT GAG TTC TAC CAC AAG TGT GAC 240 Ala Lys Glu Ile Gly Asn Gly Cys Phe Glu Phe Tyr His Lys Cys Asp
65 70 75 80
AAT GAA TGC ATG GAA AGT GTA AGA AAT GGG ACT TAT GAT TAT CCC AAA 288 Asn Glu Cys Met Glu Ser Val Arg Asn Gly Thr Tyr Asp Tyr Pro Lys
85 90 95
TAT TCA GAA GAG TCA AAG TTG AAC AGG GAA AAG GTA GAT GGA GTG AAA 336 Tyr Ser Glu Glu Ser Lys Leu Asn Arg Glu Lys Val Asp Gly Val Lys
100 105 110
TTG GAA TCA ATG GGG ATC TAT CAG ATT CTG GCG ATC TAC TCA ACT GTC 384 Leu Glu Ser Met Gly Ile Tyr Gln Ile Leu Ala Ile Tyr Ser Thr Val
115 120 125
GCC AGT TCA CTG GTG CTT TTG GTC TCC CTG GGG GCA ATC AGT TTC TGG 432 Ala Ser Ser Leu Val Leu Leu Val Ser Leu Gly Ala Ile Ser Phe Trp
130 135 140
ATG TGT TCT AAT GGA TCT TTG CAG TGC AGA ATA TGC ATC TGA 474
Met Cys Ser Asn Gly Ser Leu Gln Cys Arg Ile Cys Ile
145 150 155
(2) INFORMATION FOR SEQ ID NO: 34:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 157 amino acids
(B) TYPE: amino acid
(D ) TOPOLOGY: 1inear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:
Val Gly Lys Glu Phe Asn Lys Leu Glu Lys Arg Met Glu Asn Leu Asn
1 5 10 15
Lys Lys Val Asp Asp Gly Phe Leu Asp Ile Trp Thr Tyr Asn Ala Glu
20 25 30
Leu Leu Val Leu Leu Glu Asn Glu Arg Thr Leu Asp Phe His Asp Ser
35 40 45
Asn Val Lys Asn Leu Tyr Glu Lys Val Lys Ser Gln Leu Lys Asn Asn
50 55 60
Ala Lys Glu Ile Gly Asn Gly Cys Phe Glu Phe Tyr His Lys Cys Asp
65 70 75 80
Asn Glu Cys Met Glu Ser Val Arg Asn Gly Thr Tyr Asp Tyr Pro Lys
85 90 95
Tyr Ser Glu Glu Ser Lys Leu Asn Arg Glu Lys Val Asp Gly Val Lys
100 105 110
Leu Glu Ser Met Gly Ile Tyr Gln Ile Leu Ala Ile Tyr Ser Thr Val
115 120 125
Ala Ser Ser Leu Val Leu Leu Val Ser Leu Gly Ala Ile Ser Phe Trp
130 135 140
Met Cys Ser Asn Gly Ser Leu Gln Cys Arg Ile Cys Ile
145 150 155
(2) INFORMATION FOR SEQ ID NO: 35:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 47 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:
CATGGATCAT ATGTTAACAG ATATCAAGGC CTGACTGACT GAGAGCT 47
(2) INFORMATION FOR SEQ ID NO: 36:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 39 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D ) TOPOLOGY : unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36:
CTAGTATACA ATTGTCTATA GTTCCGGACT GACTGACTC 39
(2) INFORMATION FOR SEQ ID NO: 37:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 29 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:
CATGGGCGCC CATATGGGCA TATTCGGCG 29
(2) INFORMATION FOR SEQ ID NO:38:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 23 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38:
CCGCGGGTAT ACCCGTATAA GCC 23
(2) INFORMATION FOR SEQ ID NO: 39:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 49 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39:
CATGGATCAT ATGTTAACAA GTACTCGATA TCAATGAGTG ACTGAAGCT 49
(2) INFORMATION FOR SEQ ID NO: 40:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 41 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40:
CTAGTATACA ATTGTTCATG AGCTATAGTT ACTCACTGAC T 41
(2) INFORMATION FOR SEQ ID NO: 41:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 12 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41:
AATTCGTACC TA 12
(2) INFORMATION FOR SEQ ID NO: 42:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 12 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42:
GCATGGATCT AG 12
Claims
1. A vaccine for stimulating protection in animals against infection by influenza virus which comprises a an effective amount of an immunogenic
fragment of the HA2 subunit of an HA protein selected from the group consisting of a type A subtype influenza virus or a type B influenza virus.
2. The vaccine according to claim 1 wherein said type A subunit is H3N2.
3. The vaccine according to claim 1 wherein the polypeptide is fused to a second polypeptide.
4. The vaccine according to claim 2 wherein the second polypeptide comprises the N terminal amino acids of a NS1 protein.
5. The vaccine according to claim 1 wherein the immunogenic fragment of the HA2 subunit is selected from the group consisting of a peptide comprising amino acids 1 to 221 of the H3HA2 subtype, a peptide comprising amino acids 77 to 221 of the H3HA2 subtype, a peptide comprising amino acids 1 to 223 of the BHA2 type, and a peptide comprising amino acids 41 to 223 of the BHA2 type.
6. The vaccine according to claim 5
comprising NS1(1-81)H3HA2(1-221) SEQ ID NO: 10.
7. The vaccine according to claim 5 comprising NS1(1-81)H3HA2(77-221) SEQ ID NO: 12.
8. The vaccine according to claim 5 comprising NS11-42BLHA241-223 SEQ ID NO: 14.
9. A protein comprising an immunogenic fragment of the HA2 subunit of an HA protein selected from the group consisting of Type A subtype or type B influenza virus.
10. The protein according to claim 9 wherein said type A subtype is H3N2.
11. The protein according to claim 9 wherein the peptide containing the immunogenic fragment is fused to a second peptide or protein.
12. The protein according to claim 10 wherein the second peptide comprises the N terminal amino acids of a NS1 protein.
13. The protein according to claim 10 wherein the immunogenic fragment of the HA2 subunit is selected from the group consisting of a peptide comprising amino acids 1 to 221 of the H3HA2 subunit, a peptide comprising amino acids 77 to 221 of the H3HA2 subunit, a peptide comprising amino acids 1-223 of the BHA2 subunit, and a peptide comprising amino acids 41-223 of the BHA2
subunit.
14. A polypeptide NS1(1-81)H3HA2(1-221) SEQ ID NO: 10.
15. A polypeptide NS1(1-81)H3HA2(77-221) SEQ ID NO: 12.
16. A polypeptide NS11-41BLHA241-223 SEQ ID NO: 14.
17. A DNA molecule comprising a coding
sequence for an immunogenic fragment of the HA2 subunit of an HA protein selected from the group consisting of a Type A subtype or type B influenza virus.
18. The DNA molecule according to claim 17 wherein said Type A subunit is H3N2.
19. The DNA molecule according to claim 17 comprising a coding sequence for the polypeptide NS1(1- 81)H3HA2(1-221) SEQ ID NO: 10.
20. The DNA molecule according to claim 17 comprising a coding sequence for the polypeptide NS1(1- 42)H3BLHA2(41-223) SEQ ID NO: 14.
21. The DNA molecule according to claim 17 comprising a coding sequence for the polypeptide NS1(1- 81)H3HA2(77-221) SEQ ID NO: 12.
22. Plasmid pMG13H3HA SEQ ID NO: 9.
23. Plasmid pNS11-41BLHA241-223 SEQ ID NO: 13.
24. A microorganism transformed with a DNA molecule comprising a coding sequence for an immunogenic fragment of the HA2 subunit of an HA protein selected from the group consisting of a Type A subtype or type B influenza virus.
25. The microorganism according to claim 24 wherein said Type A subunit is H3N2.
26. The microorganism according to claim 24 wherein said DNA molecule comprises a coding sequence for the polypeptide NS1(1-81)H3HA2(1-221) SEQ ID NO: 10.
27. A combination vaccine for stimulating protection in animals against infection by influenza virus which comprises a first polypeptide having an immunogenic fragment of the HA2 subunit of an influenza H3 subtype virus and a second polypeptide selected from the group consisting of a polypeptide having an
immunogenic fragment of the HA2 subunit of a type B influenza virus, and a polypeptide having an immunogenic fragment of the HA2 subunit of an H1 subtype influenza virus, and a polypeptide having an immunogenic fragment of the HA2 subunit of an H2 subtype influenza virus.
28. The combination vaccine according to claim 27 wherein the first polypeptide is selected from the group consisting of NS1(1-81)H3HA2(1-221) SEQ ID NO: 10 and NS1(1- 81)H3HA2(77-221) SEQ ID NO: 12.
29. The combination vaccine according to claim 27 wherein the second polypeptide is a polypeptide having an immunogenic fragment of the HA2 subunit of an H1 subtype influenza virus.
30. The combination vaccine according to claim 27 wherein said second polypeptide is selected from the group consisting of C13 SEQ ID NO: 16, D SEQ ID NO: 18, C13 short SEQ ID NO: 20, D short SEQ ID NO: 22, A SEQ ID NO: 24, C SEQ ID NO: 26, ΔD SEQ ID NO: 27, Δ13 SEQ ID NO: 28, M SEQ ID NO: 29, ΔM SEQ ID NO: 30, ΔM+ SEQ ID NO: 32, and H1HA266-222 SEQ ID NO: 34.
31. The combination vaccine according to claim 27 wherein said second polypeptide is NS11-42BLHA241-223 SEQ ID NO: 14.
32. A combination vaccine for stimulating protection in animals against infection by influenza virus which comprises a first polypeptide having an immunogenic fragment of the HA2 subunit of an influenza H3 subtype virus, a second polypeptide having an
immunogenic fragment of the HA2 subunit of an influenza B type virus, and a third polypeptide selected from the group consisting of a polypeptide having an immunogenic fragment of the HA2 subunit of an H1 subtype influenza virus and a polypeptide having an immunogenic fragment of the HA2 subunit of an H2 subtype influenza virus.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US83777392A | 1992-02-18 | 1992-02-18 | |
US07/837,773 | 1992-02-18 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO1993015763A1 true WO1993015763A1 (en) | 1993-08-19 |
Family
ID=25275375
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US1993/001451 WO1993015763A1 (en) | 1992-02-18 | 1993-02-18 | Vaccinal polypeptides |
Country Status (3)
Country | Link |
---|---|
AU (1) | AU3724093A (en) |
MX (1) | MX9300883A (en) |
WO (1) | WO1993015763A1 (en) |
Cited By (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1994022917A1 (en) * | 1993-04-05 | 1994-10-13 | University Of Massachusetts Medical Center | Cross-reactive influenza a immunization |
US5674502A (en) * | 1990-08-08 | 1997-10-07 | University Of Massachusetts Medical Center | Cross-reactive influenza a immunization |
WO2001062778A3 (en) * | 2000-02-23 | 2002-04-04 | Smithkline Beecham Biolog | Tumour-specific animal proteins |
WO2002024734A3 (en) * | 2000-09-19 | 2002-08-15 | Chiron Spa | Influenza a virus haemagglutinin subtype h16 proteins and their encoding nuclei c acid |
JP2006316072A (en) * | 1994-01-27 | 2006-11-24 | Univ Of Massachusetts Medical Center | Immunization by inoculation with DNA transcription unit |
WO2008036146A2 (en) | 2006-07-14 | 2008-03-27 | Sanofi Pasteur Biologics Co. | Construction of recombinant virus vaccines by direct transposon-mediated insertion of foreign immunologic determinants into vector virus proteins |
WO2008100290A2 (en) | 2006-09-29 | 2008-08-21 | Sanofi Pasteur Biologics Co | Recombinant rhinovirus vectors |
US7811574B2 (en) | 2000-02-23 | 2010-10-12 | Glaxosmithkline Biologicals S.A. | Tumour-specific animal proteins |
US20100291128A1 (en) * | 2005-11-18 | 2010-11-18 | Montelione Gaetano T | Novel compositions and vaccines against influenza a and influenza b infections |
CN1840178B (en) * | 2000-02-23 | 2014-05-28 | 史密丝克莱恩比彻姆生物有限公司 | Tumour-specific animal proteins |
US8916514B2 (en) | 2009-05-27 | 2014-12-23 | Glaxosmithkline Biologicals, S.A. | CASB7439 constructs |
CN110003314A (en) * | 2019-04-11 | 2019-07-12 | 上海市计划生育科学研究所 | H1N1 influenza virus hemagglutinin can induce epitope peptide and its application of broad spectrum protection antibody |
US10555998B2 (en) | 2014-11-24 | 2020-02-11 | Intervet Inc. | Inactivated equine influenza virus vaccines |
-
1993
- 1993-02-18 WO PCT/US1993/001451 patent/WO1993015763A1/en active Application Filing
- 1993-02-18 AU AU37240/93A patent/AU3724093A/en not_active Abandoned
- 1993-02-18 MX MX9300883A patent/MX9300883A/en unknown
Non-Patent Citations (4)
Title |
---|
FEDERATION OF AMERICAN SOCIETIES FOR EXPERIMENTAL BIOLOGY, 75th Anual Meeting, Volume 5, No. 5, issued 21-25 April 1991, DILLON et al., "Activity of CD8+ CTL in Mice Immunized with Recombinant Influenza NS1-HA2 Fusion Protein or a CTL Epitope Peptide (HA2 189-199)", Abstract 5748, page A1362. * |
JOURNAL OF EXPERIMENTAL MEDICINE, Volume 162, issued August 1985, YAMADA et al., "Influenza Virus Hemagglutinin-Specific Cytotoxic T Cell Response Induced by Polypeptide Produced in Escherichia Coli", pages 663-674. * |
JOURNAL OF EXPERIMENTAL MEDICINE, Volume 162, issued November 1985, YAMADA et al., "Influenza Virus Subtype-Specific Cytotoxic T Lymphocytes Lyse Targer Cells Coated with a Protein Produced in E. Coli", pages 1720-1725. * |
JOURNAL OF IMMUNOLOGY, Volume 140, No. 4, issued 15 February 1988, KUWANO et al., "HA2 Subunit of Influenza A H1 and H2 Subtype Viruses Induces a Protective Cross-Reactive Cytotoxic T Lymphocyte Response", pages 1264-1268. * |
Cited By (25)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5674502A (en) * | 1990-08-08 | 1997-10-07 | University Of Massachusetts Medical Center | Cross-reactive influenza a immunization |
US5766601A (en) * | 1990-08-08 | 1998-06-16 | University Of Massachusetts Medical Center | Cross-reactive influenza a immunization |
US5882650A (en) * | 1990-08-08 | 1999-03-16 | University Of Massachusetts Medical Center | Cross-reactive influenza A immunization |
WO1994022917A1 (en) * | 1993-04-05 | 1994-10-13 | University Of Massachusetts Medical Center | Cross-reactive influenza a immunization |
JP2006316072A (en) * | 1994-01-27 | 2006-11-24 | Univ Of Massachusetts Medical Center | Immunization by inoculation with DNA transcription unit |
KR100919916B1 (en) | 2000-02-23 | 2009-10-07 | 글락소스미스클라인 바이오로지칼즈 에스.에이. | Tumour-specific animal proteins |
US7803379B2 (en) | 2000-02-23 | 2010-09-28 | Glaxosmithkline Biologicals S.A. | Tumour-specific animal proteins |
CN1840178B (en) * | 2000-02-23 | 2014-05-28 | 史密丝克莱恩比彻姆生物有限公司 | Tumour-specific animal proteins |
EP1650221A3 (en) * | 2000-02-23 | 2006-12-20 | GlaxoSmithKline Biologicals SA | Novel compounds |
CZ303468B6 (en) * | 2000-02-23 | 2012-10-03 | Smithkline Beecham Biologicals S. A. | Immunogenic mixture and pharmaceutical mixture |
KR100848973B1 (en) * | 2000-02-23 | 2008-07-30 | 글락소스미스클라인 바이오로지칼즈 에스.에이. | Tumour-specific animal proteins |
US8207123B2 (en) | 2000-02-23 | 2012-06-26 | Glaxosmithkline Biologicals S.A. | Tumour-specific animal proteins |
WO2001062778A3 (en) * | 2000-02-23 | 2002-04-04 | Smithkline Beecham Biolog | Tumour-specific animal proteins |
AU2006201042B2 (en) * | 2000-02-23 | 2009-10-08 | Smithkline Beecham Biologicals S.A. | Novel compounds |
AU2001256156B2 (en) * | 2000-02-23 | 2006-01-05 | Smithkline Beecham Biologicals S.A. | Novel compounds |
US7811574B2 (en) | 2000-02-23 | 2010-10-12 | Glaxosmithkline Biologicals S.A. | Tumour-specific animal proteins |
WO2002024734A3 (en) * | 2000-09-19 | 2002-08-15 | Chiron Spa | Influenza a virus haemagglutinin subtype h16 proteins and their encoding nuclei c acid |
US20100291128A1 (en) * | 2005-11-18 | 2010-11-18 | Montelione Gaetano T | Novel compositions and vaccines against influenza a and influenza b infections |
US9119810B2 (en) * | 2005-11-18 | 2015-09-01 | Rutgers, The State University Of New Jersey | Compositions and vaccines against influenza A and influenza B infections |
WO2008036146A2 (en) | 2006-07-14 | 2008-03-27 | Sanofi Pasteur Biologics Co. | Construction of recombinant virus vaccines by direct transposon-mediated insertion of foreign immunologic determinants into vector virus proteins |
WO2008100290A2 (en) | 2006-09-29 | 2008-08-21 | Sanofi Pasteur Biologics Co | Recombinant rhinovirus vectors |
US8916514B2 (en) | 2009-05-27 | 2014-12-23 | Glaxosmithkline Biologicals, S.A. | CASB7439 constructs |
US10555998B2 (en) | 2014-11-24 | 2020-02-11 | Intervet Inc. | Inactivated equine influenza virus vaccines |
CN110003314A (en) * | 2019-04-11 | 2019-07-12 | 上海市计划生育科学研究所 | H1N1 influenza virus hemagglutinin can induce epitope peptide and its application of broad spectrum protection antibody |
CN110003314B (en) * | 2019-04-11 | 2023-06-09 | 上海市计划生育科学研究所 | Epitope peptide capable of inducing broad-spectrum protective antibody by H1N1 influenza virus hemagglutinin and application thereof |
Also Published As
Publication number | Publication date |
---|---|
AU3724093A (en) | 1993-09-03 |
MX9300883A (en) | 1994-08-31 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6207165B1 (en) | Polynucleotide formula against porcine reproductive and respiratory pathologies | |
US6558673B1 (en) | Complexes of immunogens derived from RSV surface glycoprotein G covalently coupled to a support molecule | |
JP3545418B2 (en) | A novel recombination temperature-sensitive mutant of influenza | |
NZ236294A (en) | Recombinant vaccines using herpes virus of turkeys against mareks disease; infectious bronchitis; newcastle disease and infectious bursal disease | |
AU2006225204A1 (en) | Cold-adapted equine influenza viruses | |
WO1993023422A1 (en) | Compositions and methods for vaccination against coronaviruses | |
WO1993015763A1 (en) | Vaccinal polypeptides | |
EP0366238A2 (en) | Influenza vaccinal polypeptides | |
RU2178807C2 (en) | Isolated dna sequence, vector, method of preparing homogeneous protein gp 350, homogeneous protein gp 350, pharmaceutical composition for treatment of ebv-mediated disease or state | |
EP0176493B1 (en) | Vaccinal polypeptides | |
HU217211B (en) | Process for preparation of ehv-4 glycoprotein vaccine | |
RU2358981C2 (en) | Universal avian influenza virus vaccine | |
WO1993024646A1 (en) | Fowl mycoplasma antigen, gene thereof, recombinant vector containing said gene, and vaccine prepared by utilizing the same | |
EP0423869B1 (en) | Infectious bronchitis virus vaccine | |
AU640348B2 (en) | Vaccinal Polypeptides | |
WO1994006468A1 (en) | Recombinant influenza virus vaccine compositions | |
WO2022085648A1 (en) | Fusion protein and vaccine | |
GB2232675A (en) | Stable forms of antigenic taenia ovis polypeptides | |
EP4501947A1 (en) | Fusion protein and vaccine | |
KR102556305B1 (en) | H9N2 Strain Genetic Recombinant Low Pathogenic Avian Influenza A Virus, Manufacturing Method and Vaccine Composition Thereof Using H9N2 Avian Influenza Virus Belonging to Y280 Lineage | |
CA2402339C (en) | Viral antigen and vaccine against isav (infectious salmon anaemia virus) | |
KR20230004330A (en) | Peptide for enhancing neutralizing antibody-producing ability and vaccine composition comprising the same | |
SONDERMEIJER et al. | Sommaire du brevet 2028045 | |
PT91600B (en) | A process for the preparation of a vaccine against the virus of the virus comprising a vaccine and such polypeptide |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): AU CA JP KR NZ US |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): AT BE CH DE DK ES FR GB GR IE IT LU MC NL PT SE |
|
DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | ||
122 | Ep: pct application non-entry in european phase | ||
NENP | Non-entry into the national phase |
Ref country code: CA |