US20030087384A1 - Fibroblast growth factor receptor-like molecules and uses thereof - Google Patents
Fibroblast growth factor receptor-like molecules and uses thereof Download PDFInfo
- Publication number
- US20030087384A1 US20030087384A1 US10/229,584 US22958402A US2003087384A1 US 20030087384 A1 US20030087384 A1 US 20030087384A1 US 22958402 A US22958402 A US 22958402A US 2003087384 A1 US2003087384 A1 US 2003087384A1
- Authority
- US
- United States
- Prior art keywords
- pro
- gly
- leu
- val
- ser
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 102000018233 Fibroblast Growth Factor Human genes 0.000 title abstract description 3
- 108050007372 Fibroblast Growth Factor Proteins 0.000 title abstract description 3
- 229940126864 fibroblast growth factor Drugs 0.000 title abstract description 3
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 269
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 267
- 229920001184 polypeptide Polymers 0.000 claims abstract description 265
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 64
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 58
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 58
- 210000004027 cell Anatomy 0.000 claims abstract description 46
- 238000000034 method Methods 0.000 claims abstract description 25
- 239000013598 vector Substances 0.000 claims abstract description 6
- 239000002773 nucleotide Substances 0.000 claims description 70
- 125000003729 nucleotide group Chemical group 0.000 claims description 70
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 62
- 150000001413 amino acids Chemical class 0.000 claims description 48
- 230000000694 effects Effects 0.000 claims description 39
- 238000006467 substitution reaction Methods 0.000 claims description 37
- 239000012634 fragment Substances 0.000 claims description 21
- 125000000539 amino acid group Chemical group 0.000 claims description 20
- 230000000295 complement effect Effects 0.000 claims description 15
- 238000012217 deletion Methods 0.000 claims description 13
- 230000037430 deletion Effects 0.000 claims description 13
- 230000004048 modification Effects 0.000 claims description 9
- 238000012986 modification Methods 0.000 claims description 9
- 238000003780 insertion Methods 0.000 claims description 8
- 230000037431 insertion Effects 0.000 claims description 8
- 230000000890 antigenic effect Effects 0.000 claims description 3
- 210000004899 c-terminal region Anatomy 0.000 claims description 3
- 238000004590 computer program Methods 0.000 claims description 3
- 230000008569 process Effects 0.000 claims description 3
- 238000012258 culturing Methods 0.000 claims description 2
- 210000003527 eukaryotic cell Anatomy 0.000 claims 1
- 210000001236 prokaryotic cell Anatomy 0.000 claims 1
- 239000011230 binding agent Substances 0.000 abstract description 10
- 239000008194 pharmaceutical composition Substances 0.000 abstract description 5
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 abstract description 4
- 208000035475 disorder Diseases 0.000 abstract description 3
- 238000003745 diagnosis Methods 0.000 abstract description 2
- 230000006806 disease prevention Effects 0.000 abstract description 2
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 58
- 235000001014 amino acid Nutrition 0.000 description 58
- 229940024606 amino acid Drugs 0.000 description 44
- 108010050848 glycylleucine Proteins 0.000 description 38
- 108020004414 DNA Proteins 0.000 description 28
- 108010047495 alanylglycine Proteins 0.000 description 27
- 108090000623 proteins and genes Proteins 0.000 description 26
- UCIYCBSJBQGDGM-LPEHRKFASA-N Ala-Arg-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N UCIYCBSJBQGDGM-LPEHRKFASA-N 0.000 description 25
- 241000282414 Homo sapiens Species 0.000 description 25
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 23
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 20
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 20
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 19
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 18
- 108010054155 lysyllysine Proteins 0.000 description 18
- OOCFXNOVSLSHAB-IUCAKERBSA-N Gly-Pro-Pro Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OOCFXNOVSLSHAB-IUCAKERBSA-N 0.000 description 17
- 241000880493 Leptailurus serval Species 0.000 description 16
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 16
- 108010049041 glutamylalanine Proteins 0.000 description 16
- 108010034529 leucyl-lysine Proteins 0.000 description 16
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 15
- 108010060199 cysteinylproline Proteins 0.000 description 15
- 238000009396 hybridization Methods 0.000 description 15
- 235000018102 proteins Nutrition 0.000 description 15
- 102000004169 proteins and genes Human genes 0.000 description 15
- XSPKAHFVDKRGRL-DCAQKATOSA-N Arg-Pro-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XSPKAHFVDKRGRL-DCAQKATOSA-N 0.000 description 14
- IXQGOKWTQPCIQM-YJRXYDGGSA-N His-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O IXQGOKWTQPCIQM-YJRXYDGGSA-N 0.000 description 14
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 14
- 108010060035 arginylproline Proteins 0.000 description 14
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 14
- 108010010147 glycylglutamine Proteins 0.000 description 14
- 108010015792 glycyllysine Proteins 0.000 description 14
- 108010057821 leucylproline Proteins 0.000 description 14
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 13
- 241001529936 Murinae Species 0.000 description 13
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 13
- 108010085325 histidylproline Proteins 0.000 description 13
- 108010080629 tryptophan-leucine Proteins 0.000 description 13
- 108010090461 DFG peptide Proteins 0.000 description 12
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 12
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 12
- ADMHZNPMMVKGJW-BPUTZDHNSA-N Trp-Ser-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N ADMHZNPMMVKGJW-BPUTZDHNSA-N 0.000 description 12
- WNZSAUMKZQXHNC-UKJIMTQDSA-N Val-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N WNZSAUMKZQXHNC-UKJIMTQDSA-N 0.000 description 12
- 108010087924 alanylproline Proteins 0.000 description 12
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 12
- 108010092854 aspartyllysine Proteins 0.000 description 12
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 12
- 108010018006 histidylserine Proteins 0.000 description 12
- 108020004999 messenger RNA Proteins 0.000 description 12
- 108010029020 prolylglycine Proteins 0.000 description 12
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 11
- AIFHRTPABBBHKU-RCWTZXSCSA-N Arg-Thr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AIFHRTPABBBHKU-RCWTZXSCSA-N 0.000 description 11
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 11
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 11
- GGBQDSHTXKQSLP-NHCYSSNCSA-N Asp-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N GGBQDSHTXKQSLP-NHCYSSNCSA-N 0.000 description 11
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 11
- PPSQSIDMOVPKPI-BJDJZHNGSA-N Ile-Cys-Leu Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O PPSQSIDMOVPKPI-BJDJZHNGSA-N 0.000 description 11
- DXYBNWJZJVSZAE-GUBZILKMSA-N Leu-Gln-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N DXYBNWJZJVSZAE-GUBZILKMSA-N 0.000 description 11
- FOBUGKUBUJOWAD-IHPCNDPISA-N Leu-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 FOBUGKUBUJOWAD-IHPCNDPISA-N 0.000 description 11
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 11
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 11
- 108091028043 Nucleic acid sequence Proteins 0.000 description 11
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 11
- BQASAMYRHNCKQE-IHRRRGAJSA-N Tyr-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N BQASAMYRHNCKQE-IHRRRGAJSA-N 0.000 description 11
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 11
- 238000004458 analytical method Methods 0.000 description 11
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 11
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 11
- 241000282326 Felis catus Species 0.000 description 10
- QKIBIXAQKAFZGL-GUBZILKMSA-N Leu-Cys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O QKIBIXAQKAFZGL-GUBZILKMSA-N 0.000 description 10
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 10
- 108010093581 aspartyl-proline Proteins 0.000 description 10
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 10
- -1 pseudoisocytosine Chemical compound 0.000 description 10
- 108010061238 threonyl-glycine Proteins 0.000 description 10
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 9
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 9
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 9
- FMYQECOAIFGQGU-CYDGBPFRSA-N Arg-Val-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMYQECOAIFGQGU-CYDGBPFRSA-N 0.000 description 9
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 9
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 9
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 9
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 9
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 9
- JCVOHUKUYSYBAD-DCAQKATOSA-N Lys-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCCN)N)C(=O)N[C@@H](CS)C(=O)O JCVOHUKUYSYBAD-DCAQKATOSA-N 0.000 description 9
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 9
- 101100342977 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-1 gene Proteins 0.000 description 9
- 238000000636 Northern blotting Methods 0.000 description 9
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 9
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 9
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 9
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 9
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 9
- 238000007901 in situ hybridization Methods 0.000 description 9
- 108010031719 prolyl-serine Proteins 0.000 description 9
- 108010026333 seryl-proline Proteins 0.000 description 9
- BRPMXFSTKXXNHF-IUCAKERBSA-N (2s)-1-[2-[[(2s)-pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1NCCC1 BRPMXFSTKXXNHF-IUCAKERBSA-N 0.000 description 8
- AWAXZRDKUHOPBO-GUBZILKMSA-N Ala-Gln-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O AWAXZRDKUHOPBO-GUBZILKMSA-N 0.000 description 8
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 8
- PRLPSDIHSRITSF-UNQGMJICSA-N Arg-Phe-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PRLPSDIHSRITSF-UNQGMJICSA-N 0.000 description 8
- OVQJAKFLFTZDNC-GUBZILKMSA-N Arg-Pro-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O OVQJAKFLFTZDNC-GUBZILKMSA-N 0.000 description 8
- YNSUUAOAFCVINY-OSUNSFLBSA-N Arg-Thr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YNSUUAOAFCVINY-OSUNSFLBSA-N 0.000 description 8
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 8
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 8
- IIMZHVKZBGSEKZ-SZMVWBNQSA-N Gln-Trp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O IIMZHVKZBGSEKZ-SZMVWBNQSA-N 0.000 description 8
- XIKYNVKEUINBGL-IUCAKERBSA-N Glu-His-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O XIKYNVKEUINBGL-IUCAKERBSA-N 0.000 description 8
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 8
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 8
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 8
- HRGGKHFHRSFSDE-CIUDSAMLSA-N His-Asn-Ser Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N HRGGKHFHRSFSDE-CIUDSAMLSA-N 0.000 description 8
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 8
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 8
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 8
- NDJSSFWDYDUQID-YTWAJWBKSA-N Met-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N)O NDJSSFWDYDUQID-YTWAJWBKSA-N 0.000 description 8
- 241000699666 Mus <mouse, genus> Species 0.000 description 8
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 8
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 8
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 8
- IMNVAOPEMFDAQD-NHCYSSNCSA-N Pro-Val-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IMNVAOPEMFDAQD-NHCYSSNCSA-N 0.000 description 8
- OQSGBXGNAFQGGS-CYDGBPFRSA-N Pro-Val-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OQSGBXGNAFQGGS-CYDGBPFRSA-N 0.000 description 8
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 8
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 8
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 8
- IAOHCSQDQDWRQU-GUBZILKMSA-N Ser-Val-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IAOHCSQDQDWRQU-GUBZILKMSA-N 0.000 description 8
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 8
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 8
- YTNGABPUXFEOGU-SRVKXCTJSA-N Val-Pro-Arg Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTNGABPUXFEOGU-SRVKXCTJSA-N 0.000 description 8
- SVLAAUGFIHSJPK-JYJNAYRXSA-N Val-Trp-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CO)C(=O)O)N SVLAAUGFIHSJPK-JYJNAYRXSA-N 0.000 description 8
- 108010008355 arginyl-glutamine Proteins 0.000 description 8
- 108010047857 aspartylglycine Proteins 0.000 description 8
- 230000004927 fusion Effects 0.000 description 8
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 8
- 108010089804 glycyl-threonine Proteins 0.000 description 8
- 108010003700 lysyl aspartic acid Proteins 0.000 description 8
- 108010093296 prolyl-prolyl-alanine Proteins 0.000 description 8
- IMIZPWSVYADSCN-UHFFFAOYSA-N 4-methyl-2-[[4-methyl-2-[[4-methyl-2-(pyrrolidine-2-carbonylamino)pentanoyl]amino]pentanoyl]amino]pentanoic acid Chemical compound CC(C)CC(C(O)=O)NC(=O)C(CC(C)C)NC(=O)C(CC(C)C)NC(=O)C1CCCN1 IMIZPWSVYADSCN-UHFFFAOYSA-N 0.000 description 7
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 7
- GORKKVHIBWAQHM-GCJQMDKQSA-N Ala-Asn-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GORKKVHIBWAQHM-GCJQMDKQSA-N 0.000 description 7
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 7
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 7
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 7
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 7
- VDCIPFYVCICPEC-FXQIFTODSA-N Asn-Arg-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O VDCIPFYVCICPEC-FXQIFTODSA-N 0.000 description 7
- JQSWHKKUZMTOIH-QWRGUYRKSA-N Asn-Gly-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N JQSWHKKUZMTOIH-QWRGUYRKSA-N 0.000 description 7
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 7
- 108091026890 Coding region Proteins 0.000 description 7
- FCXJJTRGVAZDER-FXQIFTODSA-N Cys-Val-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O FCXJJTRGVAZDER-FXQIFTODSA-N 0.000 description 7
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 7
- ULXXDWZMMSQBDC-ACZMJKKPSA-N Gln-Asp-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ULXXDWZMMSQBDC-ACZMJKKPSA-N 0.000 description 7
- VNTGPISAOMAXRK-CIUDSAMLSA-N Gln-Pro-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O VNTGPISAOMAXRK-CIUDSAMLSA-N 0.000 description 7
- ZZLDMBMFKZFQMU-NRPADANISA-N Gln-Val-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O ZZLDMBMFKZFQMU-NRPADANISA-N 0.000 description 7
- MZZSCEANQDPJER-ONGXEEELSA-N Gly-Ala-Phe Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MZZSCEANQDPJER-ONGXEEELSA-N 0.000 description 7
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 7
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 7
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 7
- RHRLHXQWHCNJKR-PMVVWTBXSA-N Gly-Thr-His Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 RHRLHXQWHCNJKR-PMVVWTBXSA-N 0.000 description 7
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 7
- SOYCWSKCUVDLMC-AVGNSLFASA-N His-Pro-Arg Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N2CCC[C@H]2C(=O)N[C@@H](CCCNC(=N)N)C(=O)O SOYCWSKCUVDLMC-AVGNSLFASA-N 0.000 description 7
- ZHHLTWUOWXHVQJ-YUMQZZPRSA-N His-Ser-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZHHLTWUOWXHVQJ-YUMQZZPRSA-N 0.000 description 7
- MQFGXJNSUJTXDT-QSFUFRPTSA-N Ile-Gly-Ile Chemical compound N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)O MQFGXJNSUJTXDT-QSFUFRPTSA-N 0.000 description 7
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 7
- KWURTLAFFDOTEQ-GUBZILKMSA-N Leu-Cys-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KWURTLAFFDOTEQ-GUBZILKMSA-N 0.000 description 7
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 7
- DCGXHWINSHEPIR-SRVKXCTJSA-N Leu-Lys-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N DCGXHWINSHEPIR-SRVKXCTJSA-N 0.000 description 7
- JLYUZRKPDKHUTC-WDSOQIARSA-N Leu-Pro-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JLYUZRKPDKHUTC-WDSOQIARSA-N 0.000 description 7
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 7
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 7
- QUYCUALODHJQLK-CIUDSAMLSA-N Lys-Asp-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUYCUALODHJQLK-CIUDSAMLSA-N 0.000 description 7
- BEGQVWUZFXLNHZ-IHPCNDPISA-N Lys-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN)C(O)=O)=CNC2=C1 BEGQVWUZFXLNHZ-IHPCNDPISA-N 0.000 description 7
- URGPVYGVWLIRGT-DCAQKATOSA-N Lys-Met-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O URGPVYGVWLIRGT-DCAQKATOSA-N 0.000 description 7
- URBJRJKWSUFCKS-AVGNSLFASA-N Lys-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCCCN)N URBJRJKWSUFCKS-AVGNSLFASA-N 0.000 description 7
- YSPZCHGIWAQVKQ-AVGNSLFASA-N Lys-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN YSPZCHGIWAQVKQ-AVGNSLFASA-N 0.000 description 7
- PPNCMJARTHYNEC-MEYUZBJRSA-N Lys-Tyr-Thr Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)CC1=CC=C(O)C=C1 PPNCMJARTHYNEC-MEYUZBJRSA-N 0.000 description 7
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 7
- ZFVWWUILVLLVFA-AVGNSLFASA-N Phe-Gln-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N ZFVWWUILVLLVFA-AVGNSLFASA-N 0.000 description 7
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 7
- ICTZKEXYDDZZFP-SRVKXCTJSA-N Pro-Arg-Pro Chemical compound N([C@@H](CCCN=C(N)N)C(=O)N1[C@@H](CCC1)C(O)=O)C(=O)[C@@H]1CCCN1 ICTZKEXYDDZZFP-SRVKXCTJSA-N 0.000 description 7
- GDXZRWYXJSGWIV-GMOBBJLQSA-N Pro-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 GDXZRWYXJSGWIV-GMOBBJLQSA-N 0.000 description 7
- ZCXQTRXYZOSGJR-FXQIFTODSA-N Pro-Asp-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZCXQTRXYZOSGJR-FXQIFTODSA-N 0.000 description 7
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 7
- ULWBBFKQBDNGOY-RWMBFGLXSA-N Pro-Lys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N2CCC[C@@H]2C(=O)O ULWBBFKQBDNGOY-RWMBFGLXSA-N 0.000 description 7
- KDBHVPXBQADZKY-GUBZILKMSA-N Pro-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KDBHVPXBQADZKY-GUBZILKMSA-N 0.000 description 7
- XDKKMRPRRCOELJ-GUBZILKMSA-N Pro-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 XDKKMRPRRCOELJ-GUBZILKMSA-N 0.000 description 7
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 7
- QGMLKFGTGXWAHF-IHRRRGAJSA-N Ser-Arg-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QGMLKFGTGXWAHF-IHRRRGAJSA-N 0.000 description 7
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 7
- IXCHOHLPHNGFTJ-YUMQZZPRSA-N Ser-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N IXCHOHLPHNGFTJ-YUMQZZPRSA-N 0.000 description 7
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 7
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 7
- CRZNCABIJLRFKZ-IUKAMOBKSA-N Thr-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N CRZNCABIJLRFKZ-IUKAMOBKSA-N 0.000 description 7
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 7
- QHUWWSQZTFLXPQ-FJXKBIBVSA-N Thr-Met-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QHUWWSQZTFLXPQ-FJXKBIBVSA-N 0.000 description 7
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 7
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 7
- PMHLLBKTDHQMCY-ULQDDVLXSA-N Tyr-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMHLLBKTDHQMCY-ULQDDVLXSA-N 0.000 description 7
- HRHYJNLMIJWGLF-BZSNNMDCSA-N Tyr-Ser-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 HRHYJNLMIJWGLF-BZSNNMDCSA-N 0.000 description 7
- QFHRUCJIRVILCK-YJRXYDGGSA-N Tyr-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O QFHRUCJIRVILCK-YJRXYDGGSA-N 0.000 description 7
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 7
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 7
- 238000007792 addition Methods 0.000 description 7
- 230000004071 biological effect Effects 0.000 description 7
- 108010084389 glycyltryptophan Proteins 0.000 description 7
- 108010017391 lysylvaline Proteins 0.000 description 7
- 108010034507 methionyltryptophan Proteins 0.000 description 7
- 108010077112 prolyl-proline Proteins 0.000 description 7
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 6
- PIDRBUDUWHBYSR-UHFFFAOYSA-N 1-[2-[[2-[(2-amino-4-methylpentanoyl)amino]-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O PIDRBUDUWHBYSR-UHFFFAOYSA-N 0.000 description 6
- CWEAKSWWKHGTRJ-BQBZGAKWSA-N Ala-Gly-Met Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O CWEAKSWWKHGTRJ-BQBZGAKWSA-N 0.000 description 6
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 6
- IFKQPMZRDQZSHI-GHCJXIJMSA-N Ala-Ile-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O IFKQPMZRDQZSHI-GHCJXIJMSA-N 0.000 description 6
- FEGOCLZUJUFCHP-CIUDSAMLSA-N Ala-Pro-Gln Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FEGOCLZUJUFCHP-CIUDSAMLSA-N 0.000 description 6
- XQNRANMFRPCFFW-GCJQMDKQSA-N Ala-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C)N)O XQNRANMFRPCFFW-GCJQMDKQSA-N 0.000 description 6
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 6
- XVLLUZMFSAYKJV-GUBZILKMSA-N Arg-Asp-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XVLLUZMFSAYKJV-GUBZILKMSA-N 0.000 description 6
- BTJVOUQWFXABOI-IHRRRGAJSA-N Arg-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCNC(N)=N BTJVOUQWFXABOI-IHRRRGAJSA-N 0.000 description 6
- YCYXHLZRUSJITQ-SRVKXCTJSA-N Arg-Pro-Pro Chemical compound NC(=N)NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 YCYXHLZRUSJITQ-SRVKXCTJSA-N 0.000 description 6
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 6
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 6
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 6
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 6
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 6
- FRSGNOZCTWDVFZ-ACZMJKKPSA-N Asp-Asp-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O FRSGNOZCTWDVFZ-ACZMJKKPSA-N 0.000 description 6
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 6
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 6
- XGIAHEUULGOZHH-GUBZILKMSA-N Cys-Arg-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N XGIAHEUULGOZHH-GUBZILKMSA-N 0.000 description 6
- VNCLJDOTEPPBBD-GUBZILKMSA-N Gln-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N VNCLJDOTEPPBBD-GUBZILKMSA-N 0.000 description 6
- QFJPFPCSXOXMKI-BPUTZDHNSA-N Gln-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N QFJPFPCSXOXMKI-BPUTZDHNSA-N 0.000 description 6
- CGVWDTRDPLOMHZ-FXQIFTODSA-N Gln-Glu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CGVWDTRDPLOMHZ-FXQIFTODSA-N 0.000 description 6
- ILKYYKRAULNYMS-JYJNAYRXSA-N Gln-Lys-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ILKYYKRAULNYMS-JYJNAYRXSA-N 0.000 description 6
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 6
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 6
- HAGKYCXGTRUUFI-RYUDHWBXSA-N Glu-Tyr-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)O HAGKYCXGTRUUFI-RYUDHWBXSA-N 0.000 description 6
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 6
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 6
- KTSZUNRRYXPZTK-BQBZGAKWSA-N Gly-Gln-Glu Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KTSZUNRRYXPZTK-BQBZGAKWSA-N 0.000 description 6
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 6
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 6
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 6
- MDKCBHZLQJZOCJ-STQMWFEESA-N Gly-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)CN MDKCBHZLQJZOCJ-STQMWFEESA-N 0.000 description 6
- FKYQEVBRZSFAMJ-QWRGUYRKSA-N Gly-Ser-Tyr Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FKYQEVBRZSFAMJ-QWRGUYRKSA-N 0.000 description 6
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 6
- WZUVPPKBWHMQCE-UHFFFAOYSA-N Haematoxylin Chemical group C12=CC(O)=C(O)C=C2CC2(O)C1C1=CC=C(O)C(O)=C1OC2 WZUVPPKBWHMQCE-UHFFFAOYSA-N 0.000 description 6
- PROLDOGUBQJNPG-RWMBFGLXSA-N His-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O PROLDOGUBQJNPG-RWMBFGLXSA-N 0.000 description 6
- YAALVYQFVJNXIV-KKUMJFAQSA-N His-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 YAALVYQFVJNXIV-KKUMJFAQSA-N 0.000 description 6
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 6
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 6
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 6
- WLXGMVVHTIUPHE-ULQDDVLXSA-N Lys-Phe-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O WLXGMVVHTIUPHE-ULQDDVLXSA-N 0.000 description 6
- BIWVMACFGZFIEB-VFAJRCTISA-N Lys-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCCCN)N)O BIWVMACFGZFIEB-VFAJRCTISA-N 0.000 description 6
- OHXUUQDOBQKSNB-AVGNSLFASA-N Lys-Val-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OHXUUQDOBQKSNB-AVGNSLFASA-N 0.000 description 6
- QLFAPXUXEBAWEK-NHCYSSNCSA-N Lys-Val-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QLFAPXUXEBAWEK-NHCYSSNCSA-N 0.000 description 6
- VVWQHJUYBPJCNS-UMPQAUOISA-N Met-Trp-Thr Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O)=CNC2=C1 VVWQHJUYBPJCNS-UMPQAUOISA-N 0.000 description 6
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 6
- 230000004988 N-glycosylation Effects 0.000 description 6
- 108010079364 N-glycylalanine Proteins 0.000 description 6
- 108010066427 N-valyltryptophan Proteins 0.000 description 6
- LJUUGSWZPQOJKD-JYJNAYRXSA-N Phe-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O LJUUGSWZPQOJKD-JYJNAYRXSA-N 0.000 description 6
- BSTPNLNKHKBONJ-HTUGSXCWSA-N Phe-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O BSTPNLNKHKBONJ-HTUGSXCWSA-N 0.000 description 6
- HPXVFFIIGOAQRV-DCAQKATOSA-N Pro-Arg-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O HPXVFFIIGOAQRV-DCAQKATOSA-N 0.000 description 6
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 6
- YFNOUBWUIIJQHF-LPEHRKFASA-N Pro-Asp-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O YFNOUBWUIIJQHF-LPEHRKFASA-N 0.000 description 6
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 6
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 6
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 6
- ZAUHSLVPDLNTRZ-QXEWZRGKSA-N Pro-Val-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZAUHSLVPDLNTRZ-QXEWZRGKSA-N 0.000 description 6
- IDQFQFVEWMWRQQ-DLOVCJGASA-N Ser-Ala-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IDQFQFVEWMWRQQ-DLOVCJGASA-N 0.000 description 6
- IXUGADGDCQDLSA-FXQIFTODSA-N Ser-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N IXUGADGDCQDLSA-FXQIFTODSA-N 0.000 description 6
- JAWGSPUJAXYXJA-IHRRRGAJSA-N Ser-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=CC=C1 JAWGSPUJAXYXJA-IHRRRGAJSA-N 0.000 description 6
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 6
- PLQWGQUNUPMNOD-KKUMJFAQSA-N Ser-Tyr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PLQWGQUNUPMNOD-KKUMJFAQSA-N 0.000 description 6
- PCMZJFMUYWIERL-ZKWXMUAHSA-N Ser-Val-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMZJFMUYWIERL-ZKWXMUAHSA-N 0.000 description 6
- FKNQFGJONOIPTF-UHFFFAOYSA-N Sodium cation Chemical compound [Na+] FKNQFGJONOIPTF-UHFFFAOYSA-N 0.000 description 6
- YUOCMLNTUZAGNF-KLHWPWHYSA-N Thr-His-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N)O YUOCMLNTUZAGNF-KLHWPWHYSA-N 0.000 description 6
- JRAUIKJSEAKTGD-TUBUOCAGSA-N Thr-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N JRAUIKJSEAKTGD-TUBUOCAGSA-N 0.000 description 6
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 6
- GIBPOCDKBPNRJB-HSHDSVGOSA-N Thr-Met-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O GIBPOCDKBPNRJB-HSHDSVGOSA-N 0.000 description 6
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 6
- FRQRWAMUESPWMT-HSHDSVGOSA-N Thr-Trp-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCSC)C(=O)O)N)O FRQRWAMUESPWMT-HSHDSVGOSA-N 0.000 description 6
- XGFYGMKZKFRGAI-RCWTZXSCSA-N Thr-Val-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XGFYGMKZKFRGAI-RCWTZXSCSA-N 0.000 description 6
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 6
- PVPAOIGJYHVWBT-KKHAAJSZSA-N Val-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N)O PVPAOIGJYHVWBT-KKHAAJSZSA-N 0.000 description 6
- IQQYYFPCWKWUHW-YDHLFZDLSA-N Val-Asn-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N IQQYYFPCWKWUHW-YDHLFZDLSA-N 0.000 description 6
- FMQGYTMERWBMSI-HJWJTTGWSA-N Val-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N FMQGYTMERWBMSI-HJWJTTGWSA-N 0.000 description 6
- 235000018417 cysteine Nutrition 0.000 description 6
- 239000003446 ligand Substances 0.000 description 6
- 108010018625 phenylalanylarginine Proteins 0.000 description 6
- 229910001415 sodium ion Inorganic materials 0.000 description 6
- 238000012360 testing method Methods 0.000 description 6
- 108010084932 tryptophyl-proline Proteins 0.000 description 6
- 108010073969 valyllysine Proteins 0.000 description 6
- QDRGPQWIVZNJQD-CIUDSAMLSA-N Ala-Arg-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QDRGPQWIVZNJQD-CIUDSAMLSA-N 0.000 description 5
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 5
- CREYEAPXISDKSB-FQPOAREZSA-N Ala-Thr-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CREYEAPXISDKSB-FQPOAREZSA-N 0.000 description 5
- MFTVXYMXSAQZNL-DJFWLOJKSA-N Asp-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)O)N MFTVXYMXSAQZNL-DJFWLOJKSA-N 0.000 description 5
- LDLZOAJRXXBVGF-GMOBBJLQSA-N Asp-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N LDLZOAJRXXBVGF-GMOBBJLQSA-N 0.000 description 5
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 5
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 5
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 5
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 5
- LCNNHVQNFNJLGK-AVGNSLFASA-N His-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N LCNNHVQNFNJLGK-AVGNSLFASA-N 0.000 description 5
- LLZLRXBTOOFODM-QSFUFRPTSA-N Ile-Asp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N LLZLRXBTOOFODM-QSFUFRPTSA-N 0.000 description 5
- VNDQNDYEPSXHLU-JUKXBJQTSA-N Ile-His-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N VNDQNDYEPSXHLU-JUKXBJQTSA-N 0.000 description 5
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 5
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 5
- JNLSTRPWUXOORL-MMWGEVLESA-N Ile-Ser-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N JNLSTRPWUXOORL-MMWGEVLESA-N 0.000 description 5
- 108010065920 Insulin Lispro Proteins 0.000 description 5
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 5
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 5
- BGGTYDNTOYRTTR-MEYUZBJRSA-N Leu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(C)C)N)O BGGTYDNTOYRTTR-MEYUZBJRSA-N 0.000 description 5
- NDORZBUHCOJQDO-GVXVVHGQSA-N Lys-Gln-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O NDORZBUHCOJQDO-GVXVVHGQSA-N 0.000 description 5
- VUTWYNQUSJWBHO-BZSNNMDCSA-N Lys-Leu-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VUTWYNQUSJWBHO-BZSNNMDCSA-N 0.000 description 5
- UQJOKDAYFULYIX-AVGNSLFASA-N Lys-Pro-Pro Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 UQJOKDAYFULYIX-AVGNSLFASA-N 0.000 description 5
- TUSOIZOVPJCMFC-FXQIFTODSA-N Met-Asp-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O TUSOIZOVPJCMFC-FXQIFTODSA-N 0.000 description 5
- BCRQJDMZQUHQSV-STQMWFEESA-N Met-Gly-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BCRQJDMZQUHQSV-STQMWFEESA-N 0.000 description 5
- KSIPKXNIQOWMIC-RCWTZXSCSA-N Met-Thr-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KSIPKXNIQOWMIC-RCWTZXSCSA-N 0.000 description 5
- 241001465754 Metazoa Species 0.000 description 5
- DVOCGBNHAUHKHJ-DKIMLUQUSA-N Phe-Ile-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O DVOCGBNHAUHKHJ-DKIMLUQUSA-N 0.000 description 5
- QSKCKTUQPICLSO-AVGNSLFASA-N Pro-Arg-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O QSKCKTUQPICLSO-AVGNSLFASA-N 0.000 description 5
- ZPPVJIJMIKTERM-YUMQZZPRSA-N Pro-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ZPPVJIJMIKTERM-YUMQZZPRSA-N 0.000 description 5
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 5
- SXJOPONICMGFCR-DCAQKATOSA-N Pro-Ser-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O SXJOPONICMGFCR-DCAQKATOSA-N 0.000 description 5
- 101710149951 Protein Tat Proteins 0.000 description 5
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 5
- UICKAKRRRBTILH-GUBZILKMSA-N Ser-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N UICKAKRRRBTILH-GUBZILKMSA-N 0.000 description 5
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 5
- CXBFHZLODKPIJY-AAEUAGOBSA-N Ser-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N CXBFHZLODKPIJY-AAEUAGOBSA-N 0.000 description 5
- ZUDXUJSYCCNZQJ-DCAQKATOSA-N Ser-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N ZUDXUJSYCCNZQJ-DCAQKATOSA-N 0.000 description 5
- KRGDDWVBBDLPSJ-CUJWVEQBSA-N Thr-His-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O KRGDDWVBBDLPSJ-CUJWVEQBSA-N 0.000 description 5
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 5
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 5
- RRVUOLRWIZXBRQ-IHPCNDPISA-N Trp-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RRVUOLRWIZXBRQ-IHPCNDPISA-N 0.000 description 5
- FBGDDUKYOBNZJL-WDSOQIARSA-N Trp-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N FBGDDUKYOBNZJL-WDSOQIARSA-N 0.000 description 5
- MNWINJDPGBNOED-ULQDDVLXSA-N Tyr-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 MNWINJDPGBNOED-ULQDDVLXSA-N 0.000 description 5
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 5
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 5
- 239000003795 chemical substances by application Substances 0.000 description 5
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 5
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 5
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 5
- 108010043293 glycyl-prolyl-glycyl-glycine Proteins 0.000 description 5
- 108010027338 isoleucylcysteine Proteins 0.000 description 5
- 108010009298 lysylglutamic acid Proteins 0.000 description 5
- 239000000463 material Substances 0.000 description 5
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 5
- 108091033319 polynucleotide Proteins 0.000 description 5
- 102000040430 polynucleotide Human genes 0.000 description 5
- 239000002157 polynucleotide Substances 0.000 description 5
- 108010004914 prolylarginine Proteins 0.000 description 5
- 108010053725 prolylvaline Proteins 0.000 description 5
- 239000000523 sample Substances 0.000 description 5
- 239000001509 sodium citrate Substances 0.000 description 5
- 230000001225 therapeutic effect Effects 0.000 description 5
- 108010051110 tyrosyl-lysine Proteins 0.000 description 5
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 4
- JOADBFCFJGNIKF-GUBZILKMSA-N Arg-Met-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O JOADBFCFJGNIKF-GUBZILKMSA-N 0.000 description 4
- RCFGLXMZDYNRSC-CIUDSAMLSA-N Asn-Lys-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O RCFGLXMZDYNRSC-CIUDSAMLSA-N 0.000 description 4
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 4
- BKOIIURTQAJHAT-GUBZILKMSA-N Asp-Pro-Pro Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 BKOIIURTQAJHAT-GUBZILKMSA-N 0.000 description 4
- ZXCAQANTQWBICD-DCAQKATOSA-N Cys-Lys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N ZXCAQANTQWBICD-DCAQKATOSA-N 0.000 description 4
- XBELMDARIGXDKY-GUBZILKMSA-N Cys-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CS)N XBELMDARIGXDKY-GUBZILKMSA-N 0.000 description 4
- GLWXKFRTOHKGIT-ACZMJKKPSA-N Glu-Asn-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GLWXKFRTOHKGIT-ACZMJKKPSA-N 0.000 description 4
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 4
- OQDLKDUVMTUPPG-AVGNSLFASA-N His-Leu-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OQDLKDUVMTUPPG-AVGNSLFASA-N 0.000 description 4
- DGAQECJNVWCQMB-PUAWFVPOSA-M Ilexoside XXIX Chemical compound C[C@@H]1CC[C@@]2(CC[C@@]3(C(=CC[C@H]4[C@]3(CC[C@@H]5[C@@]4(CC[C@@H](C5(C)C)OS(=O)(=O)[O-])C)C)[C@@H]2[C@]1(C)O)C)C(=O)O[C@H]6[C@@H]([C@H]([C@@H]([C@H](O6)CO)O)O)O.[Na+] DGAQECJNVWCQMB-PUAWFVPOSA-M 0.000 description 4
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 4
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 4
- LRQKBLKVPFOOQJ-YFKPBYRVSA-N L-norleucine Chemical compound CCCC[C@H]([NH3+])C([O-])=O LRQKBLKVPFOOQJ-YFKPBYRVSA-N 0.000 description 4
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 4
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 4
- 241000699660 Mus musculus Species 0.000 description 4
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 4
- 108091034117 Oligonucleotide Proteins 0.000 description 4
- SNSYSBUTTJBPDG-OKZBNKHCSA-N Pro-Trp-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N4CCC[C@@H]4C(=O)O SNSYSBUTTJBPDG-OKZBNKHCSA-N 0.000 description 4
- 108010076504 Protein Sorting Signals Proteins 0.000 description 4
- SXAGUVRFGJSFKC-ZEILLAHLSA-N Thr-His-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SXAGUVRFGJSFKC-ZEILLAHLSA-N 0.000 description 4
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 4
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 4
- AZGZDDNKFFUDEH-QWRGUYRKSA-N Tyr-Gly-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AZGZDDNKFFUDEH-QWRGUYRKSA-N 0.000 description 4
- PMDOQZFYGWZSTK-LSJOCFKGSA-N Val-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C PMDOQZFYGWZSTK-LSJOCFKGSA-N 0.000 description 4
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 4
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 4
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 4
- 210000004271 bone marrow stromal cell Anatomy 0.000 description 4
- 238000010367 cloning Methods 0.000 description 4
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 4
- 108010025306 histidylleucine Proteins 0.000 description 4
- 230000003993 interaction Effects 0.000 description 4
- 238000002844 melting Methods 0.000 description 4
- 230000008018 melting Effects 0.000 description 4
- 108010090894 prolylleucine Proteins 0.000 description 4
- 239000011734 sodium Substances 0.000 description 4
- 229910052708 sodium Inorganic materials 0.000 description 4
- NLJMYIDDQXHKNR-UHFFFAOYSA-K sodium citrate Chemical compound O.O.[Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O NLJMYIDDQXHKNR-UHFFFAOYSA-K 0.000 description 4
- 108010020532 tyrosyl-proline Proteins 0.000 description 4
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 3
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 3
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 3
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 3
- WKPXXXUSUHAXDE-SRVKXCTJSA-N Arg-Pro-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O WKPXXXUSUHAXDE-SRVKXCTJSA-N 0.000 description 3
- ADPACBMPYWJJCE-FXQIFTODSA-N Arg-Ser-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O ADPACBMPYWJJCE-FXQIFTODSA-N 0.000 description 3
- UXHYOWXTJLBEPG-GSSVUCPTSA-N Asn-Thr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UXHYOWXTJLBEPG-GSSVUCPTSA-N 0.000 description 3
- XEGZSHSPQNDNRH-JRQIVUDYSA-N Asn-Tyr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XEGZSHSPQNDNRH-JRQIVUDYSA-N 0.000 description 3
- YNCHFVRXEQFPBY-BQBZGAKWSA-N Asp-Gly-Arg Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N YNCHFVRXEQFPBY-BQBZGAKWSA-N 0.000 description 3
- IZUNQDRIAOLWCN-YUMQZZPRSA-N Cys-Leu-Gly Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N IZUNQDRIAOLWCN-YUMQZZPRSA-N 0.000 description 3
- 108091029865 Exogenous DNA Proteins 0.000 description 3
- NSORZJXKUQFEKL-JGVFFNPUSA-N Gln-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)N)N)C(=O)O NSORZJXKUQFEKL-JGVFFNPUSA-N 0.000 description 3
- RBSKVTZUFMIWFU-XEGUGMAKSA-N Gln-Trp-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O RBSKVTZUFMIWFU-XEGUGMAKSA-N 0.000 description 3
- JPXNYFOHTHSREU-UWVGGRQHSA-N Gly-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN JPXNYFOHTHSREU-UWVGGRQHSA-N 0.000 description 3
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 3
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 3
- IZVICCORZOSGPT-JSGCOSHPSA-N Gly-Val-Tyr Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IZVICCORZOSGPT-JSGCOSHPSA-N 0.000 description 3
- BSWLQVGEVFYGIM-ZPFDUUQYSA-N Ile-Gln-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N BSWLQVGEVFYGIM-ZPFDUUQYSA-N 0.000 description 3
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 3
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 3
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 3
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 3
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 3
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 3
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 3
- URHJPNHRQMQGOZ-RHYQMDGZSA-N Leu-Thr-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O URHJPNHRQMQGOZ-RHYQMDGZSA-N 0.000 description 3
- AXVIGSRGTMNSJU-YESZJQIVSA-N Leu-Tyr-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N AXVIGSRGTMNSJU-YESZJQIVSA-N 0.000 description 3
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 3
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 3
- IEVXCWPVBYCJRZ-IXOXFDKPSA-N Lys-Thr-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IEVXCWPVBYCJRZ-IXOXFDKPSA-N 0.000 description 3
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 3
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 3
- LIIXIZKVWNYQHB-STECZYCISA-N Met-Tyr-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LIIXIZKVWNYQHB-STECZYCISA-N 0.000 description 3
- APZNYJFGVAGFCF-JYJNAYRXSA-N Phe-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccccc1)C(C)C)C(O)=O APZNYJFGVAGFCF-JYJNAYRXSA-N 0.000 description 3
- 241000269631 Pleurodeles waltl Species 0.000 description 3
- CMOIIANLNNYUTP-SRVKXCTJSA-N Pro-Gln-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O CMOIIANLNNYUTP-SRVKXCTJSA-N 0.000 description 3
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 3
- PCWLNNZTBJTZRN-AVGNSLFASA-N Pro-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 PCWLNNZTBJTZRN-AVGNSLFASA-N 0.000 description 3
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 3
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 3
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 3
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 3
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 3
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 3
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 3
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 3
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 3
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 3
- DYIXEGROAOVQPK-VFAJRCTISA-N Trp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O DYIXEGROAOVQPK-VFAJRCTISA-N 0.000 description 3
- MVFQLSPDMMFCMW-KKUMJFAQSA-N Tyr-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O MVFQLSPDMMFCMW-KKUMJFAQSA-N 0.000 description 3
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 3
- XIFAHCUNWWKUDE-DCAQKATOSA-N Val-Cys-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N XIFAHCUNWWKUDE-DCAQKATOSA-N 0.000 description 3
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 3
- CXWJFWAZIVWBOS-XQQFMLRXSA-N Val-Lys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CXWJFWAZIVWBOS-XQQFMLRXSA-N 0.000 description 3
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 3
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 3
- 210000000577 adipose tissue Anatomy 0.000 description 3
- 239000000556 agonist Substances 0.000 description 3
- 235000004279 alanine Nutrition 0.000 description 3
- 239000005557 antagonist Substances 0.000 description 3
- 239000000427 antigen Substances 0.000 description 3
- 102000036639 antigens Human genes 0.000 description 3
- 108091007433 antigens Proteins 0.000 description 3
- 210000004556 brain Anatomy 0.000 description 3
- 125000000837 carbohydrate group Chemical group 0.000 description 3
- 238000001516 cell proliferation assay Methods 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 239000002299 complementary DNA Substances 0.000 description 3
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 description 3
- 239000003814 drug Substances 0.000 description 3
- YQGOJNYOYNNSMM-UHFFFAOYSA-N eosin Chemical group [Na+].OC(=O)C1=CC=CC=C1C1=C2C=C(Br)C(=O)C(Br)=C2OC2=C(Br)C(O)=C(Br)C=C21 YQGOJNYOYNNSMM-UHFFFAOYSA-N 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 108010078144 glutaminyl-glycine Proteins 0.000 description 3
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 3
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 3
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 3
- 210000002216 heart Anatomy 0.000 description 3
- 230000001965 increasing effect Effects 0.000 description 3
- 210000003734 kidney Anatomy 0.000 description 3
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 3
- 210000004185 liver Anatomy 0.000 description 3
- 210000004072 lung Anatomy 0.000 description 3
- 229930182817 methionine Natural products 0.000 description 3
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 3
- 239000000203 mixture Substances 0.000 description 3
- 210000000496 pancreas Anatomy 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 238000002864 sequence alignment Methods 0.000 description 3
- 210000002027 skeletal muscle Anatomy 0.000 description 3
- 235000019333 sodium laurylsulphate Nutrition 0.000 description 3
- 241000894007 species Species 0.000 description 3
- 210000000952 spleen Anatomy 0.000 description 3
- 239000000126 substance Substances 0.000 description 3
- 238000013518 transcription Methods 0.000 description 3
- 230000035897 transcription Effects 0.000 description 3
- 238000010361 transduction Methods 0.000 description 3
- 230000026683 transduction Effects 0.000 description 3
- 238000001890 transfection Methods 0.000 description 3
- 238000012546 transfer Methods 0.000 description 3
- 230000009261 transgenic effect Effects 0.000 description 3
- 108010009962 valyltyrosine Proteins 0.000 description 3
- 238000005406 washing Methods 0.000 description 3
- IESDGNYHXIOKRW-YXMSTPNBSA-N (2s)-2-[[(2s)-1-[(2s)-6-amino-2-[[(2s,3r)-2-amino-3-hydroxybutanoyl]amino]hexanoyl]pyrrolidine-2-carbonyl]amino]-5-(diaminomethylideneamino)pentanoic acid Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IESDGNYHXIOKRW-YXMSTPNBSA-N 0.000 description 2
- DIBLBAURNYJYBF-XLXZRNDBSA-N (2s)-2-[[(2s)-2-[[2-[[(2s)-6-amino-2-[[(2s)-2-amino-3-methylbutanoyl]amino]hexanoyl]amino]acetyl]amino]-3-phenylpropanoyl]amino]-3-(4-hydroxyphenyl)propanoic acid Chemical compound C([C@H](NC(=O)CNC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 DIBLBAURNYJYBF-XLXZRNDBSA-N 0.000 description 2
- HPZMWTNATZPBIH-UHFFFAOYSA-N 1-methyladenine Chemical compound CN1C=NC2=NC=NC2=C1N HPZMWTNATZPBIH-UHFFFAOYSA-N 0.000 description 2
- RFLVMTUMFYRZCB-UHFFFAOYSA-N 1-methylguanine Chemical compound O=C1N(C)C(N)=NC2=C1N=CN2 RFLVMTUMFYRZCB-UHFFFAOYSA-N 0.000 description 2
- YSAJFXWTVFGPAX-UHFFFAOYSA-N 2-[(2,4-dioxo-1h-pyrimidin-5-yl)oxy]acetic acid Chemical compound OC(=O)COC1=CNC(=O)NC1=O YSAJFXWTVFGPAX-UHFFFAOYSA-N 0.000 description 2
- FZWGECJQACGGTI-UHFFFAOYSA-N 2-amino-7-methyl-1,7-dihydro-6H-purin-6-one Chemical compound NC1=NC(O)=C2N(C)C=NC2=N1 FZWGECJQACGGTI-UHFFFAOYSA-N 0.000 description 2
- OVONXEQGWXGFJD-UHFFFAOYSA-N 4-sulfanylidene-1h-pyrimidin-2-one Chemical compound SC=1C=CNC(=O)N=1 OVONXEQGWXGFJD-UHFFFAOYSA-N 0.000 description 2
- OIVLITBTBDPEFK-UHFFFAOYSA-N 5,6-dihydrouracil Chemical compound O=C1CCNC(=O)N1 OIVLITBTBDPEFK-UHFFFAOYSA-N 0.000 description 2
- DCPSTSVLRXOYGS-UHFFFAOYSA-N 6-amino-1h-pyrimidine-2-thione Chemical compound NC1=CC=NC(S)=N1 DCPSTSVLRXOYGS-UHFFFAOYSA-N 0.000 description 2
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 2
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 2
- XUCHENWTTBFODJ-FXQIFTODSA-N Ala-Met-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O XUCHENWTTBFODJ-FXQIFTODSA-N 0.000 description 2
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 2
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 2
- WQLDNOCHHRISMS-NAKRPEOUSA-N Ala-Pro-Ile Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WQLDNOCHHRISMS-NAKRPEOUSA-N 0.000 description 2
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 2
- DHONNEYAZPNGSG-UBHSHLNASA-N Ala-Val-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DHONNEYAZPNGSG-UBHSHLNASA-N 0.000 description 2
- PTVGLOCPAVYPFG-CIUDSAMLSA-N Arg-Gln-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PTVGLOCPAVYPFG-CIUDSAMLSA-N 0.000 description 2
- QAODJPUKWNNNRP-DCAQKATOSA-N Arg-Glu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QAODJPUKWNNNRP-DCAQKATOSA-N 0.000 description 2
- XLWSGICNBZGYTA-CIUDSAMLSA-N Arg-Glu-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XLWSGICNBZGYTA-CIUDSAMLSA-N 0.000 description 2
- INXWADWANGLMPJ-JYJNAYRXSA-N Arg-Phe-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)CC1=CC=CC=C1 INXWADWANGLMPJ-JYJNAYRXSA-N 0.000 description 2
- 239000004475 Arginine Substances 0.000 description 2
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 2
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 2
- LGCVSPFCFXWUEY-IHPCNDPISA-N Asn-Trp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N LGCVSPFCFXWUEY-IHPCNDPISA-N 0.000 description 2
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 2
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 2
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 2
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 2
- XWKPSMRPIKKDDU-RCOVLWMOSA-N Asp-Val-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O XWKPSMRPIKKDDU-RCOVLWMOSA-N 0.000 description 2
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 2
- UDPSLLFHOLGXBY-FXQIFTODSA-N Cys-Glu-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDPSLLFHOLGXBY-FXQIFTODSA-N 0.000 description 2
- MBRWOKXNHTUJMB-CIUDSAMLSA-N Cys-Pro-Glu Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O MBRWOKXNHTUJMB-CIUDSAMLSA-N 0.000 description 2
- 241000588724 Escherichia coli Species 0.000 description 2
- 108090000381 Fibroblast growth factor 4 Proteins 0.000 description 2
- 102100028072 Fibroblast growth factor 4 Human genes 0.000 description 2
- 108090000382 Fibroblast growth factor 6 Proteins 0.000 description 2
- 102100028075 Fibroblast growth factor 6 Human genes 0.000 description 2
- DHNWZLGBTPUTQQ-QEJZJMRPSA-N Gln-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N DHNWZLGBTPUTQQ-QEJZJMRPSA-N 0.000 description 2
- BBFCMGBMYIAGRS-AUTRQRHGSA-N Gln-Val-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BBFCMGBMYIAGRS-AUTRQRHGSA-N 0.000 description 2
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 2
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 2
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 2
- CKOFNWCLWRYUHK-XHNCKOQMSA-N Glu-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CKOFNWCLWRYUHK-XHNCKOQMSA-N 0.000 description 2
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 2
- APHGWLWMOXGZRL-DCAQKATOSA-N Glu-Glu-His Chemical compound N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O APHGWLWMOXGZRL-DCAQKATOSA-N 0.000 description 2
- ALMBZBOCGSVSAI-ACZMJKKPSA-N Glu-Ser-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ALMBZBOCGSVSAI-ACZMJKKPSA-N 0.000 description 2
- MFYLRRCYBBJYPI-JYJNAYRXSA-N Glu-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O MFYLRRCYBBJYPI-JYJNAYRXSA-N 0.000 description 2
- OCQUNKSFDYDXBG-QXEWZRGKSA-N Gly-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OCQUNKSFDYDXBG-QXEWZRGKSA-N 0.000 description 2
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 2
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 2
- AQLHORCVPGXDJW-IUCAKERBSA-N Gly-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN AQLHORCVPGXDJW-IUCAKERBSA-N 0.000 description 2
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 description 2
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 2
- IKAIKUBBJHFNBZ-LURJTMIESA-N Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CN IKAIKUBBJHFNBZ-LURJTMIESA-N 0.000 description 2
- CVFOYJJOZYYEPE-KBPBESRZSA-N Gly-Lys-Tyr Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CVFOYJJOZYYEPE-KBPBESRZSA-N 0.000 description 2
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 2
- WDXLKVQATNEAJQ-BQBZGAKWSA-N Gly-Pro-Asp Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WDXLKVQATNEAJQ-BQBZGAKWSA-N 0.000 description 2
- GAAHQHNCMIAYEX-UWVGGRQHSA-N Gly-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GAAHQHNCMIAYEX-UWVGGRQHSA-N 0.000 description 2
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 2
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 2
- ONSARSFSJHTMFJ-STQMWFEESA-N Gly-Trp-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ONSARSFSJHTMFJ-STQMWFEESA-N 0.000 description 2
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 description 2
- 239000004471 Glycine Substances 0.000 description 2
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 2
- FYTCLUIYTYFGPT-YUMQZZPRSA-N His-Gly-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FYTCLUIYTYFGPT-YUMQZZPRSA-N 0.000 description 2
- FLXCRBXJRJSDHX-AVGNSLFASA-N His-Pro-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O FLXCRBXJRJSDHX-AVGNSLFASA-N 0.000 description 2
- HZWWOGWOBQBETJ-CUJWVEQBSA-N His-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O HZWWOGWOBQBETJ-CUJWVEQBSA-N 0.000 description 2
- CCUSLCQWVMWTIS-IXOXFDKPSA-N His-Thr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O CCUSLCQWVMWTIS-IXOXFDKPSA-N 0.000 description 2
- 108700000788 Human immunodeficiency virus 1 tat peptide (47-57) Proteins 0.000 description 2
- 108700039609 IRW peptide Proteins 0.000 description 2
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 2
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 2
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 2
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 2
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 2
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 2
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 2
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 2
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 2
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 2
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 2
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 2
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 2
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 2
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 2
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 2
- FGZVGOAAROXFAB-IXOXFDKPSA-N Leu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N)O FGZVGOAAROXFAB-IXOXFDKPSA-N 0.000 description 2
- FPFOYSCDUWTZBF-IHPCNDPISA-N Leu-Trp-Leu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H]([NH3+])CC(C)C)C(=O)N[C@@H](CC(C)C)C([O-])=O)=CNC2=C1 FPFOYSCDUWTZBF-IHPCNDPISA-N 0.000 description 2
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 2
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 2
- VEGLGAOVLFODGC-GUBZILKMSA-N Lys-Glu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VEGLGAOVLFODGC-GUBZILKMSA-N 0.000 description 2
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 2
- WQDKIVRHTQYJSN-DCAQKATOSA-N Lys-Ser-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N WQDKIVRHTQYJSN-DCAQKATOSA-N 0.000 description 2
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 2
- 239000004472 Lysine Substances 0.000 description 2
- YYEIFXZOBZVDPH-DCAQKATOSA-N Met-Lys-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O YYEIFXZOBZVDPH-DCAQKATOSA-N 0.000 description 2
- 241000699670 Mus sp. Species 0.000 description 2
- HYVABZIGRDEKCD-UHFFFAOYSA-N N(6)-dimethylallyladenine Chemical compound CC(C)=CCNC1=NC=NC2=C1N=CN2 HYVABZIGRDEKCD-UHFFFAOYSA-N 0.000 description 2
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 2
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 2
- 230000004989 O-glycosylation Effects 0.000 description 2
- JEGFCFLCRSJCMA-IHRRRGAJSA-N Phe-Arg-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N JEGFCFLCRSJCMA-IHRRRGAJSA-N 0.000 description 2
- BIYWZVCPZIFGPY-QWRGUYRKSA-N Phe-Gly-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O BIYWZVCPZIFGPY-QWRGUYRKSA-N 0.000 description 2
- MSHZERMPZKCODG-ACRUOGEOSA-N Phe-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MSHZERMPZKCODG-ACRUOGEOSA-N 0.000 description 2
- JDMKQHSHKJHAHR-UHFFFAOYSA-N Phe-Phe-Leu-Tyr Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)CC1=CC=CC=C1 JDMKQHSHKJHAHR-UHFFFAOYSA-N 0.000 description 2
- IIEOLPMQYRBZCN-SRVKXCTJSA-N Phe-Ser-Cys Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O IIEOLPMQYRBZCN-SRVKXCTJSA-N 0.000 description 2
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 2
- ALJGSKMBIUEJOB-FXQIFTODSA-N Pro-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@@H]1CCCN1 ALJGSKMBIUEJOB-FXQIFTODSA-N 0.000 description 2
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 2
- IHCXPSYCHXFXKT-DCAQKATOSA-N Pro-Arg-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O IHCXPSYCHXFXKT-DCAQKATOSA-N 0.000 description 2
- FKKHDBFNOLCYQM-FXQIFTODSA-N Pro-Cys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O FKKHDBFNOLCYQM-FXQIFTODSA-N 0.000 description 2
- TUYWCHPXKQTISF-LPEHRKFASA-N Pro-Cys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N2CCC[C@@H]2C(=O)O TUYWCHPXKQTISF-LPEHRKFASA-N 0.000 description 2
- UAYHMOIGIQZLFR-NHCYSSNCSA-N Pro-Gln-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UAYHMOIGIQZLFR-NHCYSSNCSA-N 0.000 description 2
- WSRWHZRUOCACLJ-UWVGGRQHSA-N Pro-Gly-His Chemical compound C([C@@H](C(=O)O)NC(=O)CNC(=O)[C@H]1NCCC1)C1=CN=CN1 WSRWHZRUOCACLJ-UWVGGRQHSA-N 0.000 description 2
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 2
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 2
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 2
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 description 2
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 2
- 102000001708 Protein Isoforms Human genes 0.000 description 2
- 108010029485 Protein Isoforms Proteins 0.000 description 2
- NRCJWSGXMAPYQX-LPEHRKFASA-N Ser-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N)C(=O)O NRCJWSGXMAPYQX-LPEHRKFASA-N 0.000 description 2
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 2
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 2
- RNFKSBPHLTZHLU-WHFBIAKZSA-N Ser-Cys-Gly Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N)O RNFKSBPHLTZHLU-WHFBIAKZSA-N 0.000 description 2
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 2
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 2
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 2
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 2
- OCWWJBZQXGYQCA-DCAQKATOSA-N Ser-Lys-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O OCWWJBZQXGYQCA-DCAQKATOSA-N 0.000 description 2
- FZEUTKVQGMVGHW-AVGNSLFASA-N Ser-Phe-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZEUTKVQGMVGHW-AVGNSLFASA-N 0.000 description 2
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 2
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 2
- RCOUFINCYASMDN-GUBZILKMSA-N Ser-Val-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O RCOUFINCYASMDN-GUBZILKMSA-N 0.000 description 2
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 2
- ZUUDNCOCILSYAM-KKHAAJSZSA-N Thr-Asp-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZUUDNCOCILSYAM-KKHAAJSZSA-N 0.000 description 2
- NRUPKQSXTJNQGD-XGEHTFHBSA-N Thr-Cys-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NRUPKQSXTJNQGD-XGEHTFHBSA-N 0.000 description 2
- ASJDFGOPDCVXTG-KATARQTJSA-N Thr-Cys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O ASJDFGOPDCVXTG-KATARQTJSA-N 0.000 description 2
- KWQBJOUOSNJDRR-XAVMHZPKSA-N Thr-Cys-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N)O KWQBJOUOSNJDRR-XAVMHZPKSA-N 0.000 description 2
- DSLHSTIUAPKERR-XGEHTFHBSA-N Thr-Cys-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O DSLHSTIUAPKERR-XGEHTFHBSA-N 0.000 description 2
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 2
- FIFDDJFLNVAVMS-RHYQMDGZSA-N Thr-Leu-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O FIFDDJFLNVAVMS-RHYQMDGZSA-N 0.000 description 2
- BDGBHYCAZJPLHX-HJGDQZAQSA-N Thr-Lys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BDGBHYCAZJPLHX-HJGDQZAQSA-N 0.000 description 2
- MGJLBZFUXUGMML-VOAKCMCISA-N Thr-Lys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MGJLBZFUXUGMML-VOAKCMCISA-N 0.000 description 2
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 2
- BEZTUFWTPVOROW-KJEVXHAQSA-N Thr-Tyr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O BEZTUFWTPVOROW-KJEVXHAQSA-N 0.000 description 2
- CJEHCEOXPLASCK-MEYUZBJRSA-N Thr-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=C(O)C=C1 CJEHCEOXPLASCK-MEYUZBJRSA-N 0.000 description 2
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 2
- 239000004473 Threonine Substances 0.000 description 2
- DQDXHYIEITXNJY-BPUTZDHNSA-N Trp-Gln-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N DQDXHYIEITXNJY-BPUTZDHNSA-N 0.000 description 2
- JEYRCNVVYHTZMY-SZMVWBNQSA-N Trp-Pro-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JEYRCNVVYHTZMY-SZMVWBNQSA-N 0.000 description 2
- HTGJDTPQYFMKNC-VFAJRCTISA-N Trp-Thr-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)[C@@H](C)O)=CNC2=C1 HTGJDTPQYFMKNC-VFAJRCTISA-N 0.000 description 2
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 2
- SCCKSNREWHMKOJ-SRVKXCTJSA-N Tyr-Asn-Ser Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O SCCKSNREWHMKOJ-SRVKXCTJSA-N 0.000 description 2
- CNLKDWSAORJEMW-KWQFWETISA-N Tyr-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O CNLKDWSAORJEMW-KWQFWETISA-N 0.000 description 2
- SINRIKQYQJRGDQ-MEYUZBJRSA-N Tyr-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SINRIKQYQJRGDQ-MEYUZBJRSA-N 0.000 description 2
- NHOVZGFNTGMYMI-KKUMJFAQSA-N Tyr-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NHOVZGFNTGMYMI-KKUMJFAQSA-N 0.000 description 2
- UUBKSZNKJUJQEJ-JRQIVUDYSA-N Tyr-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O UUBKSZNKJUJQEJ-JRQIVUDYSA-N 0.000 description 2
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 2
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 2
- PMXBARDFIAPBGK-DZKIICNBSA-N Val-Glu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PMXBARDFIAPBGK-DZKIICNBSA-N 0.000 description 2
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 2
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 2
- XTDDIVQWDXMRJL-IHRRRGAJSA-N Val-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N XTDDIVQWDXMRJL-IHRRRGAJSA-N 0.000 description 2
- HPANGHISDXDUQY-ULQDDVLXSA-N Val-Lys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HPANGHISDXDUQY-ULQDDVLXSA-N 0.000 description 2
- KRAHMIJVUPUOTQ-DCAQKATOSA-N Val-Ser-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KRAHMIJVUPUOTQ-DCAQKATOSA-N 0.000 description 2
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 2
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 2
- 239000002253 acid Substances 0.000 description 2
- 230000002411 adverse Effects 0.000 description 2
- 125000003295 alanine group Chemical group N[C@@H](C)C(=O)* 0.000 description 2
- 108010005233 alanylglutamic acid Proteins 0.000 description 2
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 2
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 2
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 2
- 229960001230 asparagine Drugs 0.000 description 2
- 235000009582 asparagine Nutrition 0.000 description 2
- 229940009098 aspartate Drugs 0.000 description 2
- 238000003556 assay Methods 0.000 description 2
- 235000014633 carbohydrates Nutrition 0.000 description 2
- 150000001720 carbohydrates Chemical class 0.000 description 2
- 210000000349 chromosome Anatomy 0.000 description 2
- 210000001072 colon Anatomy 0.000 description 2
- 239000000356 contaminant Substances 0.000 description 2
- 108010016616 cysteinylglycine Proteins 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- 238000009472 formulation Methods 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 229930195712 glutamate Natural products 0.000 description 2
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 2
- 230000013595 glycosylation Effects 0.000 description 2
- 238000006206 glycosylation reaction Methods 0.000 description 2
- 108010037850 glycylvaline Proteins 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 2
- 108010036413 histidylglycine Proteins 0.000 description 2
- 238000002955 isolation Methods 0.000 description 2
- 229960000310 isoleucine Drugs 0.000 description 2
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 2
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 2
- 150000002632 lipids Chemical class 0.000 description 2
- 108010038320 lysylphenylalanine Proteins 0.000 description 2
- 108010005942 methionylglycine Proteins 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 210000003819 peripheral blood mononuclear cell Anatomy 0.000 description 2
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 2
- 210000002826 placenta Anatomy 0.000 description 2
- 239000013612 plasmid Substances 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 2
- 108010070643 prolylglutamic acid Proteins 0.000 description 2
- 230000000069 prophylactic effect Effects 0.000 description 2
- 230000002829 reductive effect Effects 0.000 description 2
- 210000002966 serum Anatomy 0.000 description 2
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- 108010044292 tryptophyltyrosine Proteins 0.000 description 2
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 2
- 239000004474 valine Substances 0.000 description 2
- 108010052774 valyl-lysyl-glycyl-phenylalanyl-tyrosine Proteins 0.000 description 2
- 238000001262 western blot Methods 0.000 description 2
- 108010027345 wheylin-1 peptide Proteins 0.000 description 2
- YVHCULPWZYVJEK-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-[(2-aminoacetyl)amino]-3-(1h-imidazol-5-yl)propanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]pyrrolidine-2-carboxylic acid Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CN=CN1 YVHCULPWZYVJEK-IHRRRGAJSA-N 0.000 description 1
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 1
- PKOHVHWNGUHYRE-ZFWWWQNUSA-N (2s)-1-[2-[[(2s)-2-amino-3-(1h-indol-3-yl)propanoyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound O=C([C@H](CC=1C2=CC=CC=C2NC=1)N)NCC(=O)N1CCC[C@H]1C(O)=O PKOHVHWNGUHYRE-ZFWWWQNUSA-N 0.000 description 1
- NTUPOKHATNSWCY-PMPSAXMXSA-N (2s)-2-[[(2s)-1-[(2r)-2-amino-3-phenylpropanoyl]pyrrolidine-2-carbonyl]amino]-5-(diaminomethylideneamino)pentanoic acid Chemical compound C([C@@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=CC=C1 NTUPOKHATNSWCY-PMPSAXMXSA-N 0.000 description 1
- SATCOUWSAZBIJO-UHFFFAOYSA-N 1-methyladenine Natural products N=C1N(C)C=NC2=C1NC=N2 SATCOUWSAZBIJO-UHFFFAOYSA-N 0.000 description 1
- WJNGQIYEQLPJMN-IOSLPCCCSA-N 1-methylinosine Chemical compound C1=NC=2C(=O)N(C)C=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O WJNGQIYEQLPJMN-IOSLPCCCSA-N 0.000 description 1
- HLYBTPMYFWWNJN-UHFFFAOYSA-N 2-(2,4-dioxo-1h-pyrimidin-5-yl)-2-hydroxyacetic acid Chemical compound OC(=O)C(O)C1=CNC(=O)NC1=O HLYBTPMYFWWNJN-UHFFFAOYSA-N 0.000 description 1
- SVBOROZXXYRWJL-UHFFFAOYSA-N 2-[(4-oxo-2-sulfanylidene-1h-pyrimidin-5-yl)methylamino]acetic acid Chemical compound OC(=O)CNCC1=CNC(=S)NC1=O SVBOROZXXYRWJL-UHFFFAOYSA-N 0.000 description 1
- OZRFYUJEXYKQDV-UHFFFAOYSA-N 2-[[2-[[2-[(2-amino-3-carboxypropanoyl)amino]-3-carboxypropanoyl]amino]-3-carboxypropanoyl]amino]butanedioic acid Chemical compound OC(=O)CC(N)C(=O)NC(CC(O)=O)C(=O)NC(CC(O)=O)C(=O)NC(CC(O)=O)C(O)=O OZRFYUJEXYKQDV-UHFFFAOYSA-N 0.000 description 1
- QMOQBVOBWVNSNO-UHFFFAOYSA-N 2-[[2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]acetyl]amino]acetate Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(O)=O QMOQBVOBWVNSNO-UHFFFAOYSA-N 0.000 description 1
- XMSMHKMPBNTBOD-UHFFFAOYSA-N 2-dimethylamino-6-hydroxypurine Chemical compound N1C(N(C)C)=NC(=O)C2=C1N=CN2 XMSMHKMPBNTBOD-UHFFFAOYSA-N 0.000 description 1
- SMADWRYCYBUIKH-UHFFFAOYSA-N 2-methyl-7h-purin-6-amine Chemical compound CC1=NC(N)=C2NC=NC2=N1 SMADWRYCYBUIKH-UHFFFAOYSA-N 0.000 description 1
- KOLPWZCZXAMXKS-UHFFFAOYSA-N 3-methylcytosine Chemical compound CN1C(N)=CC=NC1=O KOLPWZCZXAMXKS-UHFFFAOYSA-N 0.000 description 1
- GJAKJCICANKRFD-UHFFFAOYSA-N 4-acetyl-4-amino-1,3-dihydropyrimidin-2-one Chemical compound CC(=O)C1(N)NC(=O)NC=C1 GJAKJCICANKRFD-UHFFFAOYSA-N 0.000 description 1
- UACOJOVKHNAJPX-UHFFFAOYSA-N 5-(methoxyamino)-6-methyl-2-sulfanylidene-1H-pyrimidin-4-one Chemical compound CONC=1C(NC(NC=1C)=S)=O UACOJOVKHNAJPX-UHFFFAOYSA-N 0.000 description 1
- MQJSSLBGAQJNER-UHFFFAOYSA-N 5-(methylaminomethyl)-1h-pyrimidine-2,4-dione Chemical compound CNCC1=CNC(=O)NC1=O MQJSSLBGAQJNER-UHFFFAOYSA-N 0.000 description 1
- LQLQRFGHAALLLE-UHFFFAOYSA-N 5-bromouracil Chemical compound BrC1=CNC(=O)NC1=O LQLQRFGHAALLLE-UHFFFAOYSA-N 0.000 description 1
- KELXHQACBIUYSE-UHFFFAOYSA-N 5-methoxy-1h-pyrimidine-2,4-dione Chemical compound COC1=CNC(=O)NC1=O KELXHQACBIUYSE-UHFFFAOYSA-N 0.000 description 1
- ZLAQATDNGLKIEV-UHFFFAOYSA-N 5-methyl-2-sulfanylidene-1h-pyrimidin-4-one Chemical compound CC1=CNC(=S)NC1=O ZLAQATDNGLKIEV-UHFFFAOYSA-N 0.000 description 1
- LRSASMSXMSNRBT-UHFFFAOYSA-N 5-methylcytosine Chemical compound CC1=CNC(=O)N=C1N LRSASMSXMSNRBT-UHFFFAOYSA-N 0.000 description 1
- HSPHKCOAUOJLIO-UHFFFAOYSA-N 6-(aziridin-1-ylamino)-1h-pyrimidin-2-one Chemical compound N1C(=O)N=CC=C1NN1CC1 HSPHKCOAUOJLIO-UHFFFAOYSA-N 0.000 description 1
- GTSVFOOLVUMMCX-UHFFFAOYSA-N 6-(methylaminomethyl)-2,4-dioxo-1H-pyrimidine-5-carboxylic acid Chemical compound C(=O)(O)C=1C(NC(NC=1CNC)=O)=O GTSVFOOLVUMMCX-UHFFFAOYSA-N 0.000 description 1
- CKOMXBHMKXXTNW-UHFFFAOYSA-N 6-methyladenine Chemical compound CNC1=NC=NC2=C1N=CN2 CKOMXBHMKXXTNW-UHFFFAOYSA-N 0.000 description 1
- SWJYOKZMYFJUOY-KQYNXXCUSA-N 9-[(2r,3r,4s,5r)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]-6-(methylamino)-7h-purin-8-one Chemical compound OC1=NC=2C(NC)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O SWJYOKZMYFJUOY-KQYNXXCUSA-N 0.000 description 1
- MSSXOMSJDRHRMC-UHFFFAOYSA-N 9H-purine-2,6-diamine Chemical compound NC1=NC(N)=C2NC=NC2=N1 MSSXOMSJDRHRMC-UHFFFAOYSA-N 0.000 description 1
- 208000036762 Acute promyelocytic leukaemia Diseases 0.000 description 1
- WQVFQXXBNHHPLX-ZKWXMUAHSA-N Ala-Ala-His Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O WQVFQXXBNHHPLX-ZKWXMUAHSA-N 0.000 description 1
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 1
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 1
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 1
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 1
- VIGKUFXFTPWYER-BIIVOSGPSA-N Ala-Cys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N VIGKUFXFTPWYER-BIIVOSGPSA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 1
- VBRDBGCROKWTPV-XHNCKOQMSA-N Ala-Glu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N VBRDBGCROKWTPV-XHNCKOQMSA-N 0.000 description 1
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 1
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 1
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 1
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 1
- NOGFDULFCFXBHB-CIUDSAMLSA-N Ala-Leu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NOGFDULFCFXBHB-CIUDSAMLSA-N 0.000 description 1
- OSRZOHXQCUFIQG-FPMFFAJLSA-N Ala-Phe-Pro Chemical compound C([C@H](NC(=O)[C@@H]([NH3+])C)C(=O)N1[C@H](CCC1)C([O-])=O)C1=CC=CC=C1 OSRZOHXQCUFIQG-FPMFFAJLSA-N 0.000 description 1
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 1
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 1
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 1
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 1
- YXXPVUOMPSZURS-ZLIFDBKOSA-N Ala-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 YXXPVUOMPSZURS-ZLIFDBKOSA-N 0.000 description 1
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 1
- LYILPUNCKACNGF-NAKRPEOUSA-N Ala-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)N LYILPUNCKACNGF-NAKRPEOUSA-N 0.000 description 1
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 1
- 108020000948 Antisense Oligonucleotides Proteins 0.000 description 1
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 1
- DBKNLHKEVPZVQC-LPEHRKFASA-N Arg-Ala-Pro Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O DBKNLHKEVPZVQC-LPEHRKFASA-N 0.000 description 1
- BHSYMWWMVRPCPA-CYDGBPFRSA-N Arg-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCN=C(N)N BHSYMWWMVRPCPA-CYDGBPFRSA-N 0.000 description 1
- KBBKCNHWCDJPGN-GUBZILKMSA-N Arg-Gln-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KBBKCNHWCDJPGN-GUBZILKMSA-N 0.000 description 1
- LMPKCSXZJSXBBL-NHCYSSNCSA-N Arg-Gln-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O LMPKCSXZJSXBBL-NHCYSSNCSA-N 0.000 description 1
- PNIGSVZJNVUVJA-BQBZGAKWSA-N Arg-Gly-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O PNIGSVZJNVUVJA-BQBZGAKWSA-N 0.000 description 1
- ZATRYQNPUHGXCU-DTWKUNHWSA-N Arg-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZATRYQNPUHGXCU-DTWKUNHWSA-N 0.000 description 1
- QEHMMRSQJMOYNO-DCAQKATOSA-N Arg-His-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N QEHMMRSQJMOYNO-DCAQKATOSA-N 0.000 description 1
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 1
- UAOSDDXCTBIPCA-QXEWZRGKSA-N Arg-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UAOSDDXCTBIPCA-QXEWZRGKSA-N 0.000 description 1
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 1
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 1
- AFNHFVVOJZBIJD-GUBZILKMSA-N Arg-Met-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O AFNHFVVOJZBIJD-GUBZILKMSA-N 0.000 description 1
- HIMXTOIXVXWHTB-DCAQKATOSA-N Arg-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N HIMXTOIXVXWHTB-DCAQKATOSA-N 0.000 description 1
- NGYHSXDNNOFHNE-AVGNSLFASA-N Arg-Pro-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O NGYHSXDNNOFHNE-AVGNSLFASA-N 0.000 description 1
- VUGWHBXPMAHEGZ-SRVKXCTJSA-N Arg-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N VUGWHBXPMAHEGZ-SRVKXCTJSA-N 0.000 description 1
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 1
- JOTRDIXZHNQYGP-DCAQKATOSA-N Arg-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JOTRDIXZHNQYGP-DCAQKATOSA-N 0.000 description 1
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 1
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 1
- YHZQOSXDTFRZKU-WDSOQIARSA-N Arg-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N)=CNC2=C1 YHZQOSXDTFRZKU-WDSOQIARSA-N 0.000 description 1
- IZSMEUDYADKZTJ-KJEVXHAQSA-N Arg-Tyr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IZSMEUDYADKZTJ-KJEVXHAQSA-N 0.000 description 1
- GOVUDFOGXOONFT-VEVYYDQMSA-N Asn-Arg-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GOVUDFOGXOONFT-VEVYYDQMSA-N 0.000 description 1
- LJUOLNXOWSWGKF-ACZMJKKPSA-N Asn-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N LJUOLNXOWSWGKF-ACZMJKKPSA-N 0.000 description 1
- PCKRJVZAQZWNKM-WHFBIAKZSA-N Asn-Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O PCKRJVZAQZWNKM-WHFBIAKZSA-N 0.000 description 1
- YYSYDIYQTUPNQQ-SXTJYALSSA-N Asn-Ile-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YYSYDIYQTUPNQQ-SXTJYALSSA-N 0.000 description 1
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 1
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 1
- HFPXZWPUVFVNLL-GUBZILKMSA-N Asn-Leu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFPXZWPUVFVNLL-GUBZILKMSA-N 0.000 description 1
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 1
- KHCNTVRVAYCPQE-CIUDSAMLSA-N Asn-Lys-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O KHCNTVRVAYCPQE-CIUDSAMLSA-N 0.000 description 1
- FTSAJSADJCMDHH-CIUDSAMLSA-N Asn-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N FTSAJSADJCMDHH-CIUDSAMLSA-N 0.000 description 1
- WXVGISRWSYGEDK-KKUMJFAQSA-N Asn-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N WXVGISRWSYGEDK-KKUMJFAQSA-N 0.000 description 1
- AEZCCDMZZJOGII-DCAQKATOSA-N Asn-Met-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O AEZCCDMZZJOGII-DCAQKATOSA-N 0.000 description 1
- IDUUACUJKUXKKD-VEVYYDQMSA-N Asn-Pro-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O IDUUACUJKUXKKD-VEVYYDQMSA-N 0.000 description 1
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 1
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 1
- XIDSGDJNUJRUHE-VEVYYDQMSA-N Asn-Thr-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O XIDSGDJNUJRUHE-VEVYYDQMSA-N 0.000 description 1
- VTYQAQFKMQTKQD-ACZMJKKPSA-N Asp-Ala-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O VTYQAQFKMQTKQD-ACZMJKKPSA-N 0.000 description 1
- FAEIQWHBRBWUBN-FXQIFTODSA-N Asp-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N FAEIQWHBRBWUBN-FXQIFTODSA-N 0.000 description 1
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 1
- NYQHSUGFEWDWPD-ACZMJKKPSA-N Asp-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N NYQHSUGFEWDWPD-ACZMJKKPSA-N 0.000 description 1
- KIJLEFNHWSXHRU-NUMRIWBASA-N Asp-Gln-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KIJLEFNHWSXHRU-NUMRIWBASA-N 0.000 description 1
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 1
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 1
- CTWCFPWFIGRAEP-CIUDSAMLSA-N Asp-Lys-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O CTWCFPWFIGRAEP-CIUDSAMLSA-N 0.000 description 1
- LBOVBQONZJRWPV-YUMQZZPRSA-N Asp-Lys-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LBOVBQONZJRWPV-YUMQZZPRSA-N 0.000 description 1
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 1
- UAXIKORUDGGIGA-DCAQKATOSA-N Asp-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O UAXIKORUDGGIGA-DCAQKATOSA-N 0.000 description 1
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 1
- UTLCRGFJFSZWAW-OLHMAJIHSA-N Asp-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UTLCRGFJFSZWAW-OLHMAJIHSA-N 0.000 description 1
- MFDPBZAFCRKYEY-LAEOZQHASA-N Asp-Val-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFDPBZAFCRKYEY-LAEOZQHASA-N 0.000 description 1
- RKXVTTIQNKPCHU-KKHAAJSZSA-N Asp-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O RKXVTTIQNKPCHU-KKHAAJSZSA-N 0.000 description 1
- 241000972773 Aulopiformes Species 0.000 description 1
- 208000032791 BCR-ABL1 positive chronic myelogenous leukemia Diseases 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 1
- 208000011691 Burkitt lymphomas Diseases 0.000 description 1
- 101100384618 Chlorobium chlorochromatii (strain CaD3) cobQ gene Proteins 0.000 description 1
- 208000010833 Chronic myeloid leukaemia Diseases 0.000 description 1
- 206010052360 Colorectal adenocarcinoma Diseases 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- 102000001493 Cyclophilins Human genes 0.000 description 1
- 108010068682 Cyclophilins Proteins 0.000 description 1
- BVFQOPGFOQVZTE-ACZMJKKPSA-N Cys-Gln-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O BVFQOPGFOQVZTE-ACZMJKKPSA-N 0.000 description 1
- UCMIKRLLIOVDRJ-XKBZYTNZSA-N Cys-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N)O UCMIKRLLIOVDRJ-XKBZYTNZSA-N 0.000 description 1
- OZHXXYOHPLLLMI-CIUDSAMLSA-N Cys-Lys-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OZHXXYOHPLLLMI-CIUDSAMLSA-N 0.000 description 1
- IQXSTXKVEMRMMB-XAVMHZPKSA-N Cys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N)O IQXSTXKVEMRMMB-XAVMHZPKSA-N 0.000 description 1
- NAPULYCVEVVFRB-HEIBUPTGSA-N Cys-Thr-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CS NAPULYCVEVVFRB-HEIBUPTGSA-N 0.000 description 1
- 108090000379 Fibroblast growth factor 2 Proteins 0.000 description 1
- 102000003974 Fibroblast growth factor 2 Human genes 0.000 description 1
- 102100027844 Fibroblast growth factor receptor 4 Human genes 0.000 description 1
- 101710182387 Fibroblast growth factor receptor 4 Proteins 0.000 description 1
- 229920001917 Ficoll Polymers 0.000 description 1
- 238000012413 Fluorescence activated cell sorting analysis Methods 0.000 description 1
- GHASVSINZRGABV-UHFFFAOYSA-N Fluorouracil Chemical compound FC1=CNC(=O)NC1=O GHASVSINZRGABV-UHFFFAOYSA-N 0.000 description 1
- DTMLKCYOQKZXKZ-HJGDQZAQSA-N Gln-Arg-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DTMLKCYOQKZXKZ-HJGDQZAQSA-N 0.000 description 1
- WQWMZOIPXWSZNE-WDSKDSINSA-N Gln-Asp-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O WQWMZOIPXWSZNE-WDSKDSINSA-N 0.000 description 1
- QYKBTDOAMKORGL-FXQIFTODSA-N Gln-Gln-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N QYKBTDOAMKORGL-FXQIFTODSA-N 0.000 description 1
- UFNSPPFJOHNXRE-AUTRQRHGSA-N Gln-Gln-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UFNSPPFJOHNXRE-AUTRQRHGSA-N 0.000 description 1
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 1
- NROSLUJMIQGFKS-IUCAKERBSA-N Gln-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N NROSLUJMIQGFKS-IUCAKERBSA-N 0.000 description 1
- KQOPMGBHNQBCEL-HVTMNAMFSA-N Gln-His-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KQOPMGBHNQBCEL-HVTMNAMFSA-N 0.000 description 1
- IWUFOVSLWADEJC-AVGNSLFASA-N Gln-His-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O IWUFOVSLWADEJC-AVGNSLFASA-N 0.000 description 1
- TWIAMTNJOMRDAK-GUBZILKMSA-N Gln-Lys-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O TWIAMTNJOMRDAK-GUBZILKMSA-N 0.000 description 1
- JRHPEMVLTRADLJ-AVGNSLFASA-N Gln-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JRHPEMVLTRADLJ-AVGNSLFASA-N 0.000 description 1
- ZEEPYMXTJWIMSN-GUBZILKMSA-N Gln-Lys-Ser Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CO)C(O)=O)NC(=O)[C@@H](N)CCC(N)=O ZEEPYMXTJWIMSN-GUBZILKMSA-N 0.000 description 1
- UESYBOXFJWJVSB-AVGNSLFASA-N Gln-Phe-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O UESYBOXFJWJVSB-AVGNSLFASA-N 0.000 description 1
- PBYFVIQRFLNQCO-GUBZILKMSA-N Gln-Pro-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O PBYFVIQRFLNQCO-GUBZILKMSA-N 0.000 description 1
- STHSGOZLFLFGSS-SUSMZKCASA-N Gln-Thr-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STHSGOZLFLFGSS-SUSMZKCASA-N 0.000 description 1
- QGWXAMDECCKGRU-XVKPBYJWSA-N Gln-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(N)=O)C(=O)NCC(O)=O QGWXAMDECCKGRU-XVKPBYJWSA-N 0.000 description 1
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 1
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 1
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 1
- WOSRKEJQESVHGA-CIUDSAMLSA-N Glu-Arg-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O WOSRKEJQESVHGA-CIUDSAMLSA-N 0.000 description 1
- RTOOAKXIJADOLL-GUBZILKMSA-N Glu-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N RTOOAKXIJADOLL-GUBZILKMSA-N 0.000 description 1
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 1
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 1
- NUSWUSKZRCGFEX-FXQIFTODSA-N Glu-Glu-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O NUSWUSKZRCGFEX-FXQIFTODSA-N 0.000 description 1
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 1
- BIHMNDPWRUROFZ-JYJNAYRXSA-N Glu-His-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BIHMNDPWRUROFZ-JYJNAYRXSA-N 0.000 description 1
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 1
- DWBBKNPKDHXIAC-SRVKXCTJSA-N Glu-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCC(O)=O DWBBKNPKDHXIAC-SRVKXCTJSA-N 0.000 description 1
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 1
- YRMZCZIRHYCNHX-RYUDHWBXSA-N Glu-Phe-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O YRMZCZIRHYCNHX-RYUDHWBXSA-N 0.000 description 1
- TWYFJOHWGCCRIR-DCAQKATOSA-N Glu-Pro-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYFJOHWGCCRIR-DCAQKATOSA-N 0.000 description 1
- HVKAAUOFFTUSAA-XDTLVQLUSA-N Glu-Tyr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O HVKAAUOFFTUSAA-XDTLVQLUSA-N 0.000 description 1
- BKMOHWJHXQLFEX-IRIUXVKKSA-N Glu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N)O BKMOHWJHXQLFEX-IRIUXVKKSA-N 0.000 description 1
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 1
- OGCIHJPYKVSMTE-YUMQZZPRSA-N Gly-Arg-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O OGCIHJPYKVSMTE-YUMQZZPRSA-N 0.000 description 1
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 1
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 1
- UXJHNZODTMHWRD-WHFBIAKZSA-N Gly-Asn-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O UXJHNZODTMHWRD-WHFBIAKZSA-N 0.000 description 1
- ZRZILYKEJBMFHY-BQBZGAKWSA-N Gly-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN ZRZILYKEJBMFHY-BQBZGAKWSA-N 0.000 description 1
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 1
- FIQQRCFQXGLOSZ-WDSKDSINSA-N Gly-Glu-Asp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FIQQRCFQXGLOSZ-WDSKDSINSA-N 0.000 description 1
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 1
- JUBDONGMHASUCN-IUCAKERBSA-N Gly-Glu-His Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O JUBDONGMHASUCN-IUCAKERBSA-N 0.000 description 1
- YFGONBOFGGWKKY-VHSXEESVSA-N Gly-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)CN)C(=O)O YFGONBOFGGWKKY-VHSXEESVSA-N 0.000 description 1
- SXJHOPPTOJACOA-QXEWZRGKSA-N Gly-Ile-Arg Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SXJHOPPTOJACOA-QXEWZRGKSA-N 0.000 description 1
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 1
- UYPPAMNTTMJHJW-KCTSRDHCSA-N Gly-Ile-Trp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O UYPPAMNTTMJHJW-KCTSRDHCSA-N 0.000 description 1
- YIFUFYZELCMPJP-YUMQZZPRSA-N Gly-Leu-Cys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O YIFUFYZELCMPJP-YUMQZZPRSA-N 0.000 description 1
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 1
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 1
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 1
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 1
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 1
- JJGBXTYGTKWGAT-YUMQZZPRSA-N Gly-Pro-Glu Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O JJGBXTYGTKWGAT-YUMQZZPRSA-N 0.000 description 1
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 1
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 1
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 1
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 1
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 1
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 1
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 1
- DKJWUIYLMLUBDX-XPUUQOCRSA-N Gly-Val-Cys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O DKJWUIYLMLUBDX-XPUUQOCRSA-N 0.000 description 1
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 1
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 1
- MWAJSVTZZOUOBU-IHRRRGAJSA-N His-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC1=CN=CN1 MWAJSVTZZOUOBU-IHRRRGAJSA-N 0.000 description 1
- WMKXFMUJRCEGRP-SRVKXCTJSA-N His-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N WMKXFMUJRCEGRP-SRVKXCTJSA-N 0.000 description 1
- MDBYBTWRMOAJAY-NHCYSSNCSA-N His-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N MDBYBTWRMOAJAY-NHCYSSNCSA-N 0.000 description 1
- LBQAHBIVXQSBIR-HVTMNAMFSA-N His-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N LBQAHBIVXQSBIR-HVTMNAMFSA-N 0.000 description 1
- UQTKYYNHMVAOAA-HJPIBITLSA-N His-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N UQTKYYNHMVAOAA-HJPIBITLSA-N 0.000 description 1
- UXSATKFPUVZVDK-KKUMJFAQSA-N His-Lys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N UXSATKFPUVZVDK-KKUMJFAQSA-N 0.000 description 1
- YBDOQKVAGTWZMI-XIRDDKMYSA-N His-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N YBDOQKVAGTWZMI-XIRDDKMYSA-N 0.000 description 1
- CSTDQOOBZBAJKE-BWAGICSOSA-N His-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N)O CSTDQOOBZBAJKE-BWAGICSOSA-N 0.000 description 1
- SYPULFZAGBBIOM-GVXVVHGQSA-N His-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N SYPULFZAGBBIOM-GVXVVHGQSA-N 0.000 description 1
- 108010070875 Human Immunodeficiency Virus tat Gene Products Proteins 0.000 description 1
- 241000713772 Human immunodeficiency virus 1 Species 0.000 description 1
- ZXJFURYTPZMUNY-VKOGCVSHSA-N Ile-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 ZXJFURYTPZMUNY-VKOGCVSHSA-N 0.000 description 1
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 1
- LDRALPZEVHVXEK-KBIXCLLPSA-N Ile-Cys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N LDRALPZEVHVXEK-KBIXCLLPSA-N 0.000 description 1
- MVLDERGQICFFLL-ZQINRCPSSA-N Ile-Gln-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 MVLDERGQICFFLL-ZQINRCPSSA-N 0.000 description 1
- KEKTTYCXKGBAAL-VGDYDELISA-N Ile-His-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N KEKTTYCXKGBAAL-VGDYDELISA-N 0.000 description 1
- URWXDJAEEGBADB-TUBUOCAGSA-N Ile-His-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N URWXDJAEEGBADB-TUBUOCAGSA-N 0.000 description 1
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 1
- YGDWPQCLFJNMOL-MNXVOIDGSA-N Ile-Leu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YGDWPQCLFJNMOL-MNXVOIDGSA-N 0.000 description 1
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 1
- IOVUXUSIGXCREV-DKIMLUQUSA-N Ile-Leu-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IOVUXUSIGXCREV-DKIMLUQUSA-N 0.000 description 1
- IALVDKNUFSTICJ-GMOBBJLQSA-N Ile-Met-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IALVDKNUFSTICJ-GMOBBJLQSA-N 0.000 description 1
- PZWBBXHHUSIGKH-OSUNSFLBSA-N Ile-Thr-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PZWBBXHHUSIGKH-OSUNSFLBSA-N 0.000 description 1
- KWHFUMYCSPJCFQ-NGTWOADLSA-N Ile-Thr-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N KWHFUMYCSPJCFQ-NGTWOADLSA-N 0.000 description 1
- NGKPIPCGMLWHBX-WZLNRYEVSA-N Ile-Tyr-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NGKPIPCGMLWHBX-WZLNRYEVSA-N 0.000 description 1
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 1
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 description 1
- 235000003332 Ilex aquifolium Nutrition 0.000 description 1
- 241000209027 Ilex aquifolium Species 0.000 description 1
- 229930010555 Inosine Natural products 0.000 description 1
- UGQMRVRMYYASKQ-KQYNXXCUSA-N Inosine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 UGQMRVRMYYASKQ-KQYNXXCUSA-N 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- LEVWYRKDKASIDU-IMJSIDKUSA-N L-cystine Chemical compound [O-]C(=O)[C@@H]([NH3+])CSSC[C@H]([NH3+])C([O-])=O LEVWYRKDKASIDU-IMJSIDKUSA-N 0.000 description 1
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 1
- 125000000510 L-tryptophano group Chemical group [H]C1=C([H])C([H])=C2N([H])C([H])=C(C([H])([H])[C@@]([H])(C(O[H])=O)N([H])[*])C2=C1[H] 0.000 description 1
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 1
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 1
- FJUKMPUELVROGK-IHRRRGAJSA-N Leu-Arg-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N FJUKMPUELVROGK-IHRRRGAJSA-N 0.000 description 1
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 1
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 1
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 description 1
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 1
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 1
- WCTCIIAGNMFYAO-DCAQKATOSA-N Leu-Cys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O WCTCIIAGNMFYAO-DCAQKATOSA-N 0.000 description 1
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 1
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 1
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 1
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 1
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 1
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 1
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 1
- IFMPDNRWZZEZSL-SRVKXCTJSA-N Leu-Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O IFMPDNRWZZEZSL-SRVKXCTJSA-N 0.000 description 1
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 1
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 1
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 1
- PKKMDPNFGULLNQ-AVGNSLFASA-N Leu-Met-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O PKKMDPNFGULLNQ-AVGNSLFASA-N 0.000 description 1
- FLNPJLDPGMLWAU-UWVGGRQHSA-N Leu-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(C)C FLNPJLDPGMLWAU-UWVGGRQHSA-N 0.000 description 1
- YESNGRDJQWDYLH-KKUMJFAQSA-N Leu-Phe-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N YESNGRDJQWDYLH-KKUMJFAQSA-N 0.000 description 1
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 1
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 1
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 1
- KIZIOFNVSOSKJI-CIUDSAMLSA-N Leu-Ser-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N KIZIOFNVSOSKJI-CIUDSAMLSA-N 0.000 description 1
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 1
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 1
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 1
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 1
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 1
- 206010058467 Lung neoplasm malignant Diseases 0.000 description 1
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 1
- OPTCSTACHGNULU-DCAQKATOSA-N Lys-Cys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCCCN OPTCSTACHGNULU-DCAQKATOSA-N 0.000 description 1
- FGMHXLULNHTPID-KKUMJFAQSA-N Lys-His-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CN=CN1 FGMHXLULNHTPID-KKUMJFAQSA-N 0.000 description 1
- CNGOEHJCLVCJHN-SRVKXCTJSA-N Lys-Pro-Glu Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O CNGOEHJCLVCJHN-SRVKXCTJSA-N 0.000 description 1
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 1
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 1
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- ULNXMMYXQKGNPG-LPEHRKFASA-N Met-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N ULNXMMYXQKGNPG-LPEHRKFASA-N 0.000 description 1
- BMHIFARYXOJDLD-WPRPVWTQSA-N Met-Gly-Val Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O BMHIFARYXOJDLD-WPRPVWTQSA-N 0.000 description 1
- ODFBIJXEWPWSAN-CYDGBPFRSA-N Met-Ile-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O ODFBIJXEWPWSAN-CYDGBPFRSA-N 0.000 description 1
- DOQXHOUYYSPISL-SZMVWBNQSA-N Met-Trp-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCSC)C(=O)O)N DOQXHOUYYSPISL-SZMVWBNQSA-N 0.000 description 1
- 208000033761 Myelogenous Chronic BCR-ABL Positive Leukemia Diseases 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- 108010047562 NGR peptide Proteins 0.000 description 1
- 229930193140 Neomycin Natural products 0.000 description 1
- 108020005187 Oligonucleotide Probes Proteins 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- ULECEJGNDHWSKD-QEJZJMRPSA-N Phe-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 ULECEJGNDHWSKD-QEJZJMRPSA-N 0.000 description 1
- YMORXCKTSSGYIG-IHRRRGAJSA-N Phe-Arg-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N YMORXCKTSSGYIG-IHRRRGAJSA-N 0.000 description 1
- RIYZXJVARWJLKS-KKUMJFAQSA-N Phe-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 RIYZXJVARWJLKS-KKUMJFAQSA-N 0.000 description 1
- FGXIJNMDRCZVDE-KKUMJFAQSA-N Phe-Cys-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N FGXIJNMDRCZVDE-KKUMJFAQSA-N 0.000 description 1
- YYKZDTVQHTUKDW-RYUDHWBXSA-N Phe-Gly-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N YYKZDTVQHTUKDW-RYUDHWBXSA-N 0.000 description 1
- CMHTUJQZQXFNTQ-OEAJRASXSA-N Phe-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O CMHTUJQZQXFNTQ-OEAJRASXSA-N 0.000 description 1
- DMEYUTSDVRCWRS-ULQDDVLXSA-N Phe-Lys-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DMEYUTSDVRCWRS-ULQDDVLXSA-N 0.000 description 1
- FGWUALWGCZJQDJ-URLPEUOOSA-N Phe-Thr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGWUALWGCZJQDJ-URLPEUOOSA-N 0.000 description 1
- WDOCBGZHAQQIBL-IHPCNDPISA-N Phe-Trp-Ser Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CO)C(O)=O)C1=CC=CC=C1 WDOCBGZHAQQIBL-IHPCNDPISA-N 0.000 description 1
- KIQUCMUULDXTAZ-HJOGWXRNSA-N Phe-Tyr-Tyr Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O KIQUCMUULDXTAZ-HJOGWXRNSA-N 0.000 description 1
- 208000006664 Precursor Cell Lymphoblastic Leukemia-Lymphoma Diseases 0.000 description 1
- KDIIENQUNVNWHR-JYJNAYRXSA-N Pro-Arg-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KDIIENQUNVNWHR-JYJNAYRXSA-N 0.000 description 1
- XKHCJJPNXFBADI-DCAQKATOSA-N Pro-Asp-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O XKHCJJPNXFBADI-DCAQKATOSA-N 0.000 description 1
- XJROSHJRQTXWAE-XGEHTFHBSA-N Pro-Cys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XJROSHJRQTXWAE-XGEHTFHBSA-N 0.000 description 1
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 1
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 1
- FEPSEIDIPBMIOS-QXEWZRGKSA-N Pro-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEPSEIDIPBMIOS-QXEWZRGKSA-N 0.000 description 1
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 1
- BFXZQMWKTYWGCF-PYJNHQTQSA-N Pro-His-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BFXZQMWKTYWGCF-PYJNHQTQSA-N 0.000 description 1
- CLJLVCYFABNTHP-DCAQKATOSA-N Pro-Leu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O CLJLVCYFABNTHP-DCAQKATOSA-N 0.000 description 1
- CPRLKHJUFAXVTD-ULQDDVLXSA-N Pro-Leu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CPRLKHJUFAXVTD-ULQDDVLXSA-N 0.000 description 1
- PUQRDHNIOONJJN-AVGNSLFASA-N Pro-Lys-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O PUQRDHNIOONJJN-AVGNSLFASA-N 0.000 description 1
- JLMZKEQFMVORMA-SRVKXCTJSA-N Pro-Pro-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 JLMZKEQFMVORMA-SRVKXCTJSA-N 0.000 description 1
- QAAYIXYLEMRULP-SRVKXCTJSA-N Pro-Pro-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 QAAYIXYLEMRULP-SRVKXCTJSA-N 0.000 description 1
- AJNGQVUFQUVRQT-JYJNAYRXSA-N Pro-Pro-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 AJNGQVUFQUVRQT-JYJNAYRXSA-N 0.000 description 1
- KBUAPZAZPWNYSW-SRVKXCTJSA-N Pro-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KBUAPZAZPWNYSW-SRVKXCTJSA-N 0.000 description 1
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 239000013614 RNA sample Substances 0.000 description 1
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 1
- 241000283984 Rodentia Species 0.000 description 1
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 1
- QEDMOZUJTGEIBF-FXQIFTODSA-N Ser-Arg-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O QEDMOZUJTGEIBF-FXQIFTODSA-N 0.000 description 1
- WXUBSIDKNMFAGS-IHRRRGAJSA-N Ser-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXUBSIDKNMFAGS-IHRRRGAJSA-N 0.000 description 1
- OBXVZEAMXFSGPU-FXQIFTODSA-N Ser-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)CN=C(N)N OBXVZEAMXFSGPU-FXQIFTODSA-N 0.000 description 1
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 1
- KCFKKAQKRZBWJB-ZLUOBGJFSA-N Ser-Cys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O KCFKKAQKRZBWJB-ZLUOBGJFSA-N 0.000 description 1
- GRSLLFZTTLBOQX-CIUDSAMLSA-N Ser-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N GRSLLFZTTLBOQX-CIUDSAMLSA-N 0.000 description 1
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 1
- JEHPKECJCALLRW-CUJWVEQBSA-N Ser-His-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEHPKECJCALLRW-CUJWVEQBSA-N 0.000 description 1
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 1
- LPSKHZWBQONOQJ-XIRDDKMYSA-N Ser-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N LPSKHZWBQONOQJ-XIRDDKMYSA-N 0.000 description 1
- BUYHXYIUQUBEQP-AVGNSLFASA-N Ser-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N BUYHXYIUQUBEQP-AVGNSLFASA-N 0.000 description 1
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 1
- DKGRNFUXVTYRAS-UBHSHLNASA-N Ser-Ser-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DKGRNFUXVTYRAS-UBHSHLNASA-N 0.000 description 1
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 1
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 1
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 1
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 1
- VVKVHAOOUGNDPJ-SRVKXCTJSA-N Ser-Tyr-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VVKVHAOOUGNDPJ-SRVKXCTJSA-N 0.000 description 1
- DFTCYYILCSQGIZ-GCJQMDKQSA-N Thr-Ala-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFTCYYILCSQGIZ-GCJQMDKQSA-N 0.000 description 1
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 1
- GFDUZZACIWNMPE-KZVJFYERSA-N Thr-Ala-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O GFDUZZACIWNMPE-KZVJFYERSA-N 0.000 description 1
- GZYNMZQXFRWDFH-YTWAJWBKSA-N Thr-Arg-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O GZYNMZQXFRWDFH-YTWAJWBKSA-N 0.000 description 1
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 1
- GNHRVXYZKWSJTF-HJGDQZAQSA-N Thr-Asp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GNHRVXYZKWSJTF-HJGDQZAQSA-N 0.000 description 1
- UZJDBCHMIQXLOQ-HEIBUPTGSA-N Thr-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O UZJDBCHMIQXLOQ-HEIBUPTGSA-N 0.000 description 1
- DIPIPFHFLPTCLK-LOKLDPHHSA-N Thr-Gln-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O DIPIPFHFLPTCLK-LOKLDPHHSA-N 0.000 description 1
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 1
- YZUWGFXVVZQJEI-PMVVWTBXSA-N Thr-Gly-His Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O YZUWGFXVVZQJEI-PMVVWTBXSA-N 0.000 description 1
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 1
- ZSPQUTWLWGWTPS-HJGDQZAQSA-N Thr-Lys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZSPQUTWLWGWTPS-HJGDQZAQSA-N 0.000 description 1
- HUPLKEHTTQBXSC-YJRXYDGGSA-N Thr-Ser-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUPLKEHTTQBXSC-YJRXYDGGSA-N 0.000 description 1
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 1
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 1
- QAXCHNZDPLSFPC-PJODQICGSA-N Trp-Ala-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 QAXCHNZDPLSFPC-PJODQICGSA-N 0.000 description 1
- CWQZAUYFWRLITN-AVGNSLFASA-N Tyr-Gln-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)O CWQZAUYFWRLITN-AVGNSLFASA-N 0.000 description 1
- HZZKQZDUIKVFDZ-AVGNSLFASA-N Tyr-Gln-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)O HZZKQZDUIKVFDZ-AVGNSLFASA-N 0.000 description 1
- HKYTWJOWZTWBQB-AVGNSLFASA-N Tyr-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HKYTWJOWZTWBQB-AVGNSLFASA-N 0.000 description 1
- FNWGDMZVYBVAGJ-XEGUGMAKSA-N Tyr-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CC=C(C=C1)O)N FNWGDMZVYBVAGJ-XEGUGMAKSA-N 0.000 description 1
- JJNXZIPLIXIGBX-HJPIBITLSA-N Tyr-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JJNXZIPLIXIGBX-HJPIBITLSA-N 0.000 description 1
- RIVVDNTUSRVTQT-IRIUXVKKSA-N Tyr-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O RIVVDNTUSRVTQT-IRIUXVKKSA-N 0.000 description 1
- PWKMJDQXKCENMF-MEYUZBJRSA-N Tyr-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O PWKMJDQXKCENMF-MEYUZBJRSA-N 0.000 description 1
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 1
- UBTBGUDNDFZLGP-SRVKXCTJSA-N Val-Arg-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UBTBGUDNDFZLGP-SRVKXCTJSA-N 0.000 description 1
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 1
- UZDHNIJRRTUKKC-DLOVCJGASA-N Val-Gln-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UZDHNIJRRTUKKC-DLOVCJGASA-N 0.000 description 1
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 1
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 1
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 1
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 1
- SDSCOOZQQGUQFC-GVXVVHGQSA-N Val-His-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N SDSCOOZQQGUQFC-GVXVVHGQSA-N 0.000 description 1
- ZIGZPYJXIWLQFC-QTKMDUPCSA-N Val-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C(C)C)N)O ZIGZPYJXIWLQFC-QTKMDUPCSA-N 0.000 description 1
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 1
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 1
- BMOFUVHDBROBSE-DCAQKATOSA-N Val-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N BMOFUVHDBROBSE-DCAQKATOSA-N 0.000 description 1
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 1
- WBAJDGWKRIHOAC-GVXVVHGQSA-N Val-Lys-Gln Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O WBAJDGWKRIHOAC-GVXVVHGQSA-N 0.000 description 1
- VENKIVFKIPGEJN-NHCYSSNCSA-N Val-Met-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N VENKIVFKIPGEJN-NHCYSSNCSA-N 0.000 description 1
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 1
- ZNGPROMGGGFOAA-JYJNAYRXSA-N Val-Tyr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 ZNGPROMGGGFOAA-JYJNAYRXSA-N 0.000 description 1
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 1
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- QYSXJUFSXHHAJI-XFEUOLMDSA-N Vitamin D3 Natural products C1(/[C@@H]2CC[C@@H]([C@]2(CCC1)C)[C@H](C)CCCC(C)C)=C/C=C1\C[C@@H](O)CCC1=C QYSXJUFSXHHAJI-XFEUOLMDSA-N 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 210000003486 adipose tissue brown Anatomy 0.000 description 1
- 210000000593 adipose tissue white Anatomy 0.000 description 1
- 230000001270 agonistic effect Effects 0.000 description 1
- 238000012867 alanine scanning Methods 0.000 description 1
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- 230000003042 antagnostic effect Effects 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 239000000074 antisense oligonucleotide Substances 0.000 description 1
- 238000012230 antisense oligonucleotides Methods 0.000 description 1
- 108010013835 arginine glutamate Proteins 0.000 description 1
- 210000001188 articular cartilage Anatomy 0.000 description 1
- 125000003118 aryl group Chemical group 0.000 description 1
- 108010021908 aspartyl-aspartyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 239000012620 biological material Substances 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 210000001185 bone marrow Anatomy 0.000 description 1
- 229940098773 bovine serum albumin Drugs 0.000 description 1
- 238000010804 cDNA synthesis Methods 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 238000002659 cell therapy Methods 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 150000001945 cysteines Chemical class 0.000 description 1
- 229960003067 cystine Drugs 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- UREBDLICKHMUKA-CXSFZGCWSA-N dexamethasone Chemical compound C1CC2=CC(=O)C=C[C@]2(C)[C@]2(F)[C@@H]1[C@@H]1C[C@@H](C)[C@@](C(=O)CO)(O)[C@@]1(C)C[C@@H]2O UREBDLICKHMUKA-CXSFZGCWSA-N 0.000 description 1
- 229960003957 dexamethasone Drugs 0.000 description 1
- 229960000633 dextran sulfate Drugs 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037765 diseases and disorders Diseases 0.000 description 1
- 239000003937 drug carrier Substances 0.000 description 1
- 210000001198 duodenum Anatomy 0.000 description 1
- 210000002257 embryonic structure Anatomy 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 238000001400 expression cloning Methods 0.000 description 1
- 229960002949 fluorouracil Drugs 0.000 description 1
- 238000010230 functional analysis Methods 0.000 description 1
- 238000003209 gene knockout Methods 0.000 description 1
- 238000001415 gene therapy Methods 0.000 description 1
- 108010001064 glycyl-glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 1
- 108010025801 glycyl-prolyl-arginine Proteins 0.000 description 1
- 108010079413 glycyl-prolyl-glutamic acid Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 210000003405 ileum Anatomy 0.000 description 1
- 230000001900 immune effect Effects 0.000 description 1
- 230000002163 immunogen Effects 0.000 description 1
- 230000005847 immunogenicity Effects 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 210000003000 inclusion body Anatomy 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 229960003786 inosine Drugs 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- 210000003125 jurkat cell Anatomy 0.000 description 1
- 210000000629 knee joint Anatomy 0.000 description 1
- 108010053037 kyotorphin Proteins 0.000 description 1
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 1
- 210000000265 leukocyte Anatomy 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 201000005296 lung carcinoma Diseases 0.000 description 1
- 208000003747 lymphoid leukemia Diseases 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000013178 mathematical model Methods 0.000 description 1
- 201000001441 melanoma Diseases 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- IZAGSTRIDUNNOY-UHFFFAOYSA-N methyl 2-[(2,4-dioxo-1h-pyrimidin-5-yl)oxy]acetate Chemical compound COC(=O)COC1=CNC(=O)NC1=O IZAGSTRIDUNNOY-UHFFFAOYSA-N 0.000 description 1
- 150000004702 methyl esters Chemical class 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- XJVXMWNLQRTRGH-UHFFFAOYSA-N n-(3-methylbut-3-enyl)-2-methylsulfanyl-7h-purin-6-amine Chemical compound CSC1=NC(NCCC(C)=C)=C2NC=NC2=N1 XJVXMWNLQRTRGH-UHFFFAOYSA-N 0.000 description 1
- 229960004927 neomycin Drugs 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 239000002751 oligonucleotide probe Substances 0.000 description 1
- 230000001582 osteoblastic effect Effects 0.000 description 1
- 230000036961 partial effect Effects 0.000 description 1
- 238000010647 peptide synthesis reaction Methods 0.000 description 1
- 239000000816 peptidomimetic Substances 0.000 description 1
- 210000005259 peripheral blood Anatomy 0.000 description 1
- 239000011886 peripheral blood Substances 0.000 description 1
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 1
- 108010089198 phenylalanyl-prolyl-arginine Proteins 0.000 description 1
- 210000004896 polypeptide structure Anatomy 0.000 description 1
- 239000001267 polyvinylpyrrolidone Substances 0.000 description 1
- 235000013855 polyvinylpyrrolidone Nutrition 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 230000002797 proteolythic effect Effects 0.000 description 1
- 150000003254 radicals Chemical class 0.000 description 1
- 230000008707 rearrangement Effects 0.000 description 1
- 102000005962 receptors Human genes 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 235000019515 salmon Nutrition 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 230000019491 signal transduction Effects 0.000 description 1
- 210000000813 small intestine Anatomy 0.000 description 1
- 150000003384 small molecules Chemical class 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- FQENQNTWSFEDLI-UHFFFAOYSA-J sodium diphosphate Chemical compound [Na+].[Na+].[Na+].[Na+].[O-]P([O-])(=O)OP([O-])([O-])=O FQENQNTWSFEDLI-UHFFFAOYSA-J 0.000 description 1
- 229940083575 sodium dodecyl sulfate Drugs 0.000 description 1
- 229940048086 sodium pyrophosphate Drugs 0.000 description 1
- 210000002536 stromal cell Anatomy 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 238000010998 test method Methods 0.000 description 1
- 210000001550 testis Anatomy 0.000 description 1
- 235000019818 tetrasodium diphosphate Nutrition 0.000 description 1
- 239000001577 tetrasodium phosphonato phosphate Substances 0.000 description 1
- 238000002560 therapeutic procedure Methods 0.000 description 1
- 210000001541 thymus gland Anatomy 0.000 description 1
- 210000001519 tissue Anatomy 0.000 description 1
- 210000003437 trachea Anatomy 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 108010038745 tryptophylglycine Proteins 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 210000004291 uterus Anatomy 0.000 description 1
- QYSXJUFSXHHAJI-YRZJJWOYSA-N vitamin D3 Chemical compound C1(/[C@@H]2CC[C@@H]([C@]2(CCC1)C)[C@H](C)CCCC(C)C)=C\C=C1\C[C@@H](O)CCC1=C QYSXJUFSXHHAJI-YRZJJWOYSA-N 0.000 description 1
- 235000005282 vitamin D3 Nutrition 0.000 description 1
- 239000011647 vitamin D3 Substances 0.000 description 1
- 229940021056 vitamin d3 Drugs 0.000 description 1
- 239000011534 wash buffer Substances 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/705—Receptors; Cell surface antigens; Cell surface determinants
- C07K14/71—Receptors; Cell surface antigens; Cell surface determinants for growth factors; for growth regulators
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K49/00—Preparations for testing in vivo
- A61K49/0004—Screening or testing of compounds for diagnosis of disorders, assessment of conditions, e.g. renal clearance, gastric emptying, testing for diabetes, allergy, rheuma, pancreas functions
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P1/00—Drugs for disorders of the alimentary tract or the digestive system
- A61P1/02—Stomatological preparations, e.g. drugs for caries, aphtae, periodontitis
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P13/00—Drugs for disorders of the urinary system
- A61P13/12—Drugs for disorders of the urinary system of the kidneys
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P17/00—Drugs for dermatological disorders
- A61P17/02—Drugs for dermatological disorders for treating wounds, ulcers, burns, scars, keloids, or the like
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P19/00—Drugs for skeletal disorders
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P19/00—Drugs for skeletal disorders
- A61P19/08—Drugs for skeletal disorders for bone diseases, e.g. rachitism, Paget's disease
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P19/00—Drugs for skeletal disorders
- A61P19/08—Drugs for skeletal disorders for bone diseases, e.g. rachitism, Paget's disease
- A61P19/10—Drugs for skeletal disorders for bone diseases, e.g. rachitism, Paget's disease for osteoporosis
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P3/00—Drugs for disorders of the metabolism
- A61P3/04—Anorexiants; Antiobesity agents
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P3/00—Drugs for disorders of the metabolism
- A61P3/08—Drugs for disorders of the metabolism for glucose homeostasis
- A61P3/10—Drugs for disorders of the metabolism for glucose homeostasis for hyperglycaemia, e.g. antidiabetics
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P3/00—Drugs for disorders of the metabolism
- A61P3/12—Drugs for disorders of the metabolism for electrolyte homeostasis
- A61P3/14—Drugs for disorders of the metabolism for electrolyte homeostasis for calcium homeostasis
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P35/00—Antineoplastic agents
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P43/00—Drugs for specific purposes, not provided for in groups A61P1/00-A61P41/00
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P7/00—Drugs for disorders of the blood or the extracellular fluid
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P7/00—Drugs for disorders of the blood or the extracellular fluid
- A61P7/06—Antianaemics
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P9/00—Drugs for disorders of the cardiovascular system
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2217/00—Genetically modified animals
- A01K2217/05—Animals comprising random inserted nucleic acids (transgenic)
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K35/00—Medicinal preparations containing materials or reaction products thereof with undetermined constitution
- A61K35/12—Materials from mammals; Compositions comprising non-specified tissues or cells; Compositions comprising non-embryonic stem cells; Genetically modified cells
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2799/00—Uses of viruses
- C12N2799/02—Uses of viruses as vector
- C12N2799/021—Uses of viruses as vector for the expression of a heterologous nucleic acid
Definitions
- the present invention relates to Fibroblast Growth Factor Receptor-Like (FGFR-L) polypeptides and nucleic acid molecules encoding the same.
- the invention also relates to selective binding agents, vectors, host cells, and methods for producing FGFR-L polypeptides.
- the invention further relates to pharmaceutical compositions and methods for the diagnosis, treatment, amelioration, and/or prevention of diseases, disorders, and conditions associated with FGFR-L polypeptides.
- the present invention relates to novel FGFR-L nucleic acid molecules and encoded polypeptides.
- the invention provides for an isolated nucleic acid molecule comprising a nucleotide sequence selected from the group consisting of:
- the invention also provides for an isolated nucleic acid molecule comprising a nucleotide sequence selected from the group consisting of:
- nucleotide sequence encoding an allelic variant or splice variant of the nucleotide sequence as set forth in either SEQ ID NO: 1 or SEQ ID NO: 4, the nucleotide sequence of the DNA insert in ATCC Deposit No. ______, or (a);
- the invention further provides for an isolated nucleic acid molecule comprising a nucleotide sequence selected from the group consisting of:
- the present invention provides for an isolated polypeptide comprising an amino acid sequence selected from the group consisting of:
- the invention also provides for an isolated polypeptide comprising the amino acid sequence selected from the group consisting of
- the invention further provides for an isolated polypeptide comprising the amino acid sequence selected from the group consisting of:
- fusion polypeptides comprising FGFR-L amino acid sequences.
- the present invention also provides for an expression vector comprising the isolated nucleic acid molecules as set forth herein, recombinant host cells comprising the recombinant nucleic acid molecules as set forth herein, and a method of producing an FGFR-L polypeptide comprising culturing the host cells and optionally isolating the polypeptide so produced.
- a transgenic non-human animal comprising a nucleic acid molecule encoding an FGFR-L polypeptide is also encompassed by the invention.
- the FGFR-L nucleic acid molecules are introduced into the animal in a manner that allows expression and increased levels of an FGFR-L polypeptide, which may include increased circulating levels.
- the FGFR-L nucleic acid molecules are introduced into the animal in a manner that prevents expression of endogenous FGFR-L polypeptide (i.e., generates a transgenic animal possessing an FGFR-L polypeptide gene knockout).
- the transgenic non-human animal is preferably a mammal, and more preferably a rodent, such as a rat or a mouse.
- selective binding agents such as antibodies and peptides capable of specifically binding the FGFR-L polypeptides of the invention.
- Such antibodies and peptides may be agonistic or antagonistic.
- compositions comprising the nucleotides, polypeptides, or selective binding agents of the invention and one or more pharmaceutically acceptable formulation agents are also encompassed by the invention.
- the pharmaceutical compositions are used to provide therapeutically effective amounts of the nucleotides or polypeptides of the present invention.
- the invention is also directed to methods of using the polypeptides, nucleic acid molecules, and selective binding agents.
- FGFR-L polypeptides and nucleic acid molecules of the present invention may be used to treat, prevent, ameliorate, and/or detect diseases and disorders, including those recited herein.
- the present invention also provides a method of assaying test molecules to identify a test molecule that binds to an FGFR-L polypeptide.
- the method comprises contacting an FGFR-L polypeptide with a test molecule to determine the extent of binding of the test molecule to the polypeptide.
- the method further comprises determining whether such test molecules are agonists or antagonists of an FGFR-L polypeptide.
- the present invention further provides a method of testing the impact of molecules on the expression of FGFR-L polypeptide or on the activity of FGFR-L polypeptide.
- Methods of regulating expression and modulating (i.e., increasing or decreasing) levels of an FGFR-L polypeptide are also encompassed by the invention.
- One method comprises administering to an animal a nucleic acid molecule encoding an FGFR-L polypeptide.
- a nucleic acid molecule comprising elements that regulate or modulate the expression of an FGFR-L polypeptide may be administered. Examples of these methods include gene therapy, cell therapy, and anti-sense therapy as further described herein.
- the FGFR-L polypeptide can be used for identifying ligands thereof.
- Various forms of “expression cloning” have been used for cloning ligands for receptors (e.g., Davis et al., 1996 , Cell, 87:1161-69). These and other FGFR-L polypeptide ligand cloning experiments are described in greater detail herein. Isolation of an FGFR-L polypeptide ligand allows for the identification or development of novel agonists or antagonists of the FGFR-L polypeptide signaling pathway.
- Such agonists and antagonists include FGFR-L polypeptide ligands, anti-FGFR-L polypeptide ligand is antibodies and derivatives thereof, small molecules, or antisense oligonucleotides, any of which can be used for potentially treating one or more diseases or disorders, including those recited herein.
- FIGS. 1 A- 1 C illustrate the nucleotide sequence of the murine FGFR-L gene (SEQ ID NO: 1) and the deduced amino acid sequence of murine FGFR-L polypeptide (SEQ ID NO: 2).
- the predicted signal peptide (underline) and transmembrane domain (double-underline) are indicated;
- FIGS. 2 A- 2 B illustrate the amino acid sequence alignment of murine FGFR-L polypeptide (Smaf2-00017-f4; SEQ ID NO: 2) and Iberian ribbed newt ( Pleurodeles waltlii ) Fibroblast Growth Factor Receptor-4 (PIR:B49151; SEQ ID NO: 7);
- FIGS. 3 A- 3 B illustrate the nucleotide sequence of a cDNA clone encoding the N-terminal portion of the human FGFR-L gene (SEQ ID NO: 4) and the deduced amino acid sequence of the N-terminal portion of the human FGFR-L polypeptide (SEQ ID NO: 5).
- the predicted signal peptide (underline) and transmembrane domain (double-underline) are indicated;
- FIG. 4 illustrates the amino acid sequence alignment of murine FGFR-L polypeptide (SEQ ID NO: 2) and a virtual human FGFR-L polypeptide sequence (SEQ ID NO: 8) constructed from residues 1-472 of SEQ ID NO: 5 and residues 473-504 of GenBank Accession No. AJ277437.
- the predicted signal peptide underline
- transmembrane domain double-underline
- N-linked glycosylation sites bold
- FIG. 5 illustrates the expression of FGFR-L MRNA as detected by Northern blot analysis in day 7, 11, 15, and 17 mouse embryos
- FIG. 6 illustrates the expression of FGFR-L mrRNA as detected by Northern blot analysis in murine heart, brain, spleen, lung, liver, skeletal muscle, kidney, and testis;
- FIG. 7 illustrates the expression of FGFR-L mRNA as detected by Northern blot analysis in NIH 3T3 cells and F10, F4, and D3 mouse bone marrow-derived stromal cell lines;
- FIG. 8 illustrates the expression of FGFR-L mRNA as detected by Northern blot analysis in human brain, heart, skeletal muscle, colon, thymus, spleen, kidney, liver, small intestine, placenta, lung, and peripheral blood leukocytes;
- FIG. 9 illustrates the expression of FGFR-L mRNA as detected by Northern blot analysis in promyelocytic leukemia HL-60 cells, HeLa S3 cells, chronic myelogenous leukemia L-562 cells, lymphoblastic leukemia MOLT-4 cells, Burkitt's lymphoma Raji cells, colorectal adenocarcinoma SW480 cells, lung carcinoma A549 cells, and melanoma G361 cells;
- FIG. 10 illustrates the expression of FGFR-L mRNA as detected by Northern blot analysis in human heart, brain, placenta, lung, liver, skeletal muscle, kidney, and pancreas;
- FIG. 11 illustrates the expression of FGFR-L mRNA as detected by Northern blot analysis in 266-6 cells, AR42J cells, CaPan I cells, HIG-82 cells, OHS4 cells, SW 1353 cells, SW 872 cells, K562 (old, i.e., later passage) cells, K562 (new, i.e., earlier passage) cells, Jurkat cells, and F4 cells;
- FIGS. 12 A- 12 B illustrate the expression of FGFR-L mRNA as detected by Northern blot analysis in human adipose tissue (using a human FGFR-L-derived probe) and murine adipose tissue (using a murine FGFR-L-derived probe);
- FIG. 13 illustrates the expression of FGFR-L mRNA in a number of murine tissues as detected in an RNAse protection assay.
- the absence of the cyclophilin band in the pancreas RNA sample suggests that thi sample was degraded;
- FIG. 17 illustrates the induction of FGFR-L MRNA in osteoblastic ST2 cells under conditions of osteoclastogenesis (i.e., 5-day exposure to vitamin D3 and dexamethasone);
- FIG. 18 illustrates the results of Western blot analysis of E. coli -derived Des7-FGFR-L/ECD and CHO-derived FGFR-L/ECD-Fc proteins using FGFR-L polypeptide antiserum;
- FIG. 19 illustrates the results of Western blot analysis of murine eye (lane 1) and adipose tissue (lane 2) using FGFR-L polypeptide antiserum;
- FIGS. 20 A- 20 B illustrate the results of FACS analysis on F4 and D3 bone marrow stromal cells using FGFR-L polypeptide antiserum
- FIGS. 21 A- 21 D illustrate the results of proliferation assays using D3 bone marrow stromal cells (either untransduced or transduced with a construct encoding FGFR-L polypeptide) following 72 hour exposure to rhuPDGF (panel A), rhuFGF-2 (panel B), rhuFGF-4 (panel C), or rhuFGF-6 (panel D);
- FIG. 22 illustrates the results of proliferation assays using A5-F bone marrow stromal cells following exposure to E. coli -derived Des7-FGFR-L/ECD protein and serum, PDGF, FGF-2, FGF-4, or FGF-6;
- FIG. 23 illustrates the results of proliferation assays using A5-F bone marrow stromal cells following exposure to CHO-derived FGFR-L/ECD-Fc protein and serum, PDGF, FGF-4, or FGF-6;
- FIG. 24 illustrates the expression of the neomycin resistance gene as detected by Northern blot analysis of peripheral blood mononuclear cell (PBMN) RNA from two FGFR-L/neo-transduced mice (lanes 1 and 2) and two neo-transduced control mice (lanes 3 and 4).
- PBMN peripheral blood mononuclear cell
- FGFR-L gene or “FGFR-L nucleic acid molecule” or “FGFR-L polynucleotide” refer to a nucleic acid molecule comprising or consisting of a nucleotide sequence as set forth in either SEQ ID NO: 1 or SEQ ID NO: 4, a nucleotide sequence encoding the polypeptide as set forth in either SEQ ID NO: 2 or SEQ ID NO: 5, a nucleotide sequence of the DNA insert in ATCC Deposit No. ______, and nucleic acid molecules as defined herein.
- FGFR-L polypeptide allelic variant refers to one of several possible naturally occurring alternate forms of a gene occupying a given locus on a chromosome of an organism or a population of organisms.
- FGFR-L polypeptide splice variant refers to a nucleic acid molecule, usually RNA, which is generated by alternative processing of intron sequences in an RNA transcript of FGFR-L polypeptide amino acid sequence as set forth in either SEQ ID NO: 2 or SEQ ID NO: 5.
- isolated nucleic acid molecule refers to a nucleic acid molecule of the invention that (1) has been separated from at least about 50 percent of proteins, lipids, carbohydrates, or other materials with which it is naturally found when total nucleic acid is isolated from the source cells, (2) is not linked to all or a portion of a polynucleotide to which the “isolated nucleic acid molecule” is linked in nature, (3) is operably linked to a polynucleotide which it is not linked to in nature, or (4) does not occur in nature as part of a larger polynucleotide sequence.
- the isolated nucleic acid molecule of the present invention is substantially free from any other contaminating nucleic acid molecule(s) or other contaminants that are found in its natural environment that would interfere with its use in polypeptide production or its therapeutic, diagnostic, prophylactic or research use.
- nucleic acid sequence refers to a DNA or RNA sequence.
- the term encompasses molecules formed from any of the known base analogs of DNA and RNA such as, but not limited to 4-acetylcytosine, 8-hydroxy-N6-methyladenosine, aziridinyl-cytosine, pseudoisocytosine, 5-(carboxyhydroxylmethyl) uracil, 5-fluorouracil, 5-bromouracil, 5-carboxymethylaminomethyl-2-thiouracil, 5-carboxy-methylaminomethyluracil, dihydrouracil, inosine, N6-iso-pentenyladenine, 1-methyladenine, 1-methylpseudouracil, 1-methylguanine, 1-methylinosine, 2,2-dimethyl-guanine, 2-methyladenine, -2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-methyladenine
- vector is used to refer to any molecule (e.g., nucleic acid, plasmid, or virus) used to transfer coding information to a host cell.
- molecule e.g., nucleic acid, plasmid, or virus
- expression vector refers to a vector that is suitable for transformation of a host cell and contains nucleic acid sequences that direct and/or control the expression of inserted heterologous nucleic acid sequences. Expression includes, but is not limited to, processes such as transcription, translation, and RNA splicing, if introns are present.
- flanking sequence operably linked is used herein to refer to an arrangement of flanking sequences wherein the flanking sequences so described are configured or assembled so as to perform their usual function.
- a flanking sequence operably linked to a coding sequence may be capable of effecting the replication, transcription and/or translation of the coding sequence.
- a coding sequence is operably linked to a promoter when the promoter is capable of directing transcription of that coding sequence.
- a flanking sequence need not be contiguous with the coding sequence, so long as it functions correctly.
- intervening untranslated yet transcribed sequences can be present between a promoter sequence and the coding sequence and the promoter sequence can still be considered “operably linked” to the coding sequence.
- the term “host cell” is used to refer to a cell which has been transformed, or is capable of being transformed with a nucleic acid sequence and then of expressing a selected gene of interest.
- the term includes the progeny of the parent cell, whether or not the progeny is identical in morphology or in genetic make-up to the original parent, so long as the selected gene is present.
- FGFR-L polypeptide refers to a polypeptide comprising the amino acid sequence of either SEQ ID NO.: 2 or SEQ ID NO: 5 and related polypeptides.
- Related polypeptides include FGFR-L polypeptide fragments, FGFR-L polypeptide orthologs, FGFR-L polypeptide variants, and FGFR-L polypeptide derivatives, which possess at least one activity of the polypeptide as set forth in either SEQ ID NO: 2 or SEQ ID NO: 5.
- FGFR-L polypeptides may be mature polypeptides, as defined herein, and may or may not have an amino-terminal methionine residue, depending on the method by which they are prepared.
- FGFR-L polypeptide fragment refers to a polypeptide that comprises a truncation at the amino-terminus (with or without a leader sequence) and/or a truncation at the carboxyl-terminus of the polypeptide as set forth in either SEQ ID NO: 2 or SEQ ID NO: 5.
- FGFR-L polypeptide fragment also refers to amino-terminal and/or carboxyl-terminal truncations of FGFR-L polypeptide orthologs, FGFR-L polypeptide derivatives, or FGFR-L polypeptide variants, or to amino-terminal and/or carboxyl-terminal truncations of the polypeptides encoded by FGFR-L polypeptide allelic variants or FGFR-L polypeptide splice variants.
- FGFR-L polypeptide fragments may result from alternative RNA splicing or from in vivo protease activity.
- Membrane-bound forms of an FGFR-L polypeptide are also contemplated by the present invention.
- truncations and/or deletions comprise about 10 amino acids, or about 20 amino acids, or about 50 amino acids, or about 75 amino acids, or about 100 amino acids, or more than about 100 amino acids.
- the polypeptide fragments so produced will comprise about 25 contiguous amino acids, or about 50 amino acids, or about 75. amino acids, or about 100 amino acids, or about 150 amino acids, or about 200 amino acids, or more than about 200 amino acids.
- Such FGFR-L polypeptide fragments may optionally comprise an amino-terminal methionine residue. It will be appreciated that such fragments can be used, for example, to generate antibodies to FGFR-L polypeptides.
- FGFR-L polypeptide ortholog refers to a polypeptide from another species that corresponds to FGFR-L polypeptide amino acid sequence as set forth in either SEQ ID NO: 2 or SEQ ID NO: 5.
- SEQ ID NO: 2 For example, mouse and human FGFR-L polypeptides are considered orthologs of each other.
- FGFR-L polypeptide variants refers to FGFR-L polypeptides comprising amino acid sequences having one or more amino acid sequence substitutions, deletions (such as internal deletions and/or FGFR-L polypeptide fragments), and/or additions (such as internal additions and/or FGFR-L fusion polypeptides) as compared to the FGFR-L polypeptide amino acid sequence set forth in either SEQ ID NO: 2 or SEQ ID NO: 5 (with or without a leader sequence).
- Variants may be naturally occurring (e.g., FGFR-L polypeptide allelic variants, FGFR-L polypeptide orthologs, and FGFR-L polypeptide splice variants) or artificially constructed.
- Such FGFR-L polypeptide variants may be prepared from the corresponding nucleic acid molecules having a DNA sequence that varies accordingly from the DNA sequence as set forth in either SEQ ID NO: 1 or SEQ ID NO: 4.
- the variants have from 1 to 3, or from 1 to 5, or from 1 to 10, or from 1 to 15, or from 1 to 20, or from 1 to 25, or from 1 to 50, or from 1 to 75, or from 1 to 100, or more than 100 amino acid substitutions, insertions, additions and/or deletions, wherein the substitutions may be conservative, or non-conservative, or any combination thereof.
- FGFR-L polypeptide derivatives refers to the polypeptide as set forth in either SEQ ID NO: 2 or SEQ ID NO: 5, FGFR-L polypeptide fragments, FGFR-L polypeptide orthologs, or FGFR-L polypeptide variants, as defined herein, that have been chemically modified.
- FGFR-L polypeptide derivatives also refers to the polypeptides encoded by FGFR-L polypeptide allelic variants or FGFR-L polypeptide splice variants, as defined herein, that have been chemically modified.
- mature FGFR-L polypeptide refers to an FGFR-L polypeptide lacking a leader sequence.
- a mature FGFR-L polypeptide may also include other modifications such as proteolytic processing of the amino-terminus (with or without a leader sequence) and/or the carboxyl-terminus, cleavage of a smaller polypeptide from a larger precursor, N-linked and/or O-linked glycosylation, and the like.
- An exemplary mature FGFR-L polypeptide is depicted by the amino acid sequence of either SEQ ID NO: 3 or SEQ ID NO: 6.
- FGFR-L fusion polypeptide refers to a fusion of one or more amino acids (such as a heterologous protein or peptide) at the amino- or carboxyl-terminus of the polypeptide as set forth in either SEQ ID NO: 2 or SEQ ID NO: 5, FGFR-L polypeptide fragments, FGFR-L polypeptide orthologs, FGFR-L polypeptide variants, or FGFR-L derivatives, as defined herein.
- FGFR-L fusion polypeptide also refers to a fusion of one or more amino acids at the amino- or carboxyl-terminus of the polypeptide encoded by FGFR-L polypeptide allelic variants or FGFR-L polypeptide splice variants, as defined herein.
- biologically active FGFR-L polypeptides refers to FGFR-L polypeptides having at least one activity characteristic of the polypeptide comprising the amino acid sequence of either SEQ ID NO: 2 or SEQ ID NO: 5.
- an FGFR-L polypeptide may be active as an immunogen; that is, the FGFR-L polypeptide contains at least one epitope to which antibodies may be raised.
- isolated polypeptide refers to a polypeptide of the present invention that (1) has been separated from at least about 50 percent of polynucleotides, lipids, carbohydrates, or other materials with which it is naturally found when isolated from the source cell, (2) is not linked (by covalent or noncovalent interaction) to all or a portion of a polypeptide to which the “isolated polypeptide” is linked in nature, (3) is operably linked (by covalent or noncovalent interaction) to a polypeptide with which it is not linked in nature, or (4) does not occur in nature.
- the isolated polypeptide is substantially free from any other contaminating polypeptides or other contaminants that are found in its natural environment that would interfere with its therapeutic, diagnostic, prophylactic or research use.
- identity refers to a relationship between the sequences of two or more polypeptide molecules or two or more nucleic acid molecules, as determined by comparing the sequences.
- identity also means the degree of sequence relatedness between nucleic acid molecules or polypeptides, as the case may be, as determined by the match between strings of two or more nucleotide or two or more amino acid sequences. “Identity” measures the percent of identical matches between the smaller of two or more sequences with gap alignments (if any) addressed by a particular mathematical model or computer program (i.e., “algorithms”).
- similarity is a related concept, but in contrast to “identity,” “similarity” refers to a measure of relatedness which includes both identical matches and conservative substitution matches. If two polypeptide sequences have, for example, ⁇ fraction (10/20) ⁇ identical amino acids, and the remainder are all non-conservative substitutions, then the percent identity and similarity would both be 50%. If in the same example, there are five more positions where there are conservative substitutions, then the percent identity remains 50%, but the percent similarity would be 75% ( ⁇ fraction (15/20) ⁇ ). Therefore, in cases where there are conservative substitutions, the percent similarity between two polypeptides will be higher than the percent identity between those two polypeptides.
- non-naturally occurring refers to materials which are found in nature and are not manipulated by man.
- non-naturally occurring refers to a material that is not found in nature or that has been structurally modified or synthesized by man.
- FGFR-L polypeptide or FGFR-L nucleic acid molecule used to support an observable level of one or more biological activities of the FGFR-L polypeptides as set forth herein.
- pharmaceutically acceptable carrier or “physiologically acceptable carrier” as used herein refers to one or more formulation materials suitable for accomplishing or enhancing the delivery of the FGFR-L polypeptide, FGFR-L nucleic acid molecule, or FGFR-L selective binding agent as a pharmaceutical composition.
- antigen refers to a molecule or a portion of a molecule capable of being bound by a selective binding agent, such as an antibody, and additionally capable of being used in an animal to produce antibodies capable of binding to an epitope of that antigen.
- a selective binding agent such as an antibody
- An antigen may have one or more epitopes.
- selective binding agent refers to a molecule or molecules having specificity for an FGFR-L polypeptide.
- specific and specificity refer to the ability of the selective binding agents to bind to human FGFR-L polypeptides and not to bind to human non-FGFR-L polypeptides. It will be appreciated, however, that the selective binding agents may also bind orthologs of the polypeptide as set forth in either SEQ ID NO: 2 or SEQ ID NO: 5, that is, interspecies versions thereof, such as mouse and rat FGFR-L polypeptides.
- transduction is used to refer to the transfer of genes from one bacterium to another, usually by a phage. “Transduction” also refers to the acquisition and transfer of eukaryotic cellular sequences by retroviruses.
- transfection is used to refer to the uptake of foreign or exogenous DNA by a cell, and a cell has been “transfected” when the exogenous DNA has been introduced inside the cell membrane.
- transfection techniques are well known in the art and are disclosed herein. See, e.g., Graham et al., 1973 , Virology 52:456; Sambrook et al., Molecular Cloning, A Laboratory Manual (Cold Spring Harbor Laboratories, 1989); Davis et al., Basic Methods in Molecular Biology (Elsevier, 1986); and Chu et al., 1981 , Gene 13:197.
- Such techniques can be used to introduce one or more exogenous DNA moieties into suitable host cells.
- transformation refers to a change in a cell's genetic characteristics, and a cell has been transformed when it has been modified to contain a new DNA.
- a cell is transformed where it is genetically modified from its native state.
- the transforming DNA may recombine with that of the cell by physically integrating into a chromosome of the cell, may be maintained transiently as an episomal element without being replicated, or may replicate independently as a plasmid.
- a cell is considered to have been stably transformed when the DNA is replicated with the division of the cell.
- nucleic acid molecules include allelic or splice variants of the nucleic acid molecule of either SEQ ID NO: 1 or SEQ ID NO: 4, and include sequences which are complementary to any of the above nucleotide sequences.
- Related nucleic acid molecules also include a nucleotide sequence encoding a polypeptide comprising or consisting essentially of a substitution, modification, addition and/or deletion of one or more amino acid residues compared to the polypeptide in either SEQ ID NO: 2 or SEQ ID NO: 5.
- Such related FGFR-L polypeptides may comprise, for example, an addition and/or a deletion of one or more N-linked or O-linked glycosylation sites or an addition and/or a deletion of one or more cysteine residues.
- nucleic acid molecules also include fragments of FGFR-L nucleic acid molecules which encode a polypeptide of at least about 25 contiguous amino acids, or about 50 amino acids, or about 75 amino acids, or about 100 amino acids, or about 150 amino acids, or about 200 amino acids, or more than about 200 amino acid residues of the FGFR-L polypeptide of either SEQ ID NO: 2 or SEQ ID NO: 5.
- related FGFR-L nucleic acid molecules also include those molecules which comprise nucleotide sequences which hybridize under moderately or highly stringent conditions as defined herein with the fully complementary sequence of the FGFR-L nucleic acid molecule of either SEQ ID NO: 1 or SEQ ID NO: 4, or of a molecule encoding a polypeptide, which polypeptide comprises the amino acid sequence as shown in either SEQ ID NO: 2 or SEQ ID NO: 5, or of a nucleic acid fragment as defined herein, or of a nucleic acid fragment encoding a polypeptide as defined herein.
- Hybridization probes may be prepared using the FGFR-L sequences provided herein to screen cDNA, genomic or synthetic DNA libraries for related sequences. Regions of the DNA and/or amino acid sequence of FGFR-L polypeptide that exhibit significant identity to known sequences are readily determined using sequence alignment algorithms as described herein and those regions may be used to design probes for screening.
- highly stringent conditions refers to those conditions that are designed to permit hybridization of DNA strands whose sequences are highly complementary, and to exclude hybridization of significantly mismatched DNAs. Hybridization stringency is principally determined by temperature, ionic strength, and the concentration of denaturing agents such as formamide. Examples of “highly stringent conditions” for hybridization and washing are 0.015 M sodium FGFR-Loride, 0.0015 M sodium citrate at 65-68° C. or 0.015 M sodium FGFR-Loride, 0.0015 M sodium citrate, and 50% formamide at 42° C.
- More stringent conditions may also be used—however, the rate of hybridization will be affected.
- Other agents may be included in the hybridization and Washing buffers for the purpose of reducing non-specific and/or background hybridization. Examples are 0.1% bovine serum albumin, 0.1% polyvinyl-pyrrolidone, 0.1% sodium pyrophosphate, 0.1% sodium dodecylsulfate, NaDodSO 4 , (SDS), ficoll, Denhardt's solution, sonicated salmon sperm DNA (or another non-complementary DNA), and dextran sulfate, although other suitable agents can also be used.
- Factors affecting the stability of DNA duplex include base composition, length, and degree of base pair mismatch. Hybridization conditions can be adjusted by one skilled in the art in order to accommodate these variables and allow DNAs of different sequence relatedness to form hybrids.
- the melting temperature of a perfectly matched DNA duplex can be estimated by the following equation:
- T m ( ° C.) 81.5+16.6( log[Na +])+0.41(% G+C ) ⁇ 600 /N ⁇ 0.72(% formamide)
- N is the length of the duplex formed
- [Na+] is the molar concentration of the sodium ion in the hybridization or washing solution
- % G+C is the percentage of (guanine+cytosine) bases in the hybrid.
- the melting temperature is reduced by approximately 1° C. for each 1% mismatch.
- moderately stringent conditions refers to conditions under which a DNA duplex with a greater degree of base pair mismatching than could occur under “highly stringent conditions” is able to form.
- typical “moderately stringent conditions” are 0.015 M sodium FGFR-Loride, 0.0015 M sodium citrate at 50-65° C. or 0.015 M sodium FGFR-Loride, 0.0015 M sodium citrate, and 20% formamide at 37-50° C.
- “moderately stringent conditions” of 50° C. in 0.015 M sodium ion will allow about a 21% mismatch.
- Tm 2 ° C . per A ⁇ T base pair+4 ° C . per G ⁇ C base pair
- High stringency washing conditions for oligonucleotides are usually at a temperature of 0-5° C. below the Tm of the oligonucleotide in 6 ⁇ SSC, 0.1% SDS.
- nucleic acid molecules comprise or consist of a nucleotide sequence that is at least about 70 percent identical to the nucleotide sequence as shown in either SEQ ID NO: 1 or SEQ ID NO: 4, or comprise or consist essentially of a nucleotide sequence encoding a polypeptide that is at least about 70 percent identical to the polypeptide as set forth in either SEQ ID NO: 2 or SEQ ID NO: 5.
- the nucleotide sequences are about 75 percent, or about 80 percent, or about 85 percent, or about 90 percent, or about 95, 96, 97, 98, or 99 percent identical to the nucleotide sequence as shown in either SEQ ID NO: 1 or SEQ ID NO: 4, or the nucleotide sequences encode a polypeptide that is about 75 percent, or about 80 percent, or about 85 percent, or about 90 percent, or about 95, 96, 97, 98, or 99 percent identical to the polypeptide sequence as set forth in either SEQ ID NO: 2 or SEQ ID NO: 5.
- nucleic acid molecules encode polypeptides possessing at least one activity of the polypeptide set forth in either SEQ ID NO: 2 or SEQ ID NO: 5.
- amino acid sequence of either SEQ ID NO: 2 or SEQ ID NO: 5 will produce a polypeptide having functional and chemical characteristics similar to those of FGFR-L polypeptides.
- substantial modifications in the functional and/or chemical characteristics of FGFR-L polypeptides may be accomplished by selecting substitutions in the amino acid sequence of either SEQ ID NO: 2 or SEQ ID NO: 5 that differ significantly in their effect on maintaining (a) the structure of the molecular backbone in the area of the substitution, for example, as a sheet or helical conformation, (b) the charge or hydrophobicity of the molecule at the target site, or (c) the bulk of the side chain.
- a “conservative amino acid substitution” may involve a substitution of a native amino acid residue with a normative residue such that there is little or no effect on the polarity or charge of the amino acid residue at that position.
- any native residue in the polypeptide may also be substituted with alanine, as has been previously described for “alanine scanning mutagenesis.”
- amino acid residues that are typically incorporated by chemical peptide synthesis rather than by synthesis in biological systems. These include peptidomimetics, and other reversed or inverted forms of amino acid moieties.
- Naturally occurring residues may be divided into classes based on common side chain properties:
- non-conservative substitutions may involve the exchange of a member of one of these classes for a member from another class.
- Such substituted residues may be introduced into regions of the human FGFR-L polypeptide that are homologous with non-human FGFR-L polypeptides, or into the non-homologous regions of the molecule.
- Each amino acid has been assigned a hydropathic index on the basis of its hydrophobicity and charge characteristics.
- the hydropathic indices are: isoleucine (+4.5); valine (+4.2); leucine (+3.8); phenylalanine (+2.8); cysteine/cystine (+2.5); methionine (+1.9); alanine (+1.8); glycine ( ⁇ 0.4); threonine ( ⁇ 0.7); serine ( ⁇ 0.8); tryptophan ( ⁇ 0.9); tyrosine ( ⁇ 1.3); proline ( ⁇ 1.6); histidine ( ⁇ 3.2); glutamate ( ⁇ 3.5); glutamine ( ⁇ 3.5); aspartate ( ⁇ 3.5); asparagine ( ⁇ 3.5); lysine ( ⁇ 3.9); and arginine ( ⁇ 4.5).
- hydrophilicity values have been assigned to these amino acid residues: arginine (+3.0); lysine (+3.0); aspartate (+3.0 ⁇ 1); glutamate (+3.0 ⁇ 1); serine (+0.3); asparagine (+0.2); glutamine (+0.2); glycine (0); threonine ( ⁇ 0.4); proline ( ⁇ 0.5 ⁇ 1); alanine ( ⁇ 0.5); histidine ( ⁇ 0.5); cysteine ( ⁇ 1.0); methionine ( ⁇ 1.3); valine ( ⁇ 1.5); leucine ( ⁇ 1.8); isoleucine ( ⁇ 1.8); tyrosine ( ⁇ 2.3); phenylalanine ( ⁇ 2.5); and tryptophan ( ⁇ 3.4).
- Desired amino acid substitutions can be determined by those skilled in the art at the time such substitutions are desired.
- amino acid substitutions can be used to identify important residues of the FGFR-L polypeptide, or to increase or decrease the affinity of the FGFR-L polypeptides described herein.
- Exemplary amino acid substitutions are set forth in Table I.
- a skilled artisan will be able to determine suitable variants of the polypeptide as set forth in either SEQ ID NO: 2 or SEQ ID NO: 5 using well-known techniques. For identifying suitable areas of the molecule that may be changed without destroying biological activity, one skilled in the art may target areas not believed to be important for activity. For example, when similar polypeptides with similar activities from the same species or from other species are known, one skilled in the art may compare the amino acid sequence of an FGFR-L polypeptide to such similar polypeptides. With such a comparison, one can identify residues and portions of the molecules that are conserved among similar polypeptides.
- One skilled in the art can also analyze the three-dimensional structure and amino acid sequence in relation to that structure in similar polypeptides. In view of such information, one skilled in the art may predict the alignment of amino acid residues of FGFR-L polypeptide with respect to its three dimensional structure. One skilled in the art may choose not to make radical changes to amino acid residues predicted to be on the surface of the protein, since such residues may be involved in important interactions with other molecules. Moreover, one skilled in the art may generate test variants containing a single amino acid substitution at each amino acid residue. The variants could be screened using activity assays known to those with skill in the art. Such variants could be used to gather information about suitable variants.
- One method of predicting secondary structure is based upon homology modeling. For example, two polypeptides or proteins which have a sequence identity of greater than 30%, or similarity greater than 40%, often have similar structural topologies.
- the recent growth of the protein structural database (PDB) has provided enhanced predictability of secondary structure, including the potential number of folds within the structure of a polypeptide or protein. See Holm et al., 1999 , Nucleic Acids Res. 27:244-47. It has been suggested that there are a limited number of folds in a given polypeptide or protein and that once a critical number of structures have been resolved, structural prediction will become dramatically more accurate (Brenner et al., 1997 , Curr. Opin. Struct. Biol. 7:369-76).
- Additional methods of predicting secondary structure include “threading” (Jones, 1997 , Curr. Opin. Struct. Biol. 7:377-87; Sippl et al., 1996 , Structure 4:15-19), “profile analysis” (Bowie et al., 1991 , Science, 253:164-70; Gribskov et al., 1990 , Methods Enzymol. 183:146-59; Gribskov et al., 1987 , Proc. Nat. Acad. Sci. U.S.A. 84:4355-58), and “evolutionary linkage” (See Holm et aL, supra, and Brenner et al., supra).
- FGFR-L polypeptide variants include glycosylation variants wherein the number and/or type of glycosylation sites have been altered compared to the amino acid sequence set forth in either SEQ ID NO: 2 or SEQ ID NO: 5.
- FGFR-L polypeptide variants comprise a greater or a lesser number of N-linked glycosylation sites than the amino acid sequence set forth in either SEQ ID NO: 2 or SEQ ID NO: 5.
- An N-linked glycosylation site is characterized by the sequence: Asn-X-Ser or Asn-X-Thr, wherein the amino acid residue designated as X may be any amino acid residue except proline.
- substitution of amino acid residues to create this sequence provides a potential new site for the addition of an N-linked carbohydrate chain.
- substitutions that eliminate this sequence will remove an existing N-linked carbohydrate chain.
- rearrangement of N-linked carbohydrate chains wherein one or more N-linked glycosylation sites (typically those that are naturally occurring) are eliminated and one or more new N-linked sites are created.
- Additional preferred FGFR-L variants include cysteine variants, wherein one or more cysteine residues are deleted or substituted with another amino acid (e.g., serine) as compared to the amino acid sequence set forth in either SEQ ID NO: 2 or SEQ ID NO: 5.
- Cysteine variants are useful when FGFR-L polypeptides must be refolded into a biologically active conformation such as after the isolation of insoluble inclusion bodies. Cysteine variants generally have fewer cysteine residues than the native protein, and typically have an even number to minimize interactions resulting from unpaired cysteines.
- nucleic acid molecules comprise or consist of a nucleotide sequence encoding a polypeptide as set forth in either Seq Id No: 2 or SEQ ID NO: 5 with at least one amino acid insertion and wherein the polypeptide has an activity of the polypeptide set forth in either SEQ ID NO: 2 or SEQ ID NO: 5, or a
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Veterinary Medicine (AREA)
- Medicinal Chemistry (AREA)
- Public Health (AREA)
- Animal Behavior & Ethology (AREA)
- Engineering & Computer Science (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Pharmacology & Pharmacy (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Diabetes (AREA)
- Physical Education & Sports Medicine (AREA)
- Rheumatology (AREA)
- Hematology (AREA)
- Toxicology (AREA)
- Gastroenterology & Hepatology (AREA)
- Endocrinology (AREA)
- Orthopedic Medicine & Surgery (AREA)
- Urology & Nephrology (AREA)
- Obesity (AREA)
- Biochemistry (AREA)
- Immunology (AREA)
- Biomedical Technology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Pathology (AREA)
- Molecular Biology (AREA)
- Epidemiology (AREA)
- Genetics & Genomics (AREA)
- Biophysics (AREA)
- Zoology (AREA)
- Cell Biology (AREA)
- Heart & Thoracic Surgery (AREA)
- Emergency Medicine (AREA)
- Child & Adolescent Psychology (AREA)
- Cardiology (AREA)
Abstract
The present invention provides Fibroblast Growth Factor Receptor-Like (FGFR-L) polypeptides and nucleic acid molecules encoding the same. The invention also provides selective binding agents, vectors, host cells, and methods for producing FGFR-L polypeptides. The invention further provides pharmaceutical compositions and methods for the diagnosis, treatment, amelioration, and/or prevention of diseases, disorders, and conditions associated with FGFR-L polypeptides.
Description
- This application is a continuation of U.S. Provisional Patent Application No. 60/191,379, filed on Mar. 22, 2000, the disclosure of which is explicitly incorporated by reference herein.
- The present invention relates to Fibroblast Growth Factor Receptor-Like (FGFR-L) polypeptides and nucleic acid molecules encoding the same. The invention also relates to selective binding agents, vectors, host cells, and methods for producing FGFR-L polypeptides. The invention further relates to pharmaceutical compositions and methods for the diagnosis, treatment, amelioration, and/or prevention of diseases, disorders, and conditions associated with FGFR-L polypeptides.
- Technical advances in the identification, cloning, expression, and manipulation of nucleic acid molecules and the deciphering of the human genome have greatly accelerated the discovery of novel therapeutics. Rapid nucleic acid sequencing. techniques can now generate sequence information at unprecedented rates and, coupled with computational analyses, allow the assembly of overlapping sequences into partial and entire genomes and the identification of polypeptide-encoding regions. A comparison of a predicted amino acid sequence against a database compilation of known amino acid sequences allows one to determine the extent of homology to previously identified sequences and/or structural landmarks. The cloning and expression of a polypeptide-encoding region of a nucleic acid molecule provides a polypeptide product for structural and functional analyses. The manipulation of nucleic acid molecules and encoded polypeptides may confer advantageous properties on a product for use as a therapeutic.
- In spite of the significant technical advances in genome research over the past decade, the potential for the development of novel therapeutics based on the human genome is still largely unrealized. Many genes encoding potentially beneficial polypeptide therapeutics or those encoding polypeptides, which may act as “targets” for therapeutic molecules, have still not been identified. Accordingly, it is an object of the invention to identify novel polypeptides, and nucleic acid molecules encoding the same, which have diagnostic or therapeutic benefit.
- The present invention relates to novel FGFR-L nucleic acid molecules and encoded polypeptides.
- The invention provides for an isolated nucleic acid molecule comprising a nucleotide sequence selected from the group consisting of:
- (a) the nucleotide sequence as set forth in either SEQ ID NO: 1 or SEQ ID NO: 4;
- (b) the nucleotide sequence of the DNA insert in ATCC Deposit No. ______;
- (c) a nucleotide sequence encoding the polypeptide as set forth in either SEQ ID NO: 2 or SEQ ID NO: 5;
- (d) a nucleotide sequence which hybridizes under moderately or highly stringent conditions to the complement of any of (a)-(c); and
- (e) a nucleotide sequence complementary to any of (a)-(c).
- The invention also provides for an isolated nucleic acid molecule comprising a nucleotide sequence selected from the group consisting of:
- (a) a nucleotide sequence encoding a polypeptide which is at least about 70 percent identical to the polypeptide as set forth in either SEQ ID NO: 2 or SEQ ID NO: 5, wherein the encoded polypeptide has an activity of the polypeptide set forth in either SEQ ID NO: 2 or SEQ ID NO: 5;
- (b) a nucleotide sequence encoding an allelic variant or splice variant of the nucleotide sequence as set forth in either SEQ ID NO: 1 or SEQ ID NO: 4, the nucleotide sequence of the DNA insert in ATCC Deposit No. ______, or (a);
- (c) a region of the nucleotide sequence of either SEQ ID NO: 1 or SEQ ID NO: 4, the DNA insert in ATCC Deposit No. ______, (a), or (b) encoding a polypeptide fragment of at least about 25 amino acid residues, wherein the polypeptide fragment has an activity of the polypeptide set forth in either SEQ ID NO: 2 or SEQ ID NO: 5, or is antigenic;
- (d) a region of the nucleotide sequence of either SEQ ID NO: 1 or SEQ ID NO: 4, the DNA insert in ATCC Deposit No. ______, or any of (a)-(c) comprising a fragment of at least about 16 nucleotides;
- (e) a nucleotide sequence which hybridizes under moderately or highly stringent conditions to the complement of any of (a)-(d); and
- (f) a nucleotide sequence complementary to any of (a)-(d).
- The invention further provides for an isolated nucleic acid molecule comprising a nucleotide sequence selected from the group consisting of:
- (a) a nucleotide sequence encoding a polypeptide as set forth in either SEQ ID NO: 2 or SEQ ID NO: 5 with at least one conservative amino acid substitution, wherein the encoded polypeptide has an activity of the polypeptide set forth in either SEQ ID NO: 2 or SEQ ID NO: 5;
- (b) a nucleotide sequence encoding a polypeptide as set forth in either SEQ ID NO: 2 or SEQ ID NO: 5 with at least one amino acid insertion, wherein the encoded polypeptide has an activity of the polypeptide set forth in either SEQ ID NO: 2 or SEQ ID NO: 5;
- (c) a nucleotide sequence encoding a polypeptide as set forth in either SEQ ID NO: 2 or SEQ ID NO: 5 with at least one amino acid deletion, wherein the encoded polypeptide has an activity of the polypeptide set forth in either SEQ ID NO: 2 or SEQ ID NO: 5;
- (d) a nucleotide sequence encoding a polypeptide as set forth in either SEQ ID NO: 2 or SEQ ID NO: 5 which has a C- and/or N-terminal truncation, wherein the encoded polypeptide has an activity of the polypeptide set forth in either SEQ ID NO: 2 or SEQ ID NO: 5;
- (e) a nucleotide sequence encoding a polypeptide as set forth in either SEQ ID NO: 2 or SEQ ID NO: 5 with at least one modification selected from the group consisting of amino acid substitutions, amino acid insertions, amino acid deletions, C-terminal truncation, and N-terminal truncation, wherein the encoded polypeptide has an activity of the polypeptide set forth in either SEQ ID NO: 2 or SEQ ID NO: 5;
- (f) a nucleotide sequence of any of (a)-(e) comprising a fragment of at least about 16 nucleotides;
- (g) a nucleotide sequence which hybridizes under moderately or highly stringent conditions to the complement of any of (a)-(f); and
- (h) a nucleotide sequence complementary to any of (a)-(e).
- The present invention provides for an isolated polypeptide comprising an amino acid sequence selected from the group consisting of:
- (a) the amino acid sequence as set forth in either SEQ ID NO: 2 or SEQ ID NO: 5; and
- (b) the amino acid sequence encoded by the DNA insert in ATCC Deposit No. ______.
- The invention also provides for an isolated polypeptide comprising the amino acid sequence selected from the group consisting of
- (a) the amino acid sequence as set forth in SEQ ID NO: 3 or SEQ ID NO: 6, optionally further comprising an amino-terminal methionine;
- (b) an amino acid sequence for an ortholog of either SEQ ID NO: 2 or SEQ ID NO: 5;
- (c) an amino acid sequence which is at least about 70 percent identical to the amino acid sequence of either SEQ ID NO: 2 or SEQ ID NO: 5, wherein the polypeptide has an activity of the polypeptide set forth in either SEQ ID NO: 2 or SEQ ID NO: 5;
- (d) a fragment of the amino acid sequence set forth in either SEQ ID NO: 2 or SEQ ID NO: 5 comprising at least about 25 amino acid residues, wherein the fragment has an activity of the polypeptide set forth in either SEQ ID NO: 2 or SEQ ID NO: 5, or is antigenic; and
- (e) an amino acid sequence for an allelic variant or splice variant of the amino acid sequence as set forth in either SEQ ID NO: 2 or SEQ ID NO: 5, the amino acid sequence encoded by the DNA insert in ATCC Deposit No. ______, or any of (a)-(c).
- The invention further provides for an isolated polypeptide comprising the amino acid sequence selected from the group consisting of:
- (a) the amino acid sequence as se t forth in either SEQ ID NO: 2 or SEQ ID NO: 5 with at least one conservative amino acid substitution, wherein the polypeptide has an activity of the polypeptide set forth in either SEQ ID NO: 2 or SEQ ID NO: 5;
- (1) the amino acid sequence as set forth in either SEQ ID NO: 2 or SEQ ID NO: 5 with at least one amino acid insertion, wherein the polypeptide has an activity of the polypeptide set forth in either SEQ ID NO: 2 or SEQ ID NO: 5;
- (c) the amino acid sequence as set forth in either SEQ ID NO: 2 or SEQ ID NO: 5 with at least one amino acid deletion, wherein the polypeptide has an activity of the polypeptide set forth in either SEQ ID NO: 2 or SEQ ID NO: 5;
- (d) the amino acid sequence as set forth in either SEQ ID NO: 2 or SEQ ID NO: 5 which has a C- and/or N-terminal truncation, wherein the polypeptide has an activity of the polypeptide set forth in either SEQ ID NO: 2 or SEQ ID NO: 5; and
- (e) the amino acid sequence as set forth in either SEQ ID NO: 2 or SEQ ID NO: 5 with at least one modification selected from the group consisting of amino acid substitutions, amino acid insertions, amino acid deletions, C-terminal truncation, and N-terminal truncation, wherein the polypeptide has an activity of the polypeptide set forth in either SEQ ID NO: 2 or SEQ ID NO: 5.
- Also provided are fusion polypeptides comprising FGFR-L amino acid sequences.
- The present invention also provides for an expression vector comprising the isolated nucleic acid molecules as set forth herein, recombinant host cells comprising the recombinant nucleic acid molecules as set forth herein, and a method of producing an FGFR-L polypeptide comprising culturing the host cells and optionally isolating the polypeptide so produced.
- A transgenic non-human animal comprising a nucleic acid molecule encoding an FGFR-L polypeptide is also encompassed by the invention. The FGFR-L nucleic acid molecules are introduced into the animal in a manner that allows expression and increased levels of an FGFR-L polypeptide, which may include increased circulating levels. Alternatively, the FGFR-L nucleic acid molecules are introduced into the animal in a manner that prevents expression of endogenous FGFR-L polypeptide (i.e., generates a transgenic animal possessing an FGFR-L polypeptide gene knockout). The transgenic non-human animal is preferably a mammal, and more preferably a rodent, such as a rat or a mouse.
- Also provided are derivatives of the FGPR-L polypeptides of the present invention.
- Additionally provided are selective binding agents such as antibodies and peptides capable of specifically binding the FGFR-L polypeptides of the invention. Such antibodies and peptides may be agonistic or antagonistic.
- Pharmaceutical compositions comprising the nucleotides, polypeptides, or selective binding agents of the invention and one or more pharmaceutically acceptable formulation agents are also encompassed by the invention. The pharmaceutical compositions are used to provide therapeutically effective amounts of the nucleotides or polypeptides of the present invention. The invention is also directed to methods of using the polypeptides, nucleic acid molecules, and selective binding agents.
- The FGFR-L polypeptides and nucleic acid molecules of the present invention may be used to treat, prevent, ameliorate, and/or detect diseases and disorders, including those recited herein.
- The present invention also provides a method of assaying test molecules to identify a test molecule that binds to an FGFR-L polypeptide. The method comprises contacting an FGFR-L polypeptide with a test molecule to determine the extent of binding of the test molecule to the polypeptide. The method further comprises determining whether such test molecules are agonists or antagonists of an FGFR-L polypeptide. The present invention further provides a method of testing the impact of molecules on the expression of FGFR-L polypeptide or on the activity of FGFR-L polypeptide.
- Methods of regulating expression and modulating (i.e., increasing or decreasing) levels of an FGFR-L polypeptide are also encompassed by the invention. One method comprises administering to an animal a nucleic acid molecule encoding an FGFR-L polypeptide. In another method, a nucleic acid molecule comprising elements that regulate or modulate the expression of an FGFR-L polypeptide may be administered. Examples of these methods include gene therapy, cell therapy, and anti-sense therapy as further described herein.
- The FGFR-L polypeptide can be used for identifying ligands thereof. Various forms of “expression cloning” have been used for cloning ligands for receptors (e.g., Davis et al., 1996, Cell, 87:1161-69). These and other FGFR-L polypeptide ligand cloning experiments are described in greater detail herein. Isolation of an FGFR-L polypeptide ligand allows for the identification or development of novel agonists or antagonists of the FGFR-L polypeptide signaling pathway. Such agonists and antagonists include FGFR-L polypeptide ligands, anti-FGFR-L polypeptide ligand is antibodies and derivatives thereof, small molecules, or antisense oligonucleotides, any of which can be used for potentially treating one or more diseases or disorders, including those recited herein.
- FIGS.1A-1C illustrate the nucleotide sequence of the murine FGFR-L gene (SEQ ID NO: 1) and the deduced amino acid sequence of murine FGFR-L polypeptide (SEQ ID NO: 2). The predicted signal peptide (underline) and transmembrane domain (double-underline) are indicated;
- FIGS.2A-2B illustrate the amino acid sequence alignment of murine FGFR-L polypeptide (Smaf2-00017-f4; SEQ ID NO: 2) and Iberian ribbed newt (Pleurodeles waltlii) Fibroblast Growth Factor Receptor-4 (PIR:B49151; SEQ ID NO: 7);
- FIGS.3A-3B illustrate the nucleotide sequence of a cDNA clone encoding the N-terminal portion of the human FGFR-L gene (SEQ ID NO: 4) and the deduced amino acid sequence of the N-terminal portion of the human FGFR-L polypeptide (SEQ ID NO: 5). The predicted signal peptide (underline) and transmembrane domain (double-underline) are indicated;
- FIG. 4 illustrates the amino acid sequence alignment of murine FGFR-L polypeptide (SEQ ID NO: 2) and a virtual human FGFR-L polypeptide sequence (SEQ ID NO: 8) constructed from residues 1-472 of SEQ ID NO: 5 and residues 473-504 of GenBank Accession No. AJ277437. The predicted signal peptide (underline), transmembrane domain (double-underline), and N-linked glycosylation sites (bold) are indicated;
- FIG. 5 illustrates the expression of FGFR-L MRNA as detected by Northern blot analysis in
day - FIG. 6 illustrates the expression of FGFR-L mrRNA as detected by Northern blot analysis in murine heart, brain, spleen, lung, liver, skeletal muscle, kidney, and testis;
- FIG. 7 illustrates the expression of FGFR-L mRNA as detected by Northern blot analysis in NIH 3T3 cells and F10, F4, and D3 mouse bone marrow-derived stromal cell lines;
- FIG. 8 illustrates the expression of FGFR-L mRNA as detected by Northern blot analysis in human brain, heart, skeletal muscle, colon, thymus, spleen, kidney, liver, small intestine, placenta, lung, and peripheral blood leukocytes;
- FIG. 9 illustrates the expression of FGFR-L mRNA as detected by Northern blot analysis in promyelocytic leukemia HL-60 cells, HeLa S3 cells, chronic myelogenous leukemia L-562 cells, lymphoblastic leukemia MOLT-4 cells, Burkitt's lymphoma Raji cells, colorectal adenocarcinoma SW480 cells, lung carcinoma A549 cells, and melanoma G361 cells;
- FIG. 10 illustrates the expression of FGFR-L mRNA as detected by Northern blot analysis in human heart, brain, placenta, lung, liver, skeletal muscle, kidney, and pancreas;
- FIG. 11 illustrates the expression of FGFR-L mRNA as detected by Northern blot analysis in 266-6 cells, AR42J cells, CaPan I cells, HIG-82 cells, OHS4 cells,
SW 1353 cells,SW 872 cells, K562 (old, i.e., later passage) cells, K562 (new, i.e., earlier passage) cells, Jurkat cells, and F4 cells; - FIGS.12A-12B illustrate the expression of FGFR-L mRNA as detected by Northern blot analysis in human adipose tissue (using a human FGFR-L-derived probe) and murine adipose tissue (using a murine FGFR-L-derived probe);
- FIG. 13 illustrates the expression of FGFR-L mRNA in a number of murine tissues as detected in an RNAse protection assay. The absence of the cyclophilin band in the pancreas RNA sample suggests that thi sample was degraded;
- FIG. 14 illustrates the expression of FGFR-L mRNA as detected by in situ hybridization in the peri-renal, white, and brown adipose tissue of a normal adult mouse (H&E=hematoxylin and eosin counterstaining; ISH=in situ hybridization);
- FIG. 15 illustrates the. expression of FGFR-L mRNA as detected by in situ hybridization in the duodenum, ileum, colon, and pancreas of a normal adult mouse (H&E=hematoxylin and eosin counterstaining; ISH=in situ hybridization);
- FIG. 16 illustrates the expression of FGFR-L mRNA as detected by in situ hybridization in the trachea, articular cartilage of the knee joint, spleen, and uterus of a normal adult mouse (H&E=hematoxylin and eosin counterstaining; ISH=in situ hybridization);
- FIG. 17 illustrates the induction of FGFR-L MRNA in osteoblastic ST2 cells under conditions of osteoclastogenesis (i.e., 5-day exposure to vitamin D3 and dexamethasone);
- FIG. 18 illustrates the results of Western blot analysis ofE. coli-derived Des7-FGFR-L/ECD and CHO-derived FGFR-L/ECD-Fc proteins using FGFR-L polypeptide antiserum;
- FIG. 19 illustrates the results of Western blot analysis of murine eye (lane 1) and adipose tissue (lane 2) using FGFR-L polypeptide antiserum;
- FIGS.20A-20B illustrate the results of FACS analysis on F4 and D3 bone marrow stromal cells using FGFR-L polypeptide antiserum;
- FIGS.21A-21D illustrate the results of proliferation assays using D3 bone marrow stromal cells (either untransduced or transduced with a construct encoding FGFR-L polypeptide) following 72 hour exposure to rhuPDGF (panel A), rhuFGF-2 (panel B), rhuFGF-4 (panel C), or rhuFGF-6 (panel D);
- FIG. 22 illustrates the results of proliferation assays using A5-F bone marrow stromal cells following exposure toE. coli-derived Des7-FGFR-L/ECD protein and serum, PDGF, FGF-2, FGF-4, or FGF-6;
- FIG. 23 illustrates the results of proliferation assays using A5-F bone marrow stromal cells following exposure to CHO-derived FGFR-L/ECD-Fc protein and serum, PDGF, FGF-4, or FGF-6;
- FIG. 24 illustrates the expression of the neomycin resistance gene as detected by Northern blot analysis of peripheral blood mononuclear cell (PBMN) RNA from two FGFR-L/neo-transduced mice (
lanes 1 and 2) and two neo-transduced control mice (lanes 3 and 4). - The section headings used herein are for organizational purposes only and are not to be construed as limiting the subject matter described. All references cited in this application are expressly incorporated by reference herein.
- Definitions
- The terms “FGFR-L gene” or “FGFR-L nucleic acid molecule” or “FGFR-L polynucleotide” refer to a nucleic acid molecule comprising or consisting of a nucleotide sequence as set forth in either SEQ ID NO: 1 or SEQ ID NO: 4, a nucleotide sequence encoding the polypeptide as set forth in either SEQ ID NO: 2 or SEQ ID NO: 5, a nucleotide sequence of the DNA insert in ATCC Deposit No. ______, and nucleic acid molecules as defined herein.
- The term “FGFR-L polypeptide allelic variant” refers to one of several possible naturally occurring alternate forms of a gene occupying a given locus on a chromosome of an organism or a population of organisms.
- The term “FGFR-L polypeptide splice variant” refers to a nucleic acid molecule, usually RNA, which is generated by alternative processing of intron sequences in an RNA transcript of FGFR-L polypeptide amino acid sequence as set forth in either SEQ ID NO: 2 or SEQ ID NO: 5.
- The term “isolated nucleic acid molecule” refers to a nucleic acid molecule of the invention that (1) has been separated from at least about 50 percent of proteins, lipids, carbohydrates, or other materials with which it is naturally found when total nucleic acid is isolated from the source cells, (2) is not linked to all or a portion of a polynucleotide to which the “isolated nucleic acid molecule” is linked in nature, (3) is operably linked to a polynucleotide which it is not linked to in nature, or (4) does not occur in nature as part of a larger polynucleotide sequence. Preferably, the isolated nucleic acid molecule of the present invention is substantially free from any other contaminating nucleic acid molecule(s) or other contaminants that are found in its natural environment that would interfere with its use in polypeptide production or its therapeutic, diagnostic, prophylactic or research use.
- The term “nucleic acid sequence” or “nucleic acid molecule” refers to a DNA or RNA sequence. The term encompasses molecules formed from any of the known base analogs of DNA and RNA such as, but not limited to 4-acetylcytosine, 8-hydroxy-N6-methyladenosine, aziridinyl-cytosine, pseudoisocytosine, 5-(carboxyhydroxylmethyl) uracil, 5-fluorouracil, 5-bromouracil, 5-carboxymethylaminomethyl-2-thiouracil, 5-carboxy-methylaminomethyluracil, dihydrouracil, inosine, N6-iso-pentenyladenine, 1-methyladenine, 1-methylpseudouracil, 1-methylguanine, 1-methylinosine, 2,2-dimethyl-guanine, 2-methyladenine, -2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-methyladenine, 7-methylguanine, 5-methylaminomethyluracil, 5-methoxyamino-methyl-2-thiouracil, beta-D-mannosylqueosine, 5′-methoxycarbonyl-methyluracil, 5-methoxyuracil, 2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid, oxybutoxosine, pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, N-uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid, pseudouracil, queosine, 2-thiocytosine, and 2,6-diaminopurine.
- The term “vector” is used to refer to any molecule (e.g., nucleic acid, plasmid, or virus) used to transfer coding information to a host cell.
- The term “expression vector” refers to a vector that is suitable for transformation of a host cell and contains nucleic acid sequences that direct and/or control the expression of inserted heterologous nucleic acid sequences. Expression includes, but is not limited to, processes such as transcription, translation, and RNA splicing, if introns are present.
- The term “operably linked” is used herein to refer to an arrangement of flanking sequences wherein the flanking sequences so described are configured or assembled so as to perform their usual function. Thus, a flanking sequence operably linked to a coding sequence may be capable of effecting the replication, transcription and/or translation of the coding sequence. For example, a coding sequence is operably linked to a promoter when the promoter is capable of directing transcription of that coding sequence. A flanking sequence need not be contiguous with the coding sequence, so long as it functions correctly. Thus, for example, intervening untranslated yet transcribed sequences can be present between a promoter sequence and the coding sequence and the promoter sequence can still be considered “operably linked” to the coding sequence.
- The term “host cell” is used to refer to a cell which has been transformed, or is capable of being transformed with a nucleic acid sequence and then of expressing a selected gene of interest. The term includes the progeny of the parent cell, whether or not the progeny is identical in morphology or in genetic make-up to the original parent, so long as the selected gene is present.
- The term “FGFR-L polypeptide” refers to a polypeptide comprising the amino acid sequence of either SEQ ID NO.: 2 or SEQ ID NO: 5 and related polypeptides. Related polypeptides include FGFR-L polypeptide fragments, FGFR-L polypeptide orthologs, FGFR-L polypeptide variants, and FGFR-L polypeptide derivatives, which possess at least one activity of the polypeptide as set forth in either SEQ ID NO: 2 or SEQ ID NO: 5. FGFR-L polypeptides may be mature polypeptides, as defined herein, and may or may not have an amino-terminal methionine residue, depending on the method by which they are prepared.
- The term “FGFR-L polypeptide fragment” refers to a polypeptide that comprises a truncation at the amino-terminus (with or without a leader sequence) and/or a truncation at the carboxyl-terminus of the polypeptide as set forth in either SEQ ID NO: 2 or SEQ ID NO: 5. The term “FGFR-L polypeptide fragment” also refers to amino-terminal and/or carboxyl-terminal truncations of FGFR-L polypeptide orthologs, FGFR-L polypeptide derivatives, or FGFR-L polypeptide variants, or to amino-terminal and/or carboxyl-terminal truncations of the polypeptides encoded by FGFR-L polypeptide allelic variants or FGFR-L polypeptide splice variants. FGFR-L polypeptide fragments may result from alternative RNA splicing or from in vivo protease activity. Membrane-bound forms of an FGFR-L polypeptide are also contemplated by the present invention. In preferred embodiments, truncations and/or deletions comprise about 10 amino acids, or about 20 amino acids, or about 50 amino acids, or about 75 amino acids, or about 100 amino acids, or more than about 100 amino acids. The polypeptide fragments so produced will comprise about 25 contiguous amino acids, or about 50 amino acids, or about 75. amino acids, or about 100 amino acids, or about 150 amino acids, or about 200 amino acids, or more than about 200 amino acids. Such FGFR-L polypeptide fragments may optionally comprise an amino-terminal methionine residue. It will be appreciated that such fragments can be used, for example, to generate antibodies to FGFR-L polypeptides.
- The term “FGFR-L polypeptide ortholog” refers to a polypeptide from another species that corresponds to FGFR-L polypeptide amino acid sequence as set forth in either SEQ ID NO: 2 or SEQ ID NO: 5. For example, mouse and human FGFR-L polypeptides are considered orthologs of each other.
- The term “FGFR-L polypeptide variants” refers to FGFR-L polypeptides comprising amino acid sequences having one or more amino acid sequence substitutions, deletions (such as internal deletions and/or FGFR-L polypeptide fragments), and/or additions (such as internal additions and/or FGFR-L fusion polypeptides) as compared to the FGFR-L polypeptide amino acid sequence set forth in either SEQ ID NO: 2 or SEQ ID NO: 5 (with or without a leader sequence). Variants may be naturally occurring (e.g., FGFR-L polypeptide allelic variants, FGFR-L polypeptide orthologs, and FGFR-L polypeptide splice variants) or artificially constructed. Such FGFR-L polypeptide variants may be prepared from the corresponding nucleic acid molecules having a DNA sequence that varies accordingly from the DNA sequence as set forth in either SEQ ID NO: 1 or SEQ ID NO: 4. In preferred embodiments, the variants have from 1 to 3, or from 1 to 5, or from 1 to 10, or from 1 to 15, or from 1 to 20, or from 1 to 25, or from 1 to 50, or from 1 to 75, or from 1 to 100, or more than 100 amino acid substitutions, insertions, additions and/or deletions, wherein the substitutions may be conservative, or non-conservative, or any combination thereof.
- The term “FGFR-L polypeptide derivatives” refers to the polypeptide as set forth in either SEQ ID NO: 2 or SEQ ID NO: 5, FGFR-L polypeptide fragments, FGFR-L polypeptide orthologs, or FGFR-L polypeptide variants, as defined herein, that have been chemically modified. The term “FGFR-L polypeptide derivatives” also refers to the polypeptides encoded by FGFR-L polypeptide allelic variants or FGFR-L polypeptide splice variants, as defined herein, that have been chemically modified.
- The term “mature FGFR-L polypeptide” refers to an FGFR-L polypeptide lacking a leader sequence. A mature FGFR-L polypeptide may also include other modifications such as proteolytic processing of the amino-terminus (with or without a leader sequence) and/or the carboxyl-terminus, cleavage of a smaller polypeptide from a larger precursor, N-linked and/or O-linked glycosylation, and the like. An exemplary mature FGFR-L polypeptide is depicted by the amino acid sequence of either SEQ ID NO: 3 or SEQ ID NO: 6.
- The term “FGFR-L fusion polypeptide” refers to a fusion of one or more amino acids (such as a heterologous protein or peptide) at the amino- or carboxyl-terminus of the polypeptide as set forth in either SEQ ID NO: 2 or SEQ ID NO: 5, FGFR-L polypeptide fragments, FGFR-L polypeptide orthologs, FGFR-L polypeptide variants, or FGFR-L derivatives, as defined herein. The term “FGFR-L fusion polypeptide” also refers to a fusion of one or more amino acids at the amino- or carboxyl-terminus of the polypeptide encoded by FGFR-L polypeptide allelic variants or FGFR-L polypeptide splice variants, as defined herein.
- The term “biologically active FGFR-L polypeptides” refers to FGFR-L polypeptides having at least one activity characteristic of the polypeptide comprising the amino acid sequence of either SEQ ID NO: 2 or SEQ ID NO: 5. In addition, an FGFR-L polypeptide may be active as an immunogen; that is, the FGFR-L polypeptide contains at least one epitope to which antibodies may be raised.
- The term “isolated polypeptide” refers to a polypeptide of the present invention that (1) has been separated from at least about 50 percent of polynucleotides, lipids, carbohydrates, or other materials with which it is naturally found when isolated from the source cell, (2) is not linked (by covalent or noncovalent interaction) to all or a portion of a polypeptide to which the “isolated polypeptide” is linked in nature, (3) is operably linked (by covalent or noncovalent interaction) to a polypeptide with which it is not linked in nature, or (4) does not occur in nature. Preferably, the isolated polypeptide is substantially free from any other contaminating polypeptides or other contaminants that are found in its natural environment that would interfere with its therapeutic, diagnostic, prophylactic or research use.
- The term “identity,” as known in the art, refers to a relationship between the sequences of two or more polypeptide molecules or two or more nucleic acid molecules, as determined by comparing the sequences. In the art, “identity” also means the degree of sequence relatedness between nucleic acid molecules or polypeptides, as the case may be, as determined by the match between strings of two or more nucleotide or two or more amino acid sequences. “Identity” measures the percent of identical matches between the smaller of two or more sequences with gap alignments (if any) addressed by a particular mathematical model or computer program (i.e., “algorithms”).
- The term “similarity” is a related concept, but in contrast to “identity,” “similarity” refers to a measure of relatedness which includes both identical matches and conservative substitution matches. If two polypeptide sequences have, for example, {fraction (10/20)} identical amino acids, and the remainder are all non-conservative substitutions, then the percent identity and similarity would both be 50%. If in the same example, there are five more positions where there are conservative substitutions, then the percent identity remains 50%, but the percent similarity would be 75% ({fraction (15/20)}). Therefore, in cases where there are conservative substitutions, the percent similarity between two polypeptides will be higher than the percent identity between those two polypeptides.
- The term “naturally occurring” or “native” when used in connection with biological materials such as nucleic acid molecules, polypeptides, host cells, and the like, refers to materials which are found in nature and are not manipulated by man. Similarly, “non-naturally occurring” or “non-native” as used herein refers to a material that is not found in nature or that has been structurally modified or synthesized by man.
- The terms “effective amount” and “therapeutically effective amount” each refer to the amount of an FGFR-L polypeptide or FGFR-L nucleic acid molecule used to support an observable level of one or more biological activities of the FGFR-L polypeptides as set forth herein.
- The term “pharmaceutically acceptable carrier” or “physiologically acceptable carrier” as used herein refers to one or more formulation materials suitable for accomplishing or enhancing the delivery of the FGFR-L polypeptide, FGFR-L nucleic acid molecule, or FGFR-L selective binding agent as a pharmaceutical composition.
- The term “antigen” refers to a molecule or a portion of a molecule capable of being bound by a selective binding agent, such as an antibody, and additionally capable of being used in an animal to produce antibodies capable of binding to an epitope of that antigen. An antigen may have one or more epitopes.
- The term “selective binding agent” refers to a molecule or molecules having specificity for an FGFR-L polypeptide. As used herein, the terms, “specific” and “specificity” refer to the ability of the selective binding agents to bind to human FGFR-L polypeptides and not to bind to human non-FGFR-L polypeptides. It will be appreciated, however, that the selective binding agents may also bind orthologs of the polypeptide as set forth in either SEQ ID NO: 2 or SEQ ID NO: 5, that is, interspecies versions thereof, such as mouse and rat FGFR-L polypeptides.
- The term “transduction” is used to refer to the transfer of genes from one bacterium to another, usually by a phage. “Transduction” also refers to the acquisition and transfer of eukaryotic cellular sequences by retroviruses.
- The term “transfection” is used to refer to the uptake of foreign or exogenous DNA by a cell, and a cell has been “transfected” when the exogenous DNA has been introduced inside the cell membrane. A number of transfection techniques are well known in the art and are disclosed herein. See, e.g., Graham et al., 1973, Virology 52:456; Sambrook et al., Molecular Cloning, A Laboratory Manual (Cold Spring Harbor Laboratories, 1989); Davis et al., Basic Methods in Molecular Biology (Elsevier, 1986); and Chu et al., 1981, Gene 13:197. Such techniques can be used to introduce one or more exogenous DNA moieties into suitable host cells.
- The term “transformation” as used herein refers to a change in a cell's genetic characteristics, and a cell has been transformed when it has been modified to contain a new DNA. For example, a cell is transformed where it is genetically modified from its native state. Following transfection or transduction, the transforming DNA may recombine with that of the cell by physically integrating into a chromosome of the cell, may be maintained transiently as an episomal element without being replicated, or may replicate independently as a plasmid. A cell is considered to have been stably transformed when the DNA is replicated with the division of the cell.
- Relatedness of Nucleic Acid Molecules and/or Polypeptides
- It is understood that related nucleic acid molecules include allelic or splice variants of the nucleic acid molecule of either SEQ ID NO: 1 or SEQ ID NO: 4, and include sequences which are complementary to any of the above nucleotide sequences. Related nucleic acid molecules also include a nucleotide sequence encoding a polypeptide comprising or consisting essentially of a substitution, modification, addition and/or deletion of one or more amino acid residues compared to the polypeptide in either SEQ ID NO: 2 or SEQ ID NO: 5. Such related FGFR-L polypeptides may comprise, for example, an addition and/or a deletion of one or more N-linked or O-linked glycosylation sites or an addition and/or a deletion of one or more cysteine residues.
- Related nucleic acid molecules also include fragments of FGFR-L nucleic acid molecules which encode a polypeptide of at least about 25 contiguous amino acids, or about 50 amino acids, or about 75 amino acids, or about 100 amino acids, or about 150 amino acids, or about 200 amino acids, or more than about 200 amino acid residues of the FGFR-L polypeptide of either SEQ ID NO: 2 or SEQ ID NO: 5.
- In addition, related FGFR-L nucleic acid molecules also include those molecules which comprise nucleotide sequences which hybridize under moderately or highly stringent conditions as defined herein with the fully complementary sequence of the FGFR-L nucleic acid molecule of either SEQ ID NO: 1 or SEQ ID NO: 4, or of a molecule encoding a polypeptide, which polypeptide comprises the amino acid sequence as shown in either SEQ ID NO: 2 or SEQ ID NO: 5, or of a nucleic acid fragment as defined herein, or of a nucleic acid fragment encoding a polypeptide as defined herein. Hybridization probes may be prepared using the FGFR-L sequences provided herein to screen cDNA, genomic or synthetic DNA libraries for related sequences. Regions of the DNA and/or amino acid sequence of FGFR-L polypeptide that exhibit significant identity to known sequences are readily determined using sequence alignment algorithms as described herein and those regions may be used to design probes for screening.
- The term “highly stringent conditions” refers to those conditions that are designed to permit hybridization of DNA strands whose sequences are highly complementary, and to exclude hybridization of significantly mismatched DNAs. Hybridization stringency is principally determined by temperature, ionic strength, and the concentration of denaturing agents such as formamide. Examples of “highly stringent conditions” for hybridization and washing are 0.015 M sodium FGFR-Loride, 0.0015 M sodium citrate at 65-68° C. or 0.015 M sodium FGFR-Loride, 0.0015 M sodium citrate, and 50% formamide at 42° C. See Sambrook, Fritsch & Maniatis,Molecular Cloning: A Laboratory Manual (2nd ed., Cold Spring Harbor Laboratory, 1989); Anderson et al., Nucleic Acid Hybridisation: A Practical Approach Ch. 4 (IRL Press Limited).
- More stringent conditions (such as higher temperature, lower ionic strength, higher formamide, or other denaturing agent) may also be used—however, the rate of hybridization will be affected. Other agents may be included in the hybridization and Washing buffers for the purpose of reducing non-specific and/or background hybridization. Examples are 0.1% bovine serum albumin, 0.1% polyvinyl-pyrrolidone, 0.1% sodium pyrophosphate, 0.1% sodium dodecylsulfate, NaDodSO4, (SDS), ficoll, Denhardt's solution, sonicated salmon sperm DNA (or another non-complementary DNA), and dextran sulfate, although other suitable agents can also be used. The concentration and types of these additives can be changed without substantially affecting the stringency of the hybridization conditions. Hybridization experiments are usually carried out at pH 6.8-7.4; however, at typical ionic strength conditions, the rate of hybridization is nearly independent of pH. See Anderson et al., Nucleic Acid Hybridisation: A Practical Approach Ch. 4 (IRL Press Limited).
- Factors affecting the stability of DNA duplex include base composition, length, and degree of base pair mismatch. Hybridization conditions can be adjusted by one skilled in the art in order to accommodate these variables and allow DNAs of different sequence relatedness to form hybrids. The melting temperature of a perfectly matched DNA duplex can be estimated by the following equation:
- T m(° C.)=81.5+16.6(log[Na+])+0.41(% G+C)−600/N−0.72(% formamide)
- where N is the length of the duplex formed, [Na+] is the molar concentration of the sodium ion in the hybridization or washing solution, % G+C is the percentage of (guanine+cytosine) bases in the hybrid. For imperfectly matched hybrids, the melting temperature is reduced by approximately 1° C. for each 1% mismatch.
- The term “moderately stringent conditions” refers to conditions under which a DNA duplex with a greater degree of base pair mismatching than could occur under “highly stringent conditions” is able to form. Examples of typical “moderately stringent conditions” are 0.015 M sodium FGFR-Loride, 0.0015 M sodium citrate at 50-65° C. or 0.015 M sodium FGFR-Loride, 0.0015 M sodium citrate, and 20% formamide at 37-50° C. By way of example, “moderately stringent conditions” of 50° C. in 0.015 M sodium ion will allow about a 21% mismatch.
- It will be appreciated by those skilled in the art that there is no absolute distinction between “highly stringent conditions” and “moderately stringent conditions.” For example, at 0.015 M sodium ion (no formamide), the melting temperature of perfectly matched long DNA is about 71° C. With a wash at 65° C. (at the same ionic strength), this would allow for approximately a 6% mismatch. To capture more distantly related sequences, one skilled in the art can simply lower the temperature or raise the ionic strength.
- A good estimate of the melting temperature in 1M NaCl* for oligonucleotide probes up to about 20nt is given by:
- Tm=2° C. per A−T base pair+4° C. per G−C base pair
- *The sodium ion concentration in 6× salt sodium citrate (SSC) is 1M. See Suggs et al.,Developmental Biology Using Purified Genes 683 (Brown and Fox, eds., 1981).
- High stringency washing conditions for oligonucleotides are usually at a temperature of 0-5° C. below the Tm of the oligonucleotide in 6× SSC, 0.1% SDS.
- In another embodiment, related nucleic acid molecules comprise or consist of a nucleotide sequence that is at least about 70 percent identical to the nucleotide sequence as shown in either SEQ ID NO: 1 or SEQ ID NO: 4, or comprise or consist essentially of a nucleotide sequence encoding a polypeptide that is at least about 70 percent identical to the polypeptide as set forth in either SEQ ID NO: 2 or SEQ ID NO: 5. In preferred embodiments, the nucleotide sequences are about 75 percent, or about 80 percent, or about 85 percent, or about 90 percent, or about 95, 96, 97, 98, or 99 percent identical to the nucleotide sequence as shown in either SEQ ID NO: 1 or SEQ ID NO: 4, or the nucleotide sequences encode a polypeptide that is about 75 percent, or about 80 percent, or about 85 percent, or about 90 percent, or about 95, 96, 97, 98, or 99 percent identical to the polypeptide sequence as set forth in either SEQ ID NO: 2 or SEQ ID NO: 5.
- Related nucleic acid molecules encode polypeptides possessing at least one activity of the polypeptide set forth in either SEQ ID NO: 2 or SEQ ID NO: 5.
- Differences in the nucleic acid sequence may result in conservative and/or non-conservative modifications of the amino acid sequence relative to the amino acid sequence of either SEQ ID NO: 2 or SEQ ID NO: 5.
- Conservative modifications to the amino acid sequence of either SEQ ID NO: 2 or SEQ ID NO: 5 (and the corresponding modifications to the encoding nucleotides) will produce a polypeptide having functional and chemical characteristics similar to those of FGFR-L polypeptides. In contrast, substantial modifications in the functional and/or chemical characteristics of FGFR-L polypeptides may be accomplished by selecting substitutions in the amino acid sequence of either SEQ ID NO: 2 or SEQ ID NO: 5 that differ significantly in their effect on maintaining (a) the structure of the molecular backbone in the area of the substitution, for example, as a sheet or helical conformation, (b) the charge or hydrophobicity of the molecule at the target site, or (c) the bulk of the side chain.
- For example, a “conservative amino acid substitution” may involve a substitution of a native amino acid residue with a normative residue such that there is little or no effect on the polarity or charge of the amino acid residue at that position. Furthermore, any native residue in the polypeptide may also be substituted with alanine, as has been previously described for “alanine scanning mutagenesis.”
- Conservative amino acid substitutions also encompass non-naturally occurring amino acid residues that are typically incorporated by chemical peptide synthesis rather than by synthesis in biological systems. These include peptidomimetics, and other reversed or inverted forms of amino acid moieties.
- Naturally occurring residues may be divided into classes based on common side chain properties:
- 1) hydrophobic: norleucine, Met, Ala, Val, Leu, Ile;
- 2) neutral hydrophilic: Cys, Ser, Thr;
- 3) acidic: Asp, Glu;
- 4) basic: Asn, Gln, His, Lys, Arg;
- 5) residues that influence chain orientation: Gly, Pro; and
- 6) aromatic: Trp, Tyr, Phe.
- For example, non-conservative substitutions may involve the exchange of a member of one of these classes for a member from another class. Such substituted residues may be introduced into regions of the human FGFR-L polypeptide that are homologous with non-human FGFR-L polypeptides, or into the non-homologous regions of the molecule.
- In making such changes, the hydropathic index of amino acids may be considered.
- Each amino acid has been assigned a hydropathic index on the basis of its hydrophobicity and charge characteristics. The hydropathic indices are: isoleucine (+4.5); valine (+4.2); leucine (+3.8); phenylalanine (+2.8); cysteine/cystine (+2.5); methionine (+1.9); alanine (+1.8); glycine (−0.4); threonine (−0.7); serine (−0.8); tryptophan (−0.9); tyrosine (−1.3); proline (−1.6); histidine (−3.2); glutamate (−3.5); glutamine (−3.5); aspartate (−3.5); asparagine (−3.5); lysine (−3.9); and arginine (−4.5).
- The importance of the hydropathic amino acid index in conferring interactive biological function on a protein is generally understood in the art (Kyte et al., 1982, J. Mol. Biol. 157:105-31). It is known that certain amino acids maybe substituted for other amino acids having a similar hydropathic index or score and still retain a similar biological activity. In making changes based upon the hydropathic index, the substitution of amino acids whose hydropathic indices are within ±2 is preferred, those which are within ±1 are particularly preferred, and those within ±0.5 are even more particularly preferred.
- It is also understood in the art that the substitution of like amino acids can be made effectively on the basis of hydrophilicity, particularly where the biologically functionally equivalent protein or peptide thereby created is intended for use in immunological embodiments, as in the present case. The greatest local average hydrophilicity of a protein, as governed by the hydrophilicity of its adjacent amino acids, correlates with its immunogenicity and antigenicity, i.e., with a biological property of the protein.
- The following hydrophilicity values have been assigned to these amino acid residues: arginine (+3.0); lysine (+3.0); aspartate (+3.0±1); glutamate (+3.0±1); serine (+0.3); asparagine (+0.2); glutamine (+0.2); glycine (0); threonine (−0.4); proline (−0.5±1); alanine (−0.5); histidine (−0.5); cysteine (−1.0); methionine (−1.3); valine (−1.5); leucine (−1.8); isoleucine (−1.8); tyrosine (−2.3); phenylalanine (−2.5); and tryptophan (−3.4). In making changes based upon similar hydrophilicity values, the substitution of amino acids whose hydrophilicity values are within ±2 is preferred, those which are within ±1 are particularly preferred, and those within ±0.5 are even more particularly preferred. One may also identify epitopes from primary amino acid sequences on the basis of hydrophilicity. These regions are also referred to as “epitopic core regions.”
- Desired amino acid substitutions (whether conservative or non-conservative) can be determined by those skilled in the art at the time such substitutions are desired. For example, amino acid substitutions can be used to identify important residues of the FGFR-L polypeptide, or to increase or decrease the affinity of the FGFR-L polypeptides described herein. Exemplary amino acid substitutions are set forth in Table I.
TABLE I Amino Acid Substitutions Original Residues Exemplary Substitutions Preferred Substitutions Ala Val, Leu, Ile Val Arg Lys, Gln, Asn Lys Asn Gln Gln Asp Glu Glu Cys Ser, Ala Ser Gln Asn Asn Glu Asp Asp Gly Pro, Ala Ala His Asn, Gln, Lys, Arg Arg Ile Leu, Val, Met, Ala, Leu Phe, Norleucine Leu Norleucine, Ile, Ile Val, Met, Ala, Phe Lys Arg, 1,4 Diamino-butyric Arg Acid, Gln, Asn Met Leu, Phe, Ile Leu Phe Leu, Val, Ile, Ala, Leu Tyr Pro Ala Gly Ser Thr, Ala, Cys Thr Thr Ser Ser Trp Tyr, Phe Tyr Tyr Trp, Phe, Thr, Ser Phe Val Ile, Met, Leu, Phe, Leu Ala, Norleucine - A skilled artisan will be able to determine suitable variants of the polypeptide as set forth in either SEQ ID NO: 2 or SEQ ID NO: 5 using well-known techniques. For identifying suitable areas of the molecule that may be changed without destroying biological activity, one skilled in the art may target areas not believed to be important for activity. For example, when similar polypeptides with similar activities from the same species or from other species are known, one skilled in the art may compare the amino acid sequence of an FGFR-L polypeptide to such similar polypeptides. With such a comparison, one can identify residues and portions of the molecules that are conserved among similar polypeptides. It will be appreciated that changes in areas of the FGFR-L molecule that are not conserved relative to such similar polypeptides would be less likely to adversely affect the biological activity and/or structure of an FGFR-L polypeptide. One skilled in the art would also know that, even in relatively conserved regions, one may substitute chemically similar amino acids for the naturally occurring residues while retaining activity (conservative amino acid residue substitutions). Therefore, even areas that may be important for biological activity or for structure may be subject to conservative amino acid substitutions without destroying the biological activity or without adversely affecting the polypeptide structure.
- Additionally, one skilled in the art can review structure-function studies identifying residues in similar polypeptides that are important for activity or structure. In view of such a comparison, one can predict the importance of amino acid residues in an FGFR-L polypeptide that correspond to amino acid residues that are important for activity or structure in similar polypeptides. One skilled in the art may opt for chemically similar amino acid substitutions for such predicted important amino acid residues of FGFR-L polypeptides.
- One skilled in the art can also analyze the three-dimensional structure and amino acid sequence in relation to that structure in similar polypeptides. In view of such information, one skilled in the art may predict the alignment of amino acid residues of FGFR-L polypeptide with respect to its three dimensional structure. One skilled in the art may choose not to make radical changes to amino acid residues predicted to be on the surface of the protein, since such residues may be involved in important interactions with other molecules. Moreover, one skilled in the art may generate test variants containing a single amino acid substitution at each amino acid residue. The variants could be screened using activity assays known to those with skill in the art. Such variants could be used to gather information about suitable variants. For example, if one discovered that a change to a particular amino acid residue resulted in destroyed, undesirably reduced, or unsuitable activity, variants with such a change would be avoided. In other words, based on information gathered from such routine experiments, one skilled in the art can readily determine the amino acids where further substitutions should be avoided either alone or in combination with other mutations.
- A number of scientific publications have been devoted to the prediction of secondary structure. See Moult, 1996, Curr. Opin. Biotechnol. 7:422-27; Chou et al., 1974, Biochemistry 13:222-45; Chou et al., 1974, Biochemistry 113:211-22; Chou et al., 1978, Adv. Enzymol. Relat. Areas Mol. Biol. 47:45-48; Chou et al., 1978, Ann. Rev. Biochem. 47:251-276; and Chou et al., 1979, Biophys. J 26:367-84. Moreover, computer programs are currently available to assist with predicting secondary structure. One method of predicting secondary structure is based upon homology modeling. For example, two polypeptides or proteins which have a sequence identity of greater than 30%, or similarity greater than 40%, often have similar structural topologies. The recent growth of the protein structural database (PDB) has provided enhanced predictability of secondary structure, including the potential number of folds within the structure of a polypeptide or protein. See Holm et al., 1999, Nucleic Acids Res. 27:244-47. It has been suggested that there are a limited number of folds in a given polypeptide or protein and that once a critical number of structures have been resolved, structural prediction will become dramatically more accurate (Brenner et al., 1997, Curr. Opin. Struct. Biol. 7:369-76).
- Additional methods of predicting secondary structure include “threading” (Jones, 1997, Curr. Opin. Struct. Biol. 7:377-87; Sippl et al., 1996, Structure 4:15-19), “profile analysis” (Bowie et al., 1991, Science, 253:164-70; Gribskov et al., 1990, Methods Enzymol. 183:146-59; Gribskov et al., 1987, Proc. Nat. Acad. Sci. U.S.A. 84:4355-58), and “evolutionary linkage” (See Holm et aL, supra, and Brenner et al., supra).
- Preferred FGFR-L polypeptide variants include glycosylation variants wherein the number and/or type of glycosylation sites have been altered compared to the amino acid sequence set forth in either SEQ ID NO: 2 or SEQ ID NO: 5. In one embodiment, FGFR-L polypeptide variants comprise a greater or a lesser number of N-linked glycosylation sites than the amino acid sequence set forth in either SEQ ID NO: 2 or SEQ ID NO: 5. An N-linked glycosylation site is characterized by the sequence: Asn-X-Ser or Asn-X-Thr, wherein the amino acid residue designated as X may be any amino acid residue except proline. The substitution of amino acid residues to create this sequence provides a potential new site for the addition of an N-linked carbohydrate chain. Alternatively, substitutions that eliminate this sequence will remove an existing N-linked carbohydrate chain. Also provided is a rearrangement of N-linked carbohydrate chains wherein one or more N-linked glycosylation sites (typically those that are naturally occurring) are eliminated and one or more new N-linked sites are created. Additional preferred FGFR-L variants include cysteine variants, wherein one or more cysteine residues are deleted or substituted with another amino acid (e.g., serine) as compared to the amino acid sequence set forth in either SEQ ID NO: 2 or SEQ ID NO: 5. Cysteine variants are useful when FGFR-L polypeptides must be refolded into a biologically active conformation such as after the isolation of insoluble inclusion bodies. Cysteine variants generally have fewer cysteine residues than the native protein, and typically have an even number to minimize interactions resulting from unpaired cysteines.
- In other embodiments, related nucleic acid molecules comprise or consist of a nucleotide sequence encoding a polypeptide as set forth in either Seq Id No: 2 or SEQ ID NO: 5 with at least one amino acid insertion and wherein the polypeptide has an activity of the polypeptide set forth in either SEQ ID NO: 2 or SEQ ID NO: 5, or a
-
0 SEQUENCE LISTING <160> NUMBER OF SEQ ID NOS: 22 <210> SEQ ID NO 1 <211> LENGTH: 2277 <212> TYPE: DNA <213> ORGANISM: Mus musculus <220> FEATURE: <221> NAME/KEY: CDS <222> LOCATION: (87)..(1673) <221> NAME/KEY: sig_peptide <222> LOCATION: (87)..(146) <221> NAME/KEY: misc_feature <222> LOCATION: (1208)..(1271) <223> OTHER INFORMATION: predicted transmembrane domain <400> SEQUENCE: 1 gacctgggtc ttgcgggcct gagccctgag tggcgtccag tccagctccc agtgaccgcg 60 cccctgcttc aggtccgacc ggcgag atg acg cgg agc ccc gcg ctg ctg ctg 113 Met Thr Arg Ser Pro Ala Leu Leu Leu 1 5 ctg cta ttg ggg gcc ctc ccg tcg gct gag gcg gcg cga gga ccc cca 161 Leu Leu Leu Gly Ala Leu Pro Ser Ala Glu Ala Ala Arg Gly Pro Pro 10 15 20 25 aga atg gca gac aaa gtg gtc cca cgg cag gtg gcc cgc ctg ggc cgc 209 Arg Met Ala Asp Lys Val Val Pro Arg Gln Val Ala Arg Leu Gly Arg 30 35 40 act gtg cgg cta cag tgc cca gtg gag ggg gac cca cca ccg ttg acc 257 Thr Val Arg Leu Gln Cys Pro Val Glu Gly Asp Pro Pro Pro Leu Thr 45 50 55 atg tgg acc aaa gat ggc cgc aca atc cac agt ggc tgg agc cgc ttc 305 Met Trp Thr Lys Asp Gly Arg Thr Ile His Ser Gly Trp Ser Arg Phe 60 65 70 cgt gtg ctg ccc cag ggt ctg aag gtg aag gag gtg gag gcc gag gat 353 Arg Val Leu Pro Gln Gly Leu Lys Val Lys Glu Val Glu Ala Glu Asp 75 80 85 gcc ggt gtt tat gtg tgc aag gcc acc aat ggc ttt ggc agc ctc agc 401 Ala Gly Val Tyr Val Cys Lys Ala Thr Asn Gly Phe Gly Ser Leu Ser 90 95 100 105 gtc aac tac act ctc atc atc atg gat gat att agt cca ggg aag gag 449 Val Asn Tyr Thr Leu Ile Ile Met Asp Asp Ile Ser Pro Gly Lys Glu 110 115 120 agc cct ggg cca ggt ggt tct tcg ggg ggc cag gag gac cca gcc agc 497 Ser Pro Gly Pro Gly Gly Ser Ser Gly Gly Gln Glu Asp Pro Ala Ser 125 130 135 cag cag tgg gca cgg cct cgc ttc aca cag ccc tcc aag atg agg cgc 545 Gln Gln Trp Ala Arg Pro Arg Phe Thr Gln Pro Ser Lys Met Arg Arg 140 145 150 cga gtg att gca cgg cct gtg ggt agc tct gtg cgg ctc aag tgt gtg 593 Arg Val Ile Ala Arg Pro Val Gly Ser Ser Val Arg Leu Lys Cys Val 155 160 165 gcc agt ggg cac cca cgg cca gac atc atg tgg atg aag gat gac cag 641 Ala Ser Gly His Pro Arg Pro Asp Ile Met Trp Met Lys Asp Asp Gln 170 175 180 185 acc ttg acg cat cta gag gct agt gaa cac aga aag aag aag tgg aca 689 Thr Leu Thr His Leu Glu Ala Ser Glu His Arg Lys Lys Lys Trp Thr 190 195 200 ctg agc ttg aag aac ctg aag cct gaa gac agt ggc aag tac acg tgc 737 Leu Ser Leu Lys Asn Leu Lys Pro Glu Asp Ser Gly Lys Tyr Thr Cys 205 210 215 cgt gta tct aac aag gcc ggt gcc atc aac gcc acc tac aaa gtg gat 785 Arg Val Ser Asn Lys Ala Gly Ala Ile Asn Ala Thr Tyr Lys Val Asp 220 225 230 gta atc cag cgg act cgt tcc aag cct gtg ctc aca ggg aca cac cct 833 Val Ile Gln Arg Thr Arg Ser Lys Pro Val Leu Thr Gly Thr His Pro 235 240 245 gtg aac aca acg gtg gac ttc ggt ggg aca acg tcc ttc cag tgc aag 881 Val Asn Thr Thr Val Asp Phe Gly Gly Thr Thr Ser Phe Gln Cys Lys 250 255 260 265 gtg cgc agt gac gtg aag cct gtg atc cag tgg ctg aag cgg gtg gag 929 Val Arg Ser Asp Val Lys Pro Val Ile Gln Trp Leu Lys Arg Val Glu 270 275 280 tac ggc tcc gag gga cgc cac aac tcc acc att gat gtg ggt ggc cag 977 Tyr Gly Ser Glu Gly Arg His Asn Ser Thr Ile Asp Val Gly Gly Gln 285 290 295 aag ttt gtg gtg ttg ccc acg ggt gat gtg tgg tca cgg cct gat ggc 1025 Lys Phe Val Val Leu Pro Thr Gly Asp Val Trp Ser Arg Pro Asp Gly 300 305 310 tcc tac ctc aac aag ctg ctc atc tct cgg gcc cgc cag gat gat gct 1073 Ser Tyr Leu Asn Lys Leu Leu Ile Ser Arg Ala Arg Gln Asp Asp Ala 315 320 325 ggc atg tac atc tgc cta ggt gca aat acc atg ggc tac agt ttc cgt 1121 Gly Met Tyr Ile Cys Leu Gly Ala Asn Thr Met Gly Tyr Ser Phe Arg 330 335 340 345 agc gcc ttc ctc act gta tta cca gac ccc aaa cct cca ggg cct cct 1169 Ser Ala Phe Leu Thr Val Leu Pro Asp Pro Lys Pro Pro Gly Pro Pro 350 355 360 atg gct tct tca tcg tca tcc aca agc ctg cca tgg cct gtg gtg atc 1217 Met Ala Ser Ser Ser Ser Ser Thr Ser Leu Pro Trp Pro Val Val Ile 365 370 375 ggc atc cca gct ggt gct gtc ttc atc cta ggc act gtg ctg ctc tgg 1265 Gly Ile Pro Ala Gly Ala Val Phe Ile Leu Gly Thr Val Leu Leu Trp 380 385 390 ctt tgc cag acc aag aag aag cca tgt gcc cca gca tct aca ctt cct 1313 Leu Cys Gln Thr Lys Lys Lys Pro Cys Ala Pro Ala Ser Thr Leu Pro 395 400 405 gtg cct ggg cat cgt ccc cca ggg aca tcc cga gaa cgc agt ggt gac 1361 Val Pro Gly His Arg Pro Pro Gly Thr Ser Arg Glu Arg Ser Gly Asp 410 415 420 425 aag gac ctg ccc tca ttg gct gtg ggc ata tgt gag gag cat gga tcc 1409 Lys Asp Leu Pro Ser Leu Ala Val Gly Ile Cys Glu Glu His Gly Ser 430 435 440 gcc atg gcc ccc cag cac atc ctg gcc tct ggc tca act gct ggc ccc 1457 Ala Met Ala Pro Gln His Ile Leu Ala Ser Gly Ser Thr Ala Gly Pro 445 450 455 aag ctg tac ccc aag cta tac aca gat gtg cac aca cac aca cat aca 1505 Lys Leu Tyr Pro Lys Leu Tyr Thr Asp Val His Thr His Thr His Thr 460 465 470 cac acc tgc act cac acg ctc tca tgt gga ggg caa ggt tca tca aca 1553 His Thr Cys Thr His Thr Leu Ser Cys Gly Gly Gln Gly Ser Ser Thr 475 480 485 cca gca tgt cca cta tca gtg cta aat aca gcg aat ctc caa gca ctg 1601 Pro Ala Cys Pro Leu Ser Val Leu Asn Thr Ala Asn Leu Gln Ala Leu 490 495 500 505 tgt cct gag gta ggc ata tgg ggg cca agg caa cag gtt ggg aga att 1649 Cys Pro Glu Val Gly Ile Trp Gly Pro Arg Gln Gln Val Gly Arg Ile 510 515 520 gag aac aat gga gga aga gta tct tagggtgcct tatggtggac actcacaaac 1703 Glu Asn Asn Gly Gly Arg Val Ser 525 ttggccatat agatgtatgt actaccagat gaacagccag ccagattcac acacgcacat 1763 gtttaaacgt gtaaacgtgt gcacaactgc acacacaacc tgagaaacct tcaggaggat 1823 ttgtggtgtg actttgcagt gacatgtagc gatggctagt tgaaggaatc tccctcatgt 1883 cttagtggtc atggccactt ccccacccct gcccatctgt gttcctgcct ggccttggtg 1943 tgcttccgtg tgccctgggt atcaggagcc tatcatcaac ctgactgggg tgagcagtgc 2003 agccatgcct ggaggtttga gccaccctcc ccttgctaga gagaagggcc tcaatattta 2063 tatttaagaa atgaaataat attaataata atgtaaggag ggctgggaca cagggactct 2123 ggccttccct ggggcctggg acctgcctgg ccttgtggtt acattgggta ccctcactgt 2183 ccatggctgc ctggtctctg taattttata tagagtttga gctgaagcct cgtatattta 2243 atttattttg ttaaacaaga aaaaaaaaaa aaaa 2277 <210> SEQ ID NO 2 <211> LENGTH: 529 <212> TYPE: PRT <213> ORGANISM: Mus musculus <400> SEQUENCE: 2 Met Thr Arg Ser Pro Ala Leu Leu Leu Leu Leu Leu Gly Ala Leu Pro 1 5 10 15 Ser Ala Glu Ala Ala Arg Gly Pro Pro Arg Met Ala Asp Lys Val Val 20 25 30 Pro Arg Gln Val Ala Arg Leu Gly Arg Thr Val Arg Leu Gln Cys Pro 35 40 45 Val Glu Gly Asp Pro Pro Pro Leu Thr Met Trp Thr Lys Asp Gly Arg 50 55 60 Thr Ile His Ser Gly Trp Ser Arg Phe Arg Val Leu Pro Gln Gly Leu 65 70 75 80 Lys Val Lys Glu Val Glu Ala Glu Asp Ala Gly Val Tyr Val Cys Lys 85 90 95 Ala Thr Asn Gly Phe Gly Ser Leu Ser Val Asn Tyr Thr Leu Ile Ile 100 105 110 Met Asp Asp Ile Ser Pro Gly Lys Glu Ser Pro Gly Pro Gly Gly Ser 115 120 125 Ser Gly Gly Gln Glu Asp Pro Ala Ser Gln Gln Trp Ala Arg Pro Arg 130 135 140 Phe Thr Gln Pro Ser Lys Met Arg Arg Arg Val Ile Ala Arg Pro Val 145 150 155 160 Gly Ser Ser Val Arg Leu Lys Cys Val Ala Ser Gly His Pro Arg Pro 165 170 175 Asp Ile Met Trp Met Lys Asp Asp Gln Thr Leu Thr His Leu Glu Ala 180 185 190 Ser Glu His Arg Lys Lys Lys Trp Thr Leu Ser Leu Lys Asn Leu Lys 195 200 205 Pro Glu Asp Ser Gly Lys Tyr Thr Cys Arg Val Ser Asn Lys Ala Gly 210 215 220 Ala Ile Asn Ala Thr Tyr Lys Val Asp Val Ile Gln Arg Thr Arg Ser 225 230 235 240 Lys Pro Val Leu Thr Gly Thr His Pro Val Asn Thr Thr Val Asp Phe 245 250 255 Gly Gly Thr Thr Ser Phe Gln Cys Lys Val Arg Ser Asp Val Lys Pro 260 265 270 Val Ile Gln Trp Leu Lys Arg Val Glu Tyr Gly Ser Glu Gly Arg His 275 280 285 Asn Ser Thr Ile Asp Val Gly Gly Gln Lys Phe Val Val Leu Pro Thr 290 295 300 Gly Asp Val Trp Ser Arg Pro Asp Gly Ser Tyr Leu Asn Lys Leu Leu 305 310 315 320 Ile Ser Arg Ala Arg Gln Asp Asp Ala Gly Met Tyr Ile Cys Leu Gly 325 330 335 Ala Asn Thr Met Gly Tyr Ser Phe Arg Ser Ala Phe Leu Thr Val Leu 340 345 350 Pro Asp Pro Lys Pro Pro Gly Pro Pro Met Ala Ser Ser Ser Ser Ser 355 360 365 Thr Ser Leu Pro Trp Pro Val Val Ile Gly Ile Pro Ala Gly Ala Val 370 375 380 Phe Ile Leu Gly Thr Val Leu Leu Trp Leu Cys Gln Thr Lys Lys Lys 385 390 395 400 Pro Cys Ala Pro Ala Ser Thr Leu Pro Val Pro Gly His Arg Pro Pro 405 410 415 Gly Thr Ser Arg Glu Arg Ser Gly Asp Lys Asp Leu Pro Ser Leu Ala 420 425 430 Val Gly Ile Cys Glu Glu His Gly Ser Ala Met Ala Pro Gln His Ile 435 440 445 Leu Ala Ser Gly Ser Thr Ala Gly Pro Lys Leu Tyr Pro Lys Leu Tyr 450 455 460 Thr Asp Val His Thr His Thr His Thr His Thr Cys Thr His Thr Leu 465 470 475 480 Ser Cys Gly Gly Gln Gly Ser Ser Thr Pro Ala Cys Pro Leu Ser Val 485 490 495 Leu Asn Thr Ala Asn Leu Gln Ala Leu Cys Pro Glu Val Gly Ile Trp 500 505 510 Gly Pro Arg Gln Gln Val Gly Arg Ile Glu Asn Asn Gly Gly Arg Val 515 520 525 Ser <210> SEQ ID NO 3 <211> LENGTH: 509 <212> TYPE: PRT <213> ORGANISM: Mus musculus <220> FEATURE: <221> NAME/KEY: TRANSMEM <222> LOCATION: (355)..(375) <400> SEQUENCE: 3 Ala Arg Gly Pro Pro Arg Met Ala Asp Lys Val Val Pro Arg Gln Val 1 5 10 15 Ala Arg Leu Gly Arg Thr Val Arg Leu Gln Cys Pro Val Glu Gly Asp 20 25 30 Pro Pro Pro Leu Thr Met Trp Thr Lys Asp Gly Arg Thr Ile His Ser 35 40 45 Gly Trp Ser Arg Phe Arg Val Leu Pro Gln Gly Leu Lys Val Lys Glu 50 55 60 Val Glu Ala Glu Asp Ala Gly Val Tyr Val Cys Lys Ala Thr Asn Gly 65 70 75 80 Phe Gly Ser Leu Ser Val Asn Tyr Thr Leu Ile Ile Met Asp Asp Ile 85 90 95 Ser Pro Gly Lys Glu Ser Pro Gly Pro Gly Gly Ser Ser Gly Gly Gln 100 105 110 Glu Asp Pro Ala Ser Gln Gln Trp Ala Arg Pro Arg Phe Thr Gln Pro 115 120 125 Ser Lys Met Arg Arg Arg Val Ile Ala Arg Pro Val Gly Ser Ser Val 130 135 140 Arg Leu Lys Cys Val Ala Ser Gly His Pro Arg Pro Asp Ile Met Trp 145 150 155 160 Met Lys Asp Asp Gln Thr Leu Thr His Leu Glu Ala Ser Glu His Arg 165 170 175 Lys Lys Lys Trp Thr Leu Ser Leu Lys Asn Leu Lys Pro Glu Asp Ser 180 185 190 Gly Lys Tyr Thr Cys Arg Val Ser Asn Lys Ala Gly Ala Ile Asn Ala 195 200 205 Thr Tyr Lys Val Asp Val Ile Gln Arg Thr Arg Ser Lys Pro Val Leu 210 215 220 Thr Gly Thr His Pro Val Asn Thr Thr Val Asp Phe Gly Gly Thr Thr 225 230 235 240 Ser Phe Gln Cys Lys Val Arg Ser Asp Val Lys Pro Val Ile Gln Trp 245 250 255 Leu Lys Arg Val Glu Tyr Gly Ser Glu Gly Arg His Asn Ser Thr Ile 260 265 270 Asp Val Gly Gly Gln Lys Phe Val Val Leu Pro Thr Gly Asp Val Trp 275 280 285 Ser Arg Pro Asp Gly Ser Tyr Leu Asn Lys Leu Leu Ile Ser Arg Ala 290 295 300 Arg Gln Asp Asp Ala Gly Met Tyr Ile Cys Leu Gly Ala Asn Thr Met 305 310 315 320 Gly Tyr Ser Phe Arg Ser Ala Phe Leu Thr Val Leu Pro Asp Pro Lys 325 330 335 Pro Pro Gly Pro Pro Met Ala Ser Ser Ser Ser Ser Thr Ser Leu Pro 340 345 350 Trp Pro Val Val Ile Gly Ile Pro Ala Gly Ala Val Phe Ile Leu Gly 355 360 365 Thr Val Leu Leu Trp Leu Cys Gln Thr Lys Lys Lys Pro Cys Ala Pro 370 375 380 Ala Ser Thr Leu Pro Val Pro Gly His Arg Pro Pro Gly Thr Ser Arg 385 390 395 400 Glu Arg Ser Gly Asp Lys Asp Leu Pro Ser Leu Ala Val Gly Ile Cys 405 410 415 Glu Glu His Gly Ser Ala Met Ala Pro Gln His Ile Leu Ala Ser Gly 420 425 430 Ser Thr Ala Gly Pro Lys Leu Tyr Pro Lys Leu Tyr Thr Asp Val His 435 440 445 Thr His Thr His Thr His Thr Cys Thr His Thr Leu Ser Cys Gly Gly 450 455 460 Gln Gly Ser Ser Thr Pro Ala Cys Pro Leu Ser Val Leu Asn Thr Ala 465 470 475 480 Asn Leu Gln Ala Leu Cys Pro Glu Val Gly Ile Trp Gly Pro Arg Gln 485 490 495 Gln Val Gly Arg Ile Glu Asn Asn Gly Gly Arg Val Ser 500 505 <210> SEQ ID NO 4 <211> LENGTH: 1450 <212> TYPE: DNA <213> ORGANISM: Homo sapiens <220> FEATURE: <221> NAME/KEY: CDS <222> LOCATION: (33)..(1448) <221> NAME/KEY: sig_peptide <222> LOCATION: (33)..(104) <221> NAME/KEY: misc_feature <222> LOCATION: (1167)..(1229) <400> SEQUENCE: 4 gcggccgcga ccccaggtcc ggacaggccg ag atg acg ccg agc ccc ctg ttg 53 Met Thr Pro Ser Pro Leu Leu 1 5 ctg ctc ctg ctg ccg ccg ctg ctg ctg ggg gcc ttc cca ccg gcc gcc 101 Leu Leu Leu Leu Pro Pro Leu Leu Leu Gly Ala Phe Pro Pro Ala Ala 10 15 20 gcc gcc cga ggc ccc cca aag atg gcg gac aag gtg gtc cca cgg cag 149 Ala Ala Arg Gly Pro Pro Lys Met Ala Asp Lys Val Val Pro Arg Gln 25 30 35 gtg gcc cgg ctg ggc cgc act gtg cgg ctg cag tgc cca gtg gag ggg 197 Val Ala Arg Leu Gly Arg Thr Val Arg Leu Gln Cys Pro Val Glu Gly 40 45 50 55 gac ccg ccg ccg ctg acc atg tgg acc aag gat ggc cgc acc atc cac 245 Asp Pro Pro Pro Leu Thr Met Trp Thr Lys Asp Gly Arg Thr Ile His 60 65 70 agc ggc tgg agc cgc ttc cgc gtg ctg ccg cag ggg ctg aag gtg aag 293 Ser Gly Trp Ser Arg Phe Arg Val Leu Pro Gln Gly Leu Lys Val Lys 75 80 85 cag gtg gag cgg gag gat gcc ggc gtg tac gtg tgc aag gcc acc aac 341 Gln Val Glu Arg Glu Asp Ala Gly Val Tyr Val Cys Lys Ala Thr Asn 90 95 100 ggc ttc ggc agc ctg agc gtc aac tac acc ctc gtc gtg ctg gat gac 389 Gly Phe Gly Ser Leu Ser Val Asn Tyr Thr Leu Val Val Leu Asp Asp 105 110 115 att agc cca ggg aag gag agc ctg ggg ccc gac agc tcc tct ggg ggt 437 Ile Ser Pro Gly Lys Glu Ser Leu Gly Pro Asp Ser Ser Ser Gly Gly 120 125 130 135 caa gag gac ccc gcc agc cag cag tgg gca cga ccg cgc ttc aca cag 485 Gln Glu Asp Pro Ala Ser Gln Gln Trp Ala Arg Pro Arg Phe Thr Gln 140 145 150 ccc tcc aag atg agg cgc cgg gtg atc gca cgg ccc gtg ggt agc tcc 533 Pro Ser Lys Met Arg Arg Arg Val Ile Ala Arg Pro Val Gly Ser Ser 155 160 165 gtg cgg ctc aag tgc gtg gcc agc ggg cac cct cgg ccc gac atc acg 581 Val Arg Leu Lys Cys Val Ala Ser Gly His Pro Arg Pro Asp Ile Thr 170 175 180 tgg atg aag gac gac cag gcc ttg acg cgc cca gag gcc gct gag ccc 629 Trp Met Lys Asp Asp Gln Ala Leu Thr Arg Pro Glu Ala Ala Glu Pro 185 190 195 agg aag aag aag tgg aca ctg agc ctg aag aac ctg cgg ccg gag gac 677 Arg Lys Lys Lys Trp Thr Leu Ser Leu Lys Asn Leu Arg Pro Glu Asp 200 205 210 215 agc ggc aaa tac acc tgc cgc gtg tcg aac cgc gcg ggc gcc atc aac 725 Ser Gly Lys Tyr Thr Cys Arg Val Ser Asn Arg Ala Gly Ala Ile Asn 220 225 230 gcc acc tac aag gtg gat gtg atc cag cgg acc cgt tcc aag ccc gtg 773 Ala Thr Tyr Lys Val Asp Val Ile Gln Arg Thr Arg Ser Lys Pro Val 235 240 245 ctc aca ggc acg cac ccc gtg aac acg acg gtg gac ttc ggg ggg acc 821 Leu Thr Gly Thr His Pro Val Asn Thr Thr Val Asp Phe Gly Gly Thr 250 255 260 acg tcc ttc cag tgc aag gtg cgc agc gac gtg aag ccg gtg atc cag 869 Thr Ser Phe Gln Cys Lys Val Arg Ser Asp Val Lys Pro Val Ile Gln 265 270 275 tgg ctg aag cgc gtg gag tac ggc gct gag ggc cgc cac aac tcc acc 917 Trp Leu Lys Arg Val Glu Tyr Gly Ala Glu Gly Arg His Asn Ser Thr 280 285 290 295 atc gat gtg ggc ggc cag aag ttt gtg gtg ctg ccc acg ggt gac gtg 965 Ile Asp Val Gly Gly Gln Lys Phe Val Val Leu Pro Thr Gly Asp Val 300 305 310 tgg tcg cgg ccc gac ggc tcc tac ctc aat aag ctg ctc atc acc cgt 1013 Trp Ser Arg Pro Asp Gly Ser Tyr Leu Asn Lys Leu Leu Ile Thr Arg 315 320 325 gcc cgc cag gac gat gcg ggc atg tac atc tgc ctt ggc gcc aac acc 1061 Ala Arg Gln Asp Asp Ala Gly Met Tyr Ile Cys Leu Gly Ala Asn Thr 330 335 340 atg ggc tac agc ttc cgc agc gcc ttc ctc acc gtg ctg cca gac cca 1109 Met Gly Tyr Ser Phe Arg Ser Ala Phe Leu Thr Val Leu Pro Asp Pro 345 350 355 aaa ccg cca ggg cca cct gtg gcc tcc tcg tcc tcg gcc act agc ctg 1157 Lys Pro Pro Gly Pro Pro Val Ala Ser Ser Ser Ser Ala Thr Ser Leu 360 365 370 375 ccg tgg ccc gtg gtc atc ggc atc cca gcc ggc gct gtc ttc atc ctg 1205 Pro Trp Pro Val Val Ile Gly Ile Pro Ala Gly Ala Val Phe Ile Leu 380 385 390 ggc acc ctg ctc ctg tgg ctt tgc cag gcc cag aag aag ccg tgc acc 1253 Gly Thr Leu Leu Leu Trp Leu Cys Gln Ala Gln Lys Lys Pro Cys Thr 395 400 405 ccc gcg cct gcc cct ccc ctg cct ggg cac cgc ccg ccg ggg acg gcc 1301 Pro Ala Pro Ala Pro Pro Leu Pro Gly His Arg Pro Pro Gly Thr Ala 410 415 420 cgc gac cgc agc gga gac aag gac ctt ccc tcg ttg gcc gcc ctc agc 1349 Arg Asp Arg Ser Gly Asp Lys Asp Leu Pro Ser Leu Ala Ala Leu Ser 425 430 435 gct ggc cct ggt gtg ggg ctg tgt gag gag cat ggg tct ccg gca gcc 1397 Ala Gly Pro Gly Val Gly Leu Cys Glu Glu His Gly Ser Pro Ala Ala 440 445 450 455 ccc cag cac tta ctg ggc cca ggc cca gtt gct ggc cct aag ttg tac 1445 Pro Gln His Leu Leu Gly Pro Gly Pro Val Ala Gly Pro Lys Leu Tyr 460 465 470 ccc ta 1450 Pro <210> SEQ ID NO 5 <211> LENGTH: 472 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 5 Met Thr Pro Ser Pro Leu Leu Leu Leu Leu Leu Pro Pro Leu Leu Leu 1 5 10 15 Gly Ala Phe Pro Pro Ala Ala Ala Ala Arg Gly Pro Pro Lys Met Ala 20 25 30 Asp Lys Val Val Pro Arg Gln Val Ala Arg Leu Gly Arg Thr Val Arg 35 40 45 Leu Gln Cys Pro Val Glu Gly Asp Pro Pro Pro Leu Thr Met Trp Thr 50 55 60 Lys Asp Gly Arg Thr Ile His Ser Gly Trp Ser Arg Phe Arg Val Leu 65 70 75 80 Pro Gln Gly Leu Lys Val Lys Gln Val Glu Arg Glu Asp Ala Gly Val 85 90 95 Tyr Val Cys Lys Ala Thr Asn Gly Phe Gly Ser Leu Ser Val Asn Tyr 100 105 110 Thr Leu Val Val Leu Asp Asp Ile Ser Pro Gly Lys Glu Ser Leu Gly 115 120 125 Pro Asp Ser Ser Ser Gly Gly Gln Glu Asp Pro Ala Ser Gln Gln Trp 130 135 140 Ala Arg Pro Arg Phe Thr Gln Pro Ser Lys Met Arg Arg Arg Val Ile 145 150 155 160 Ala Arg Pro Val Gly Ser Ser Val Arg Leu Lys Cys Val Ala Ser Gly 165 170 175 His Pro Arg Pro Asp Ile Thr Trp Met Lys Asp Asp Gln Ala Leu Thr 180 185 190 Arg Pro Glu Ala Ala Glu Pro Arg Lys Lys Lys Trp Thr Leu Ser Leu 195 200 205 Lys Asn Leu Arg Pro Glu Asp Ser Gly Lys Tyr Thr Cys Arg Val Ser 210 215 220 Asn Arg Ala Gly Ala Ile Asn Ala Thr Tyr Lys Val Asp Val Ile Gln 225 230 235 240 Arg Thr Arg Ser Lys Pro Val Leu Thr Gly Thr His Pro Val Asn Thr 245 250 255 Thr Val Asp Phe Gly Gly Thr Thr Ser Phe Gln Cys Lys Val Arg Ser 260 265 270 Asp Val Lys Pro Val Ile Gln Trp Leu Lys Arg Val Glu Tyr Gly Ala 275 280 285 Glu Gly Arg His Asn Ser Thr Ile Asp Val Gly Gly Gln Lys Phe Val 290 295 300 Val Leu Pro Thr Gly Asp Val Trp Ser Arg Pro Asp Gly Ser Tyr Leu 305 310 315 320 Asn Lys Leu Leu Ile Thr Arg Ala Arg Gln Asp Asp Ala Gly Met Tyr 325 330 335 Ile Cys Leu Gly Ala Asn Thr Met Gly Tyr Ser Phe Arg Ser Ala Phe 340 345 350 Leu Thr Val Leu Pro Asp Pro Lys Pro Pro Gly Pro Pro Val Ala Ser 355 360 365 Ser Ser Ser Ala Thr Ser Leu Pro Trp Pro Val Val Ile Gly Ile Pro 370 375 380 Ala Gly Ala Val Phe Ile Leu Gly Thr Leu Leu Leu Trp Leu Cys Gln 385 390 395 400 Ala Gln Lys Lys Pro Cys Thr Pro Ala Pro Ala Pro Pro Leu Pro Gly 405 410 415 His Arg Pro Pro Gly Thr Ala Arg Asp Arg Ser Gly Asp Lys Asp Leu 420 425 430 Pro Ser Leu Ala Ala Leu Ser Ala Gly Pro Gly Val Gly Leu Cys Glu 435 440 445 Glu His Gly Ser Pro Ala Ala Pro Gln His Leu Leu Gly Pro Gly Pro 450 455 460 Val Ala Gly Pro Lys Leu Tyr Pro 465 470 <210> SEQ ID NO 6 <211> LENGTH: 448 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <220> FEATURE: <221> NAME/KEY: TRANSMEM <222> LOCATION: (355)..(375) <400> SEQUENCE: 6 Ala Arg Gly Pro Pro Lys Met Ala Asp Lys Val Val Pro Arg Gln Val 1 5 10 15 Ala Arg Leu Gly Arg Thr Val Arg Leu Gln Cys Pro Val Glu Gly Asp 20 25 30 Pro Pro Pro Leu Thr Met Trp Thr Lys Asp Gly Arg Thr Ile His Ser 35 40 45 Gly Trp Ser Arg Phe Arg Val Leu Pro Gln Gly Leu Lys Val Lys Gln 50 55 60 Val Glu Arg Glu Asp Ala Gly Val Tyr Val Cys Lys Ala Thr Asn Gly 65 70 75 80 Phe Gly Ser Leu Ser Val Asn Tyr Thr Leu Val Val Leu Asp Asp Ile 85 90 95 Ser Pro Gly Lys Glu Ser Leu Gly Pro Asp Ser Ser Ser Gly Gly Gln 100 105 110 Glu Asp Pro Ala Ser Gln Gln Trp Ala Arg Pro Arg Phe Thr Gln Pro 115 120 125 Ser Lys Met Arg Arg Arg Val Ile Ala Arg Pro Val Gly Ser Ser Val 130 135 140 Arg Leu Lys Cys Val Ala Ser Gly His Pro Arg Pro Asp Ile Thr Trp 145 150 155 160 Met Lys Asp Asp Gln Ala Leu Thr Arg Pro Glu Ala Ala Glu Pro Arg 165 170 175 Lys Lys Lys Trp Thr Leu Ser Leu Lys Asn Leu Arg Pro Glu Asp Ser 180 185 190 Gly Lys Tyr Thr Cys Arg Val Ser Asn Arg Ala Gly Ala Ile Asn Ala 195 200 205 Thr Tyr Lys Val Asp Val Ile Gln Arg Thr Arg Ser Lys Pro Val Leu 210 215 220 Thr Gly Thr His Pro Val Asn Thr Thr Val Asp Phe Gly Gly Thr Thr 225 230 235 240 Ser Phe Gln Cys Lys Val Arg Ser Asp Val Lys Pro Val Ile Gln Trp 245 250 255 Leu Lys Arg Val Glu Tyr Gly Ala Glu Gly Arg His Asn Ser Thr Ile 260 265 270 Asp Val Gly Gly Gln Lys Phe Val Val Leu Pro Thr Gly Asp Val Trp 275 280 285 Ser Arg Pro Asp Gly Ser Tyr Leu Asn Lys Leu Leu Ile Thr Arg Ala 290 295 300 Arg Gln Asp Asp Ala Gly Met Tyr Ile Cys Leu Gly Ala Asn Thr Met 305 310 315 320 Gly Tyr Ser Phe Arg Ser Ala Phe Leu Thr Val Leu Pro Asp Pro Lys 325 330 335 Pro Pro Gly Pro Pro Val Ala Ser Ser Ser Ser Ala Thr Ser Leu Pro 340 345 350 Trp Pro Val Val Ile Gly Ile Pro Ala Gly Ala Val Phe Ile Leu Gly 355 360 365 Thr Leu Leu Leu Trp Leu Cys Gln Ala Gln Lys Lys Pro Cys Thr Pro 370 375 380 Ala Pro Ala Pro Pro Leu Pro Gly His Arg Pro Pro Gly Thr Ala Arg 385 390 395 400 Asp Arg Ser Gly Asp Lys Asp Leu Pro Ser Leu Ala Ala Leu Ser Ala 405 410 415 Gly Pro Gly Val Gly Leu Cys Glu Glu His Gly Ser Pro Ala Ala Pro 420 425 430 Gln His Leu Leu Gly Pro Gly Pro Val Ala Gly Pro Lys Leu Tyr Pro 435 440 445 <210> SEQ ID NO 7 <211> LENGTH: 574 <212> TYPE: PRT <213> ORGANISM: Pleurodeles waltlii <400> SEQUENCE: 7 Met Gly Val Gln Lys Asp Ser Arg Asp Ile Arg Trp Asn Arg Thr Thr 1 5 10 15 Arg Pro Leu Ala Leu Leu Leu Cys Gly Leu Leu Ala Phe Ser Ala Leu 20 25 30 Ser Cys Ala Arg Thr Leu Pro Glu Gly Arg Lys Ala Asn Leu Ala Glu 35 40 45 Leu Val Ser Glu Glu Glu Glu His Phe Leu Leu Asp Pro Gly Asn Ala 50 55 60 Leu Arg Leu Phe Cys Asp Thr Asn Gln Thr Thr Ile Val Asn Trp Tyr 65 70 75 80 Thr Glu Ser Thr Arg Leu Gln His Gly Gly Arg Ile Arg Leu Thr Asp 85 90 95 Thr Val Leu Glu Ile Ala Asp Val Thr Tyr Glu Asp Ser Gly Leu Tyr 100 105 110 Leu Cys Val Val Pro Gly Thr Gly His Ile Leu Arg Asn Phe Thr Ile 115 120 125 Ser Val Val Asp Ser Leu Ala Ser Gly Asp Asp Asp Asp Glu Asp His 130 135 140 Gly Arg Glu Asp Ser Ala Gly Asp Met Gly Glu Asp Pro Pro Tyr Ser 145 150 155 160 Thr Ser Tyr Arg Ala Pro Phe Trp Ser Gln Pro Gln Arg Met Asp Lys 165 170 175 Lys Leu Tyr Ala Val Pro Ala Gly Asn Thr Val Lys Phe Arg Cys Pro 180 185 190 Ser Ala Gly Asn Pro Thr Pro Gly Ile Arg Trp Leu Lys Asn Gly Arg 195 200 205 Glu Phe Gly Gly Glu His Arg Ile Gly Gly Ile Arg Leu Arg His Gln 210 215 220 His Trp Ser Leu Val Met Glu Ser Val Val Pro Ser Asp Arg Gly Asn 225 230 235 240 Tyr Thr Cys Leu Val Glu Asn Lys Phe Gly Ser Ile Ser Tyr Ser Tyr 245 250 255 Leu Leu Asp Val Leu Glu Arg Ser Pro His Arg Pro Ile Leu Gln Ala 260 265 270 Gly Leu Pro Ala Asn Thr Thr Ala Met Leu Gly Ser Asp Val Gln Phe 275 280 285 Phe Cys Lys Val Tyr Ser Asp Ala Gln Pro His Ile Gln Trp Leu Lys 290 295 300 His Ile Glu Val Asn Gly Ser Arg Tyr Gly Pro Asp Gly Val Pro Phe 305 310 315 320 Val Gln Val Leu Lys Thr Ala Asp Ile Asn Ser Ser Glu Val Glu Val 325 330 335 Leu Tyr Leu His Asn Val Ser Phe Glu Asp Ala Gly Glu Tyr Thr Cys 340 345 350 Leu Ala Gly Asn Ser Ile Gly Leu Ser Tyr Gln Ser Ala Trp Leu Thr 355 360 365 Val Leu Pro Glu Glu Asp Phe Ala Lys Glu Ala Glu Gly Pro Glu Thr 370 375 380 Arg Tyr Thr Asp Ile Ile Ile Tyr Thr Ser Gly Ser Leu Ala Leu Leu 385 390 395 400 Met Ala Ala Val Ile Val Val Leu Cys Arg Met Gln Leu Pro Pro Thr 405 410 415 Lys Thr His Leu Glu Pro Ala Thr Val His Lys Leu Ser Arg Phe Pro 420 425 430 Leu Met Arg Gln Phe Ser Leu Glu Ser Ser Ser Ser Gly Lys Ser Ser 435 440 445 Thr Ser Leu Val Arg Val Thr Arg Leu Ser Ser Ser Cys Thr Pro Met 450 455 460 Leu Pro Gly Val Leu Glu Phe Asp Leu Pro Leu Asp Ser Lys Trp Glu 465 470 475 480 Phe Pro Arg Glu Arg Leu Val Leu Gly Lys Pro Leu Gly Glu Gly Cys 485 490 495 Phe Gly Gln Val Val Arg Ala Glu Ala Tyr Gly Ile Asn Lys Asp Gln 500 505 510 Pro Asp Lys Ala Ile Thr Val Ala Ile Lys Ile Val Lys Asp Lys Gly 515 520 525 Thr Asp Lys Glu Leu Ser Asp Leu Ile Ser Glu Met Glu Leu Met Lys 530 535 540 Leu Met Gly Lys His Lys Asn Ile Ile Asn Leu Leu Gly Val Cys Thr 545 550 555 560 Gln Asp Gly Pro Leu Tyr Met Ile Val Glu Tyr Ala Ser Lys 565 570 <210> SEQ ID NO 8 <211> LENGTH: 504 <212> TYPE: PRT <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial Sequence: virtual human FGFR-L amino acid sequence comprising residues 1-472 of SEQ ID NO: 5 and residues 473-504 of GenBank accession no. AJ277437 <400> SEQUENCE: 8 Met Thr Pro Ser Pro Leu Leu Leu Leu Leu Leu Pro Pro Leu Leu Leu 1 5 10 15 Gly Ala Phe Pro Pro Ala Ala Ala Ala Arg Gly Pro Pro Lys Met Ala 20 25 30 Asp Lys Val Val Pro Arg Gln Val Ala Arg Leu Gly Arg Thr Val Arg 35 40 45 Leu Gln Cys Pro Val Glu Gly Asp Pro Pro Pro Leu Thr Met Trp Thr 50 55 60 Lys Asp Gly Arg Thr Ile His Ser Gly Trp Ser Arg Phe Arg Val Leu 65 70 75 80 Pro Gln Gly Leu Lys Val Lys Gln Val Glu Arg Glu Asp Ala Gly Val 85 90 95 Tyr Val Cys Lys Ala Thr Asn Gly Phe Gly Ser Leu Ser Val Asn Tyr 100 105 110 Thr Leu Val Val Leu Asp Asp Ile Ser Pro Gly Lys Glu Ser Leu Gly 115 120 125 Pro Asp Ser Ser Ser Gly Gly Gln Glu Asp Pro Ala Ser Gln Gln Trp 130 135 140 Ala Arg Pro Arg Phe Thr Gln Pro Ser Lys Met Arg Arg Arg Val Ile 145 150 155 160 Ala Arg Pro Val Gly Ser Ser Val Arg Leu Lys Cys Val Ala Ser Gly 165 170 175 His Pro Arg Pro Asp Ile Thr Trp Met Lys Asp Asp Gln Ala Leu Thr 180 185 190 Arg Pro Glu Ala Ala Glu Pro Arg Lys Lys Lys Trp Thr Leu Ser Leu 195 200 205 Lys Asn Leu Arg Pro Glu Asp Ser Gly Lys Tyr Thr Cys Arg Val Ser 210 215 220 Asn Arg Ala Gly Ala Ile Asn Ala Thr Tyr Lys Val Asp Val Ile Gln 225 230 235 240 Arg Thr Arg Ser Lys Pro Val Leu Thr Gly Thr His Pro Val Asn Thr 245 250 255 Thr Val Asp Phe Gly Gly Thr Thr Ser Phe Gln Cys Lys Val Arg Ser 260 265 270 Asp Val Lys Pro Val Ile Gln Trp Leu Lys Arg Val Glu Tyr Gly Ala 275 280 285 Glu Gly Arg His Asn Ser Thr Ile Asp Val Gly Gly Gln Lys Phe Val 290 295 300 Val Leu Pro Thr Gly Asp Val Trp Ser Arg Pro Asp Gly Ser Tyr Leu 305 310 315 320 Asn Lys Leu Leu Ile Thr Arg Ala Arg Gln Asp Asp Ala Gly Met Tyr 325 330 335 Ile Cys Leu Gly Ala Asn Thr Met Gly Tyr Ser Phe Arg Ser Ala Phe 340 345 350 Leu Thr Val Leu Pro Asp Pro Lys Pro Pro Gly Pro Pro Val Ala Ser 355 360 365 Ser Ser Ser Ala Thr Ser Leu Pro Trp Pro Val Val Ile Gly Ile Pro 370 375 380 Ala Gly Ala Val Phe Ile Leu Gly Thr Leu Leu Leu Trp Leu Cys Gln 385 390 395 400 Ala Gln Lys Lys Pro Cys Thr Pro Ala Pro Ala Pro Pro Leu Pro Gly 405 410 415 His Arg Pro Pro Gly Thr Ala Arg Asp Arg Ser Gly Asp Lys Asp Leu 420 425 430 Pro Ser Leu Ala Ala Leu Ser Ala Gly Pro Gly Val Gly Leu Cys Glu 435 440 445 Glu His Gly Ser Pro Ala Ala Pro Gln His Leu Leu Gly Pro Gly Pro 450 455 460 Val Ala Gly Pro Lys Leu Tyr Pro Lys Leu Tyr Thr Asp Ile His Thr 465 470 475 480 His Thr His Thr His Ser His Thr His Ser His Val Glu Gly Lys Val 485 490 495 His Gln His Ile His Tyr Gln Cys 500 <210> SEQ ID NO 9 <211> LENGTH: 11 <212> TYPE: PRT <213> ORGANISM: Human immunodeficiency virus type 1 <400> SEQUENCE: 9 Tyr Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg 1 5 10 <210> SEQ ID NO 10 <211> LENGTH: 15 <212> TYPE: PRT <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial Sequence: internalizing domain derived from HIV tat protein <400> SEQUENCE: 10 Gly Gly Gly Gly Tyr Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg 1 5 10 15 <210> SEQ ID NO 11 <211> LENGTH: 20 <212> TYPE: PRT <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial Sequence: predicted signal peptide of murine FGFR-L polypeptide <400> SEQUENCE: 11 Met Thr Arg Ser Pro Ala Leu Leu Leu Leu Leu Leu Gly Ala Leu Pro 1 5 10 15 Ser Ala Glu Ala 20 <210> SEQ ID NO 12 <211> LENGTH: 25 <212> TYPE: PRT <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial Sequence: predicted transmmebrane domain for murine FRL polypeptide <400> SEQUENCE: 12 Leu Pro Trp Pro Val Val Ile Gly Ile Pro Ala Gly Ala Val Phe Ile 1 5 10 15 Leu Gly Thr Val Leu Leu Trp Leu Cys 20 25 <210> SEQ ID NO 13 <211> LENGTH: 24 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial Sequence: oligonucleotide; PCR primer <400> SEQUENCE: 13 cgctgaccat gtggaccaag gatg 24 <210> SEQ ID NO 14 <211> LENGTH: 24 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial Sequence: oligonucleotide; PCR primer <400> SEQUENCE: 14 cttgacccca gaaggagctg tcgg 24 <210> SEQ ID NO 15 <211> LENGTH: 504 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 15 Met Thr Pro Ser Pro Leu Leu Leu Leu Leu Leu Pro Pro Leu Leu Leu 1 5 10 15 Gly Ala Phe Pro Pro Ala Ala Ala Ala Arg Gly Pro Pro Lys Met Ala 20 25 30 Asp Lys Val Val Pro Arg Gln Val Ala Arg Leu Gly Arg Thr Val Arg 35 40 45 Leu Gln Cys Pro Val Glu Gly Asp Pro Pro Pro Leu Thr Met Trp Thr 50 55 60 Lys Asp Gly Arg Thr Ile His Ser Gly Trp Ser Arg Phe Arg Val Leu 65 70 75 80 Pro Gln Gly Leu Lys Val Lys Gln Val Glu Arg Glu Asp Ala Gly Val 85 90 95 Tyr Val Cys Lys Ala Thr Asn Gly Phe Gly Ser Leu Ser Val Asn Tyr 100 105 110 Thr Leu Val Val Leu Asp Asp Ile Ser Pro Gly Lys Glu Ser Leu Gly 115 120 125 Pro Asp Ser Ser Ser Gly Gly Gln Glu Asp Pro Ala Ser Gln Gln Trp 130 135 140 Ala Arg Pro Arg Phe Thr Gln Pro Ser Lys Met Arg Arg Arg Val Ile 145 150 155 160 Ala Arg Pro Val Gly Ser Ser Val Arg Leu Lys Cys Val Ala Ser Gly 165 170 175 His Pro Arg Pro Asp Ile Thr Trp Met Lys Asp Asp Gln Ala Leu Thr 180 185 190 Arg Pro Glu Ala Ala Glu Pro Arg Lys Lys Lys Trp Thr Leu Ser Leu 195 200 205 Lys Asn Leu Arg Pro Glu Asp Ser Gly Lys Tyr Thr Cys Arg Val Ser 210 215 220 Asn Arg Ala Gly Ala Ile Asn Ala Thr Tyr Lys Val Asp Val Ile Gln 225 230 235 240 Arg Thr Arg Ser Lys Pro Val Leu Thr Gly Thr His Pro Val Asn Thr 245 250 255 Thr Val Asp Phe Gly Gly Thr Thr Ser Phe Gln Cys Lys Val Arg Ser 260 265 270 Asp Val Lys Pro Val Ile Gln Trp Leu Lys Arg Val Glu Tyr Gly Ala 275 280 285 Glu Gly Arg His Asn Ser Thr Ile Asp Val Gly Gly Gln Lys Phe Val 290 295 300 Val Leu Pro Thr Gly Asp Val Trp Ser Arg Pro Asp Gly Ser Tyr Leu 305 310 315 320 Asn Lys Leu Leu Ile Thr Arg Ala Arg Gln Asp Asp Ala Gly Met Tyr 325 330 335 Ile Cys Leu Gly Ala Asn Thr Met Gly Tyr Ser Phe Arg Ser Ala Phe 340 345 350 Leu Thr Val Leu Pro Asp Pro Lys Pro Pro Gly Pro Pro Val Ala Ser 355 360 365 Ser Ser Ser Ala Thr Ser Leu Pro Trp Pro Val Val Ile Gly Ile Pro 370 375 380 Ala Gly Ala Val Phe Ile Leu Gly Thr Leu Leu Leu Trp Leu Cys Gln 385 390 395 400 Ala Gln Lys Lys Pro Cys Thr Pro Ala Pro Ala Pro Pro Leu Pro Gly 405 410 415 His Arg Pro Pro Gly Thr Ala Arg Asp Arg Ser Gly Asp Lys Asp Leu 420 425 430 Pro Ser Leu Ala Ala Leu Ser Ala Gly Pro Gly Val Gly Leu Cys Glu 435 440 445 Glu His Gly Ser Pro Ala Ala Pro Gln His Leu Leu Gly Pro Gly Pro 450 455 460 Val Ala Gly Pro Lys Leu Tyr Pro Lys Leu Tyr Thr Asp Ile His Thr 465 470 475 480 His Thr His Thr His Ser His Thr His Ser His Val Glu Gly Lys Val 485 490 495 His Gln His Ile His Tyr Gln Cys 500 <210> SEQ ID NO 16 <211> LENGTH: 3112 <212> TYPE: DNA <213> ORGANISM: Homo sapiens <220> FEATURE: <221> NAME/KEY: CDS <222> LOCATION: (25)..(1536) <400> SEQUENCE: 16 gaccccaggt ccggacaggc cgag atg acg ccg agc ccc ctg ttg ctg ctc 51 Met Thr Pro Ser Pro Leu Leu Leu Leu 1 5 ctg ctg ccg ccg ctg ctg ctg ggg gcc ttc cca ccg gcc gcc gcc gcc 99 Leu Leu Pro Pro Leu Leu Leu Gly Ala Phe Pro Pro Ala Ala Ala Ala 10 15 20 25 cga ggc ccc cca aag atg gcg gac aag gtg gtc cca cgg cag gtg gcc 147 Arg Gly Pro Pro Lys Met Ala Asp Lys Val Val Pro Arg Gln Val Ala 30 35 40 cgg ctg ggc cgc act gtg cgg ctg cag tgc cca gtg gag ggg gac ccg 195 Arg Leu Gly Arg Thr Val Arg Leu Gln Cys Pro Val Glu Gly Asp Pro 45 50 55 ccg ccg ctg acc atg tgg acc aag gat ggc cgc acc atc cac agc ggc 243 Pro Pro Leu Thr Met Trp Thr Lys Asp Gly Arg Thr Ile His Ser Gly 60 65 70 tgg agc cgc ttc cgc gtg ctg ccg cag ggg ctg aag gtg aag cag gtg 291 Trp Ser Arg Phe Arg Val Leu Pro Gln Gly Leu Lys Val Lys Gln Val 75 80 85 gag cgg gag gat gcc ggc gtg tac gtg tgc aag gcc acc aac ggc ttc 339 Glu Arg Glu Asp Ala Gly Val Tyr Val Cys Lys Ala Thr Asn Gly Phe 90 95 100 105 ggc agc ctt agc gtc aac tac acc ctc gtc gtg ctg gat gac att agc 387 Gly Ser Leu Ser Val Asn Tyr Thr Leu Val Val Leu Asp Asp Ile Ser 110 115 120 cca ggg aag gag agc ctg ggg ccc gac agc tcc tct ggg ggt caa gag 435 Pro Gly Lys Glu Ser Leu Gly Pro Asp Ser Ser Ser Gly Gly Gln Glu 125 130 135 gac ccc gcc agc cag cag tgg gca cga ccg cgc ttc aca cag ccc tcc 483 Asp Pro Ala Ser Gln Gln Trp Ala Arg Pro Arg Phe Thr Gln Pro Ser 140 145 150 aag atg agg cgc cgg gtg atc gca cgg ccc gtg ggt agc tcc gtg cgg 531 Lys Met Arg Arg Arg Val Ile Ala Arg Pro Val Gly Ser Ser Val Arg 155 160 165 ctc aag tgc gtg gcc agc ggg cac cct cgg ccc gac atc acg tgg atg 579 Leu Lys Cys Val Ala Ser Gly His Pro Arg Pro Asp Ile Thr Trp Met 170 175 180 185 aag gac gac cag gcc ttg acg cgc cca gag gcc gct gag ccc agg aag 627 Lys Asp Asp Gln Ala Leu Thr Arg Pro Glu Ala Ala Glu Pro Arg Lys 190 195 200 aag aag tgg aca ctg agc ctg aag aac ctg cgg ccg gag gac agc ggc 675 Lys Lys Trp Thr Leu Ser Leu Lys Asn Leu Arg Pro Glu Asp Ser Gly 205 210 215 aaa tac acc tgc cgc gtg tcg aac cgc gcg ggc gcc atc aac gcc acc 723 Lys Tyr Thr Cys Arg Val Ser Asn Arg Ala Gly Ala Ile Asn Ala Thr 220 225 230 tac aag gtg gat gtg atc cag cgg acc cgt tcc aag ccc gtg ctc aca 771 Tyr Lys Val Asp Val Ile Gln Arg Thr Arg Ser Lys Pro Val Leu Thr 235 240 245 ggc acg cac ccc gtg aac acg acg gtg gac ttc ggg ggg acc acg tcc 819 Gly Thr His Pro Val Asn Thr Thr Val Asp Phe Gly Gly Thr Thr Ser 250 255 260 265 ttc cag tgc aag gtg cgc agc gac gtg aag ccg gtg atc cag tgg ctg 867 Phe Gln Cys Lys Val Arg Ser Asp Val Lys Pro Val Ile Gln Trp Leu 270 275 280 aag cgc gtg gag tac ggc gcc gag ggc cgc cac aac tcc acc atc gat 915 Lys Arg Val Glu Tyr Gly Ala Glu Gly Arg His Asn Ser Thr Ile Asp 285 290 295 gtg ggc ggc cag aag ttt gtg gtg ctg ccc acg ggt gac gtg tgg tcg 963 Val Gly Gly Gln Lys Phe Val Val Leu Pro Thr Gly Asp Val Trp Ser 300 305 310 cgg ccc gac ggc tcc tac ctc aat aag ctg ctc atc acc cgt gcc cgc 1011 Arg Pro Asp Gly Ser Tyr Leu Asn Lys Leu Leu Ile Thr Arg Ala Arg 315 320 325 cag gac gat gcg ggc atg tac atc tgc ctt ggc gcc aac acc atg ggc 1059 Gln Asp Asp Ala Gly Met Tyr Ile Cys Leu Gly Ala Asn Thr Met Gly 330 335 340 345 tac agc ttc cgc agc gcc ttc ctc acc gtg ctg cca gac cca aaa ccg 1107 Tyr Ser Phe Arg Ser Ala Phe Leu Thr Val Leu Pro Asp Pro Lys Pro 350 355 360 caa ggg cca cct gtg gcc tcc tcg tcc tcg gcc act agc ctg ccg tgg 1155 Gln Gly Pro Pro Val Ala Ser Ser Ser Ser Ala Thr Ser Leu Pro Trp 365 370 375 ccc gtg gtc atc ggc atc cca gcc ggc gct gtc ttc atc ctg ggc acc 1203 Pro Val Val Ile Gly Ile Pro Ala Gly Ala Val Phe Ile Leu Gly Thr 380 385 390 ctg ctc ctg tgg ctt tgc cag gcc cag aag aag ccg tgc acc ccc gcg 1251 Leu Leu Leu Trp Leu Cys Gln Ala Gln Lys Lys Pro Cys Thr Pro Ala 395 400 405 cct gcc cct ccc ctg cct ggg cac cgc ccg ccg ggg acg gcc ctc gac 1299 Pro Ala Pro Pro Leu Pro Gly His Arg Pro Pro Gly Thr Ala Leu Asp 410 415 420 425 cgc agc gga gac aag gac ctt ccc tcg ttg gcc gcc ctc agc gct ggc 1347 Arg Ser Gly Asp Lys Asp Leu Pro Ser Leu Ala Ala Leu Ser Ala Gly 430 435 440 cct ggt gtg ggg ctg tgt gag gag cat ggg tct ccg gca gcc ccc cag 1395 Pro Gly Val Gly Leu Cys Glu Glu His Gly Ser Pro Ala Ala Pro Gln 445 450 455 cac tta ctg ggc cca ggc cca gtt gct ggc cct aag ttg tac ccc aaa 1443 His Leu Leu Gly Pro Gly Pro Val Ala Gly Pro Lys Leu Tyr Pro Lys 460 465 470 ctc tac aca gac atc cac aca cac aca cac aca cac tct cac aca cac 1491 Leu Tyr Thr Asp Ile His Thr His Thr His Thr His Ser His Thr His 475 480 485 tca cac gtg gag ggc aag gtc cac cag cac atc cac tat cag tgc 1536 Ser His Val Glu Gly Lys Val His Gln His Ile His Tyr Gln Cys 490 495 500 tagacggcac cgtatctgca gtgggcacgg gggggccggc cagacaggca gactgggagg 1596 atggaggacg gagctgcaga cgaaggcagg ggacccatgg cgaggaggaa tggccagcac 1656 cccaggcagt ctgtgtgtga ggcatagccc ctggacacac acacacagac acacacacta 1716 cctggatgca tgtatgcaca cacatgcgcg cacacgtgct ccctgaaggc acacgtacgc 1776 acacacgcac atgcacagat atgccgcctg ggcacacaga taagctgccc aaatgcacgc 1836 acacgcacag agacatgcca gaacatacaa ggacatgctg cctgaacata cacacgcaca 1896 cccatgcgca gatgtgctgc ctggacacac acacacacac ggatatgctg tctggacgca 1956 cacacgtgca gatatggtat ccggacacac acgtgcacag atatgctgcc tggacacaca 2016 gataatgctg ccttgacaca cacatgcacg gatattgcct ggacacacac acacacacgc 2076 gtgcacagat atgctgtctg gacaggcaca cacatgcaga tatgctgcct ggacacacac 2136 ttccagacac acgtgcacag gcgcagatat gctgcctgga cacacgcaga tatgctgtct 2196 agtcacacac acacgcagac atgctgtccg gacacacaca cgcatgcaca gatatgctgt 2256 ccggacacac acacgcacgc agatatgctg cctggacaca cacacagata atgctgcctc 2316 aacactcaca cacgtgcaga tattgcctgg acacacacat gtgcacagat atgctgtctg 2376 gacatgcaca cacgtgcaga tatgctgtcc ggatacacac gcacgcacac atgcagatat 2436 gctgcctggg cacacacttc cggacacaca tgcacacaca ggtgcagata tgctgcctgg 2496 acacacgcag actgacgtgc ttttgggagg gtgtgccgtg aagcctgcag tacgtgtgcc 2556 gtgaggctca tagttgatga gggactttcc ctgctccacc gtcactcccc caactctgcc 2616 cgcctctgtc cccgcctcag tccccgcctc catccccgcc tctgtcccct ggccttggcg 2676 gctatttttg ccacctgcct tgggtgccca ggagtcccct actgctgtgg gctggggttg 2736 ggggcacagc agccccaagc ctgagaggct ggagcccatg gctagtggct catccccact 2796 gcattctccc cctgacacag agaaggggcc ttggtattta tatttaagaa atgaagataa 2856 tattaataat gatggaagga agactgggtt gcagggactg tggtctctcc tggggcccgg 2916 gacccgcctg gtctttcagc catgctgatg accacacccc gtccaggcca gacaccaccc 2976 cccaccccac tgtcgtggtg gccccagatc tctgtaattt tatgtagagt ttgagctgaa 3036 gccccgtata tttaatttat tttgttaaac atgaaagtgc atcctttccc tccaaaaaaa 3096 aaaaaaaaaa aaaaaa 3112 <210> SEQ ID NO 17 <211> LENGTH: 504 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 17 Met Thr Pro Ser Pro Leu Leu Leu Leu Leu Leu Pro Pro Leu Leu Leu 1 5 10 15 Gly Ala Phe Pro Pro Ala Ala Ala Ala Arg Gly Pro Pro Lys Met Ala 20 25 30 Asp Lys Val Val Pro Arg Gln Val Ala Arg Leu Gly Arg Thr Val Arg 35 40 45 Leu Gln Cys Pro Val Glu Gly Asp Pro Pro Pro Leu Thr Met Trp Thr 50 55 60 Lys Asp Gly Arg Thr Ile His Ser Gly Trp Ser Arg Phe Arg Val Leu 65 70 75 80 Pro Gln Gly Leu Lys Val Lys Gln Val Glu Arg Glu Asp Ala Gly Val 85 90 95 Tyr Val Cys Lys Ala Thr Asn Gly Phe Gly Ser Leu Ser Val Asn Tyr 100 105 110 Thr Leu Val Val Leu Asp Asp Ile Ser Pro Gly Lys Glu Ser Leu Gly 115 120 125 Pro Asp Ser Ser Ser Gly Gly Gln Glu Asp Pro Ala Ser Gln Gln Trp 130 135 140 Ala Arg Pro Arg Phe Thr Gln Pro Ser Lys Met Arg Arg Arg Val Ile 145 150 155 160 Ala Arg Pro Val Gly Ser Ser Val Arg Leu Lys Cys Val Ala Ser Gly 165 170 175 His Pro Arg Pro Asp Ile Thr Trp Met Lys Asp Asp Gln Ala Leu Thr 180 185 190 Arg Pro Glu Ala Ala Glu Pro Arg Lys Lys Lys Trp Thr Leu Ser Leu 195 200 205 Lys Asn Leu Arg Pro Glu Asp Ser Gly Lys Tyr Thr Cys Arg Val Ser 210 215 220 Asn Arg Ala Gly Ala Ile Asn Ala Thr Tyr Lys Val Asp Val Ile Gln 225 230 235 240 Arg Thr Arg Ser Lys Pro Val Leu Thr Gly Thr His Pro Val Asn Thr 245 250 255 Thr Val Asp Phe Gly Gly Thr Thr Ser Phe Gln Cys Lys Val Arg Ser 260 265 270 Asp Val Lys Pro Val Ile Gln Trp Leu Lys Arg Val Glu Tyr Gly Ala 275 280 285 Glu Gly Arg His Asn Ser Thr Ile Asp Val Gly Gly Gln Lys Phe Val 290 295 300 Val Leu Pro Thr Gly Asp Val Trp Ser Arg Pro Asp Gly Ser Tyr Leu 305 310 315 320 Asn Lys Leu Leu Ile Thr Arg Ala Arg Gln Asp Asp Ala Gly Met Tyr 325 330 335 Ile Cys Leu Gly Ala Asn Thr Met Gly Tyr Ser Phe Arg Ser Ala Phe 340 345 350 Leu Thr Val Leu Pro Asp Pro Lys Pro Gln Gly Pro Pro Val Ala Ser 355 360 365 Ser Ser Ser Ala Thr Ser Leu Pro Trp Pro Val Val Ile Gly Ile Pro 370 375 380 Ala Gly Ala Val Phe Ile Leu Gly Thr Leu Leu Leu Trp Leu Cys Gln 385 390 395 400 Ala Gln Lys Lys Pro Cys Thr Pro Ala Pro Ala Pro Pro Leu Pro Gly 405 410 415 His Arg Pro Pro Gly Thr Ala Leu Asp Arg Ser Gly Asp Lys Asp Leu 420 425 430 Pro Ser Leu Ala Ala Leu Ser Ala Gly Pro Gly Val Gly Leu Cys Glu 435 440 445 Glu His Gly Ser Pro Ala Ala Pro Gln His Leu Leu Gly Pro Gly Pro 450 455 460 Val Ala Gly Pro Lys Leu Tyr Pro Lys Leu Tyr Thr Asp Ile His Thr 465 470 475 480 His Thr His Thr His Ser His Thr His Ser His Val Glu Gly Lys Val 485 490 495 His Gln His Ile His Tyr Gln Cys 500 <210> SEQ ID NO 18 <211> LENGTH: 3080 <212> TYPE: DNA <213> ORGANISM: Homo sapiens <220> FEATURE: <221> NAME/KEY: CDS <222> LOCATION: (23)..(1534) <400> SEQUENCE: 18 ccccaggtcc ggacaggccg ag atg acg ccg agc ccc ctg ttg ctg ctc ctg 52 Met Thr Pro Ser Pro Leu Leu Leu Leu Leu 1 5 10 ctg ccg ccg ctg ctg ctg ggg gcc ttc cca ccg gcc gcc gcc gcc cga 100 Leu Pro Pro Leu Leu Leu Gly Ala Phe Pro Pro Ala Ala Ala Ala Arg 15 20 25 ggc ccc cca aag atg gcg gac aag gtg gtc cca cgg cag gtg gcc cgg 148 Gly Pro Pro Lys Met Ala Asp Lys Val Val Pro Arg Gln Val Ala Arg 30 35 40 ctg ggc cgc act gtg cgg ctg cag tgc cca gtg gag ggg gac ccg ccg 196 Leu Gly Arg Thr Val Arg Leu Gln Cys Pro Val Glu Gly Asp Pro Pro 45 50 55 ccg ctg acc atg tgg acc aag gat ggc cgc acc atc cac agc ggc tgg 244 Pro Leu Thr Met Trp Thr Lys Asp Gly Arg Thr Ile His Ser Gly Trp 60 65 70 agc cgc ttc cgc gtg ctg ccg cag ggg ctg aag gtg aag cag gtg gag 292 Ser Arg Phe Arg Val Leu Pro Gln Gly Leu Lys Val Lys Gln Val Glu 75 80 85 90 cgg gag gat gcc ggc gtg tac gtg tgc aag gcc acc aac ggc ttc ggc 340 Arg Glu Asp Ala Gly Val Tyr Val Cys Lys Ala Thr Asn Gly Phe Gly 95 100 105 agc ctt agc gtc aac tac acc ctc gtc gtg ctg gat gac att agc cca 388 Ser Leu Ser Val Asn Tyr Thr Leu Val Val Leu Asp Asp Ile Ser Pro 110 115 120 ggg aag gag agc ctg ggg ccc gac agc tcc tct ggg ggt caa gag gac 436 Gly Lys Glu Ser Leu Gly Pro Asp Ser Ser Ser Gly Gly Gln Glu Asp 125 130 135 ccc gcc agc cag cag tgg gca cga ccg cgc ttc aca cag ccc tcc aag 484 Pro Ala Ser Gln Gln Trp Ala Arg Pro Arg Phe Thr Gln Pro Ser Lys 140 145 150 atg agg cgc cgg gtg atc gca cgg ccc gtg ggt agc tcc gtg cgg ctc 532 Met Arg Arg Arg Val Ile Ala Arg Pro Val Gly Ser Ser Val Arg Leu 155 160 165 170 aag tgc gtg gcc agc ggg cac cct cgg ccc gac atc acg tgg atg aag 580 Lys Cys Val Ala Ser Gly His Pro Arg Pro Asp Ile Thr Trp Met Lys 175 180 185 gac gac cag gcc ttg acg cgc cca gag gcc gct gag ccc agg aag aag 628 Asp Asp Gln Ala Leu Thr Arg Pro Glu Ala Ala Glu Pro Arg Lys Lys 190 195 200 aag tgg aca ctg agc ctg aag aac ctg cgg ccg gag gac agc ggc aaa 676 Lys Trp Thr Leu Ser Leu Lys Asn Leu Arg Pro Glu Asp Ser Gly Lys 205 210 215 tac acc tgc cgc gtg tcg aac cgc gcg ggc gcc atc aac gcc acc tac 724 Tyr Thr Cys Arg Val Ser Asn Arg Ala Gly Ala Ile Asn Ala Thr Tyr 220 225 230 aag gtg gat gtg atc cag cgg acc cgt tcc aag ccc gtg ctc aca ggc 772 Lys Val Asp Val Ile Gln Arg Thr Arg Ser Lys Pro Val Leu Thr Gly 235 240 245 250 acg cac ccc gtg aac acg acg gtg gac ttc ggg ggg acc acg tcc ttc 820 Thr His Pro Val Asn Thr Thr Val Asp Phe Gly Gly Thr Thr Ser Phe 255 260 265 cag tgc aag gtg cgc agc gac gtg aag ccg gtg atc cag tgg ctg aag 868 Gln Cys Lys Val Arg Ser Asp Val Lys Pro Val Ile Gln Trp Leu Lys 270 275 280 cgc gtg gag tac ggc gcc gag ggc cgc cac aac tcc acc atc gat gtg 916 Arg Val Glu Tyr Gly Ala Glu Gly Arg His Asn Ser Thr Ile Asp Val 285 290 295 ggc ggc cag aag ttt gtg gtg ctg ccc acg ggt gac gtg tgg tcg cgg 964 Gly Gly Gln Lys Phe Val Val Leu Pro Thr Gly Asp Val Trp Ser Arg 300 305 310 ccc gac ggc tcc tac ctc aat aag ctg ctc atc acc cgt gcc cgc cag 1012 Pro Asp Gly Ser Tyr Leu Asn Lys Leu Leu Ile Thr Arg Ala Arg Gln 315 320 325 330 gac gat gcg ggc atg tac atc tgc ctt ggc gcc aac acc atg ggc tac 1060 Asp Asp Ala Gly Met Tyr Ile Cys Leu Gly Ala Asn Thr Met Gly Tyr 335 340 345 agc ttc cgc agc gcc ttc ctc acc gtg ctg cca gac cca aaa ccg caa 1108 Ser Phe Arg Ser Ala Phe Leu Thr Val Leu Pro Asp Pro Lys Pro Gln 350 355 360 ggg cca cct gtg gcc tcc tcg tcc tcg gcc act agc ctg ccg tgg ccc 1156 Gly Pro Pro Val Ala Ser Ser Ser Ser Ala Thr Ser Leu Pro Trp Pro 365 370 375 gtg gtc atc ggc atc cca gcc ggc gct gtc ttc atc ctg ggc acc ctg 1204 Val Val Ile Gly Ile Pro Ala Gly Ala Val Phe Ile Leu Gly Thr Leu 380 385 390 ctc ctg tgg ctt tgc cag gcc cag aag aag ccg tgc acc ccc gcg cct 1252 Leu Leu Trp Leu Cys Gln Ala Gln Lys Lys Pro Cys Thr Pro Ala Pro 395 400 405 410 gcc cct ccc ctg cct ggg cac cgc ccg ccg ggg acg gcc cgc gac cgc 1300 Ala Pro Pro Leu Pro Gly His Arg Pro Pro Gly Thr Ala Arg Asp Arg 415 420 425 agc gga gac aag gac ctt ccc tcg ttg gcc gcc ctc agc gct ggc cct 1348 Ser Gly Asp Lys Asp Leu Pro Ser Leu Ala Ala Leu Ser Ala Gly Pro 430 435 440 ggt gtg ggg ctg tgt gag gag cat ggg tct ccg gca gcc ccc cag cac 1396 Gly Val Gly Leu Cys Glu Glu His Gly Ser Pro Ala Ala Pro Gln His 445 450 455 tta ctg ggc cca ggc cca gtt gct ggc cct aag ttg tac ccc aaa ctc 1444 Leu Leu Gly Pro Gly Pro Val Ala Gly Pro Lys Leu Tyr Pro Lys Leu 460 465 470 tac aca gac atc cac aca cac aca cac aca cac tct cac aca cac tca 1492 Tyr Thr Asp Ile His Thr His Thr His Thr His Ser His Thr His Ser 475 480 485 490 cac gtg gag ggc aag gtc cac cag cac atc cac tat cag tgc 1534 His Val Glu Gly Lys Val His Gln His Ile His Tyr Gln Cys 495 500 tagacggcac cgtatctgca gtgggcacgg gggggccggc cagacaggca gactgggagg 1594 atggaggacg gagctgcaga cgaaggcagg ggacccatgg cgaggaggaa tggccagcac 1654 cccaggcagt ctgtgtgtga ggcatagccc ctggacacac acacacagac acacacacta 1714 cctggatgca tgtatgcaca cacatgcgcg cacacgtgct ccctgaaggc acacgtacgc 1774 acacacgcac atgcacagat atgccgcctg ggcacacaga taagctgccc aaatgcacgc 1834 acacgcacag agacatgcca gaacatacaa ggacatgctg cctgaacata cacacgcaca 1894 cccatgcgca gatgtgctgc ctggacacac acacacacac ggatatgctg tctggacgca 1954 cacacgtgca gatatggtat ccggacacac acgtgcacag atatgctgcc tggacacaca 2014 gataatgctg ccttgacaca cacatgcacg gatattgcct ggacacacac acacacacgc 2074 gtgcacagat atgctgtctg gacaggcaca cacatgcaga tatgctgcct ggacacacac 2134 ttccagacac acgtgcacag gcgcagatat gctgcctgga cacacgcaga tatgctgtct 2194 agtcacacac acacgcagac atgctgtccg gacacacaca cgcatgcaca gatatgctgt 2254 ccggacacac acacgcacgc agatatgctg cctggacaca cacacagata atgctgcctc 2314 aacactcaca cacgtgcaga tattgcctgg acacacacat gtgcacagat atgctgtctg 2374 gacatgcaca cacgtgcaga tatgctgtcc ggatacacac gcacgcacac atgcagatat 2434 gctgcctggg cacacacttc cggacacaca tgcacacaca ggtgcagata tgctgcctgg 2494 acacacgcag actgacgtgc ttttgggagg gtgtgccgtg aagcctgcag tacgtgtgcc 2554 gtgaggctca tagttgatga gggactttcc ctgctccacc gtcactcccc caactctgcc 2614 cgcctctgtc cccgcctcag tccccgcctc catccccgcc tctgtcccct ggccttggcg 2674 gctatttttg ccacctgcct tgggtgccca ggagtcccct actgctgtgg gctggggttg 2734 ggggcacagc agccccaagc ctgagaggct ggagcccatg gctagtggct catccccact 2794 gcattctccc cctgacacag agaaggggcc ttggtattta tatttaagaa atgaagataa 2854 tattaataat gatggaagga agactgggtt gcagggactg tggtctctcc tggggcccgg 2914 gacccgcctg gtctttcagc catgctgatg accacacccc gtccaggcca gacaccaccc 2974 cccaccccac tgtcgtggtg gccccagatc tctgtaattt tatgtagagt ttgagctgaa 3034 gccccgtata tttaatttat tttgttaaac atgaaagtgc atcctt 3080 <210> SEQ ID NO 19 <211> LENGTH: 504 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 19 Met Thr Pro Ser Pro Leu Leu Leu Leu Leu Leu Pro Pro Leu Leu Leu 1 5 10 15 Gly Ala Phe Pro Pro Ala Ala Ala Ala Arg Gly Pro Pro Lys Met Ala 20 25 30 Asp Lys Val Val Pro Arg Gln Val Ala Arg Leu Gly Arg Thr Val Arg 35 40 45 Leu Gln Cys Pro Val Glu Gly Asp Pro Pro Pro Leu Thr Met Trp Thr 50 55 60 Lys Asp Gly Arg Thr Ile His Ser Gly Trp Ser Arg Phe Arg Val Leu 65 70 75 80 Pro Gln Gly Leu Lys Val Lys Gln Val Glu Arg Glu Asp Ala Gly Val 85 90 95 Tyr Val Cys Lys Ala Thr Asn Gly Phe Gly Ser Leu Ser Val Asn Tyr 100 105 110 Thr Leu Val Val Leu Asp Asp Ile Ser Pro Gly Lys Glu Ser Leu Gly 115 120 125 Pro Asp Ser Ser Ser Gly Gly Gln Glu Asp Pro Ala Ser Gln Gln Trp 130 135 140 Ala Arg Pro Arg Phe Thr Gln Pro Ser Lys Met Arg Arg Arg Val Ile 145 150 155 160 Ala Arg Pro Val Gly Ser Ser Val Arg Leu Lys Cys Val Ala Ser Gly 165 170 175 His Pro Arg Pro Asp Ile Thr Trp Met Lys Asp Asp Gln Ala Leu Thr 180 185 190 Arg Pro Glu Ala Ala Glu Pro Arg Lys Lys Lys Trp Thr Leu Ser Leu 195 200 205 Lys Asn Leu Arg Pro Glu Asp Ser Gly Lys Tyr Thr Cys Arg Val Ser 210 215 220 Asn Arg Ala Gly Ala Ile Asn Ala Thr Tyr Lys Val Asp Val Ile Gln 225 230 235 240 Arg Thr Arg Ser Lys Pro Val Leu Thr Gly Thr His Pro Val Asn Thr 245 250 255 Thr Val Asp Phe Gly Gly Thr Thr Ser Phe Gln Cys Lys Val Arg Ser 260 265 270 Asp Val Lys Pro Val Ile Gln Trp Leu Lys Arg Val Glu Tyr Gly Ala 275 280 285 Glu Gly Arg His Asn Ser Thr Ile Asp Val Gly Gly Gln Lys Phe Val 290 295 300 Val Leu Pro Thr Gly Asp Val Trp Ser Arg Pro Asp Gly Ser Tyr Leu 305 310 315 320 Asn Lys Leu Leu Ile Thr Arg Ala Arg Gln Asp Asp Ala Gly Met Tyr 325 330 335 Ile Cys Leu Gly Ala Asn Thr Met Gly Tyr Ser Phe Arg Ser Ala Phe 340 345 350 Leu Thr Val Leu Pro Asp Pro Lys Pro Gln Gly Pro Pro Val Ala Ser 355 360 365 Ser Ser Ser Ala Thr Ser Leu Pro Trp Pro Val Val Ile Gly Ile Pro 370 375 380 Ala Gly Ala Val Phe Ile Leu Gly Thr Leu Leu Leu Trp Leu Cys Gln 385 390 395 400 Ala Gln Lys Lys Pro Cys Thr Pro Ala Pro Ala Pro Pro Leu Pro Gly 405 410 415 His Arg Pro Pro Gly Thr Ala Arg Asp Arg Ser Gly Asp Lys Asp Leu 420 425 430 Pro Ser Leu Ala Ala Leu Ser Ala Gly Pro Gly Val Gly Leu Cys Glu 435 440 445 Glu His Gly Ser Pro Ala Ala Pro Gln His Leu Leu Gly Pro Gly Pro 450 455 460 Val Ala Gly Pro Lys Leu Tyr Pro Lys Leu Tyr Thr Asp Ile His Thr 465 470 475 480 His Thr His Thr His Ser His Thr His Ser His Val Glu Gly Lys Val 485 490 495 His Gln His Ile His Tyr Gln Cys 500 <210> SEQ ID NO 20 <211> LENGTH: 342 <212> TYPE: PRT <213> ORGANISM: Mus musculus <400> SEQUENCE: 20 Met Ala Asp Lys Val Val Pro Arg Gln Val Ala Arg Leu Gly Arg Thr 1 5 10 15 Val Arg Leu Gln Cys Pro Val Glu Gly Asp Pro Pro Pro Leu Thr Met 20 25 30 Trp Thr Lys Asp Gly Arg Thr Ile His Ser Gly Trp Ser Arg Phe Arg 35 40 45 Val Leu Pro Gln Gly Leu Lys Val Lys Glu Val Glu Ala Glu Asp Ala 50 55 60 Gly Val Tyr Val Cys Lys Ala Thr Asn Gly Phe Gly Ser Leu Ser Val 65 70 75 80 Asn Tyr Thr Leu Ile Ile Met Asp Asp Ile Ser Pro Gly Lys Glu Ser 85 90 95 Pro Gly Pro Gly Gly Ser Ser Gly Gly Gln Glu Asp Pro Ala Ser Gln 100 105 110 Gln Trp Ala Arg Pro Arg Phe Thr Gln Pro Ser Lys Met Arg Arg Arg 115 120 125 Val Ile Ala Arg Pro Val Gly Ser Ser Val Arg Leu Lys Cys Val Ala 130 135 140 Ser Gly His Pro Arg Pro Asp Ile Met Trp Met Lys Asp Asp Gln Thr 145 150 155 160 Leu Thr His Leu Glu Ala Ser Glu His Arg Lys Lys Lys Trp Thr Leu 165 170 175 Ser Leu Lys Asn Leu Lys Pro Glu Asp Ser Gly Lys Tyr Thr Cys Arg 180 185 190 Val Ser Asn Lys Ala Gly Ala Ile Asn Ala Thr Tyr Lys Val Asp Val 195 200 205 Ile Gln Arg Thr Arg Ser Lys Pro Val Leu Thr Gly Thr His Pro Val 210 215 220 Asn Thr Thr Val Asp Phe Gly Gly Thr Thr Ser Phe Gln Cys Lys Val 225 230 235 240 Arg Ser Asp Val Lys Pro Val Ile Gln Trp Leu Lys Arg Val Glu Tyr 245 250 255 Gly Ser Glu Gly Arg His Asn Ser Thr Ile Asp Val Gly Gly Gln Lys 260 265 270 Phe Val Val Leu Pro Thr Gly Asp Val Trp Ser Arg Pro Asp Gly Ser 275 280 285 Tyr Leu Asn Lys Leu Leu Ile Ser Arg Ala Arg Gln Asp Asp Ala Gly 290 295 300 Met Tyr Ile Cys Leu Gly Ala Asn Thr Met Gly Tyr Ser Phe Arg Ser 305 310 315 320 Ala Phe Leu Thr Val Leu Pro Asp Pro Lys Pro Pro Gly Pro Pro Met 325 330 335 Ala Ser Ser Ser Ser Ser 340 <210> SEQ ID NO 21 <211> LENGTH: 1788 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial Sequence: murine FGFR-L extracellular domain-Fc fusion polypeptide <221> NAME/KEY: CDS <222> LOCATION: (1)..(1782) <400> SEQUENCE: 21 atg acg cgg agc ccc gcg ctg ctg ctg ctg cta ttg ggg gcc ctc ccg 48 Met Thr Arg Ser Pro Ala Leu Leu Leu Leu Leu Leu Gly Ala Leu Pro 1 5 10 15 tcg gct gag gcg gcg cga gga ccc cca aga atg gca gac aaa gtg gtc 96 Ser Ala Glu Ala Ala Arg Gly Pro Pro Arg Met Ala Asp Lys Val Val 20 25 30 cca cgg cag gtg gcc cgc ctg ggc cgc act gtg cgg cta cag tgc cca 144 Pro Arg Gln Val Ala Arg Leu Gly Arg Thr Val Arg Leu Gln Cys Pro 35 40 45 gtg gag ggg gac cca cca ccg ttg acc atg tgg acc aaa gat ggc cgc 192 Val Glu Gly Asp Pro Pro Pro Leu Thr Met Trp Thr Lys Asp Gly Arg 50 55 60 aca atc cac agt ggc tgg agc cgc ttc cgt gtg ctg ccc cag ggt ctg 240 Thr Ile His Ser Gly Trp Ser Arg Phe Arg Val Leu Pro Gln Gly Leu 65 70 75 80 aag gtg aag gag gtg gag gcc gag gat gcc ggt gtt tat gtg tgc aag 288 Lys Val Lys Glu Val Glu Ala Glu Asp Ala Gly Val Tyr Val Cys Lys 85 90 95 gcc acc aat ggc ttt ggc agc ctc agc gtc aac tac act ctc atc atc 336 Ala Thr Asn Gly Phe Gly Ser Leu Ser Val Asn Tyr Thr Leu Ile Ile 100 105 110 atg gat gat att agt cca ggg aag gag agc cct ggg cca ggt ggt tct 384 Met Asp Asp Ile Ser Pro Gly Lys Glu Ser Pro Gly Pro Gly Gly Ser 115 120 125 tcg ggg ggc cag gag gac cca gcc agc cag cag tgg gca cgg cct cgc 432 Ser Gly Gly Gln Glu Asp Pro Ala Ser Gln Gln Trp Ala Arg Pro Arg 130 135 140 ttc aca cag ccc tcc aag atg agg cgc cga gtg att gca cgg cct gtg 480 Phe Thr Gln Pro Ser Lys Met Arg Arg Arg Val Ile Ala Arg Pro Val 145 150 155 160 ggt agc tct gtg cgg ctc aag tgt gtg gcc agt ggg cac cca cgg cca 528 Gly Ser Ser Val Arg Leu Lys Cys Val Ala Ser Gly His Pro Arg Pro 165 170 175 gac atc atg tgg atg aag gat gac cag acc ttg acg cat cta gag gct 576 Asp Ile Met Trp Met Lys Asp Asp Gln Thr Leu Thr His Leu Glu Ala 180 185 190 agt gaa cac aga aag aag aag tgg aca ctg agc ttg aag aac ctg aag 624 Ser Glu His Arg Lys Lys Lys Trp Thr Leu Ser Leu Lys Asn Leu Lys 195 200 205 cct gaa gac agt ggc aag tac acg tgc cgt gta tct aac aag gcc ggt 672 Pro Glu Asp Ser Gly Lys Tyr Thr Cys Arg Val Ser Asn Lys Ala Gly 210 215 220 gcc atc aac gcc acc tac aaa gtg gat gta atc cag cgg act cgt tcc 720 Ala Ile Asn Ala Thr Tyr Lys Val Asp Val Ile Gln Arg Thr Arg Ser 225 230 235 240 aag cct gtg ctc aca ggg aca cac cct gtg aac aca acg gtg gac ttc 768 Lys Pro Val Leu Thr Gly Thr His Pro Val Asn Thr Thr Val Asp Phe 245 250 255 ggt ggg aca acg tcc ttc cag tgc aag gtg cgc agt gac gtg aag cct 816 Gly Gly Thr Thr Ser Phe Gln Cys Lys Val Arg Ser Asp Val Lys Pro 260 265 270 gtg atc cag tgg ctg aag cgg gtg gag tac ggc tcc gag gga cgc cac 864 Val Ile Gln Trp Leu Lys Arg Val Glu Tyr Gly Ser Glu Gly Arg His 275 280 285 aac tcc acc att gat gtg ggt ggc cag aag ttt gtg gtg ttg ccc acg 912 Asn Ser Thr Ile Asp Val Gly Gly Gln Lys Phe Val Val Leu Pro Thr 290 295 300 ggt gat gtg tgg tca cgg cct gat ggc tcc tac ctc aac aag ctg ctc 960 Gly Asp Val Trp Ser Arg Pro Asp Gly Ser Tyr Leu Asn Lys Leu Leu 305 310 315 320 atc tct cgg gcc cgc cag gat gat gct ggc atg tac atc tgc cta ggt 1008 Ile Ser Arg Ala Arg Gln Asp Asp Ala Gly Met Tyr Ile Cys Leu Gly 325 330 335 gca aat acc atg ggc tac agt ttc cgt agc gcc ttc ctc act gta tta 1056 Ala Asn Thr Met Gly Tyr Ser Phe Arg Ser Ala Phe Leu Thr Val Leu 340 345 350 cca gac ccc aaa cct cca ggg cct cct atg gct tct tca tcg gtc gac 1104 Pro Asp Pro Lys Pro Pro Gly Pro Pro Met Ala Ser Ser Ser Val Asp 355 360 365 aaa act cac aca tgc cca ccg tgc cca gca cct gaa ctc ctg ggg gga 1152 Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly 370 375 380 ccg tca gtc ttc ctc ttc ccc cca aaa ccc aag gac acc ctc atg atc 1200 Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile 385 390 395 400 tcc cgg acc cct gag gtc aca tgc gtg gtg gtg gac gtg agc cac gaa 1248 Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu 405 410 415 gac cct gag gtc aag ttc aac tgg tac gtg gac ggc gtg gag gtg cat 1296 Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His 420 425 430 aat gcc aag aca aag ccg cgg gag gag cag tac aac agc acg tac cgt 1344 Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg 435 440 445 gtg gtc agc gtc ctc acc gtc ctg cac cag gac tgg ctg aat ggc aag 1392 Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys 450 455 460 gag tac aag tgc aag gtc tcc aac aaa gcc ctc cca gcc ccc atc gag 1440 Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu 465 470 475 480 aaa acc atc tcc aaa gcc aaa ggg cag ccc cga gaa cca cag gtg tac 1488 Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr 485 490 495 acc ctg ccc cca tcc cgg gat gag ctg acc aag aac cag gtc agc ctg 1536 Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser Leu 500 505 510 acc tgc ctg gtc aaa ggc ttc tat ccc agc gac atc gcc gtg gag tgg 1584 Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp 515 520 525 gag agc aat ggg cag ccg gag aac aac tac aag acc acg cct ccc gtg 1632 Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val 530 535 540 ctg gac tcc gac ggc tcc ttc ttc ctc tac agc aag ctc acc gtg gac 1680 Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp 545 550 555 560 aag agc agg tgg cag cag ggg aac gtc ttc tca tgc tcc gtg atg cat 1728 Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His 565 570 575 gag gct ctg cac aac cac tac acg cag aag agc ctc tcc ctg tct ccg 1776 Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro 580 585 590 ggt aaa tgataa 1788 Gly Lys <210> SEQ ID NO 22 <211> LENGTH: 594 <212> TYPE: PRT <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial Sequence: murine FGFR-L extracellular domain-Fc fusion polypeptide <400> SEQUENCE: 22 Met Thr Arg Ser Pro Ala Leu Leu Leu Leu Leu Leu Gly Ala Leu Pro 1 5 10 15 Ser Ala Glu Ala Ala Arg Gly Pro Pro Arg Met Ala Asp Lys Val Val 20 25 30 Pro Arg Gln Val Ala Arg Leu Gly Arg Thr Val Arg Leu Gln Cys Pro 35 40 45 Val Glu Gly Asp Pro Pro Pro Leu Thr Met Trp Thr Lys Asp Gly Arg 50 55 60 Thr Ile His Ser Gly Trp Ser Arg Phe Arg Val Leu Pro Gln Gly Leu 65 70 75 80 Lys Val Lys Glu Val Glu Ala Glu Asp Ala Gly Val Tyr Val Cys Lys 85 90 95 Ala Thr Asn Gly Phe Gly Ser Leu Ser Val Asn Tyr Thr Leu Ile Ile 100 105 110 Met Asp Asp Ile Ser Pro Gly Lys Glu Ser Pro Gly Pro Gly Gly Ser 115 120 125 Ser Gly Gly Gln Glu Asp Pro Ala Ser Gln Gln Trp Ala Arg Pro Arg 130 135 140 Phe Thr Gln Pro Ser Lys Met Arg Arg Arg Val Ile Ala Arg Pro Val 145 150 155 160 Gly Ser Ser Val Arg Leu Lys Cys Val Ala Ser Gly His Pro Arg Pro 165 170 175 Asp Ile Met Trp Met Lys Asp Asp Gln Thr Leu Thr His Leu Glu Ala 180 185 190 Ser Glu His Arg Lys Lys Lys Trp Thr Leu Ser Leu Lys Asn Leu Lys 195 200 205 Pro Glu Asp Ser Gly Lys Tyr Thr Cys Arg Val Ser Asn Lys Ala Gly 210 215 220 Ala Ile Asn Ala Thr Tyr Lys Val Asp Val Ile Gln Arg Thr Arg Ser 225 230 235 240 Lys Pro Val Leu Thr Gly Thr His Pro Val Asn Thr Thr Val Asp Phe 245 250 255 Gly Gly Thr Thr Ser Phe Gln Cys Lys Val Arg Ser Asp Val Lys Pro 260 265 270 Val Ile Gln Trp Leu Lys Arg Val Glu Tyr Gly Ser Glu Gly Arg His 275 280 285 Asn Ser Thr Ile Asp Val Gly Gly Gln Lys Phe Val Val Leu Pro Thr 290 295 300 Gly Asp Val Trp Ser Arg Pro Asp Gly Ser Tyr Leu Asn Lys Leu Leu 305 310 315 320 Ile Ser Arg Ala Arg Gln Asp Asp Ala Gly Met Tyr Ile Cys Leu Gly 325 330 335 Ala Asn Thr Met Gly Tyr Ser Phe Arg Ser Ala Phe Leu Thr Val Leu 340 345 350 Pro Asp Pro Lys Pro Pro Gly Pro Pro Met Ala Ser Ser Ser Val Asp 355 360 365 Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly 370 375 380 Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile 385 390 395 400 Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu 405 410 415 Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His 420 425 430 Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg 435 440 445 Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys 450 455 460 Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu 465 470 475 480 Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr 485 490 495 Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser Leu 500 505 510 Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp 515 520 525 Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val 530 535 540 Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp 545 550 555 560 Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His 565 570 575 Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro 580 585 590 Gly Lys
Claims (10)
1. An isolated nucleic acid molecule comprising:
(a) the nucleotide sequence as set forth in SEQ ID NO: 4;
(b) a nucleotide sequence encoding the polypeptide as set forth in SEQ ID NO: 5;
(c) a nucleotide sequence which hybridizes under at least moderately stringent conditions to the complement of the nucleotide sequence of either (a) or (b), wherein the encoded polypeptide has an activity of the polypeptide as set forth in SEQ ID NO: 5; or
(d) a nucleotide sequence complementary to the nucleotide sequence of any of (a)-(c).
2. An isolated nucleic acid molecule comprising:
(a) a nucleotide sequence encoding a polypeptide that is at least about 70 percent identical to the polypeptide as set forth in SEQ ID NO: 5, wherein the encoded polypeptide has an activity of the polypeptide set forth in SEQ ID NO: 5;
(b) a nucleotide sequence encoding an allelic variant or splice variant of the nucleotide sequence as set forth in SEQ ID NO: 4, or the nucleotide sequence of (a);
(c) a region of the nucleotide sequence of SEQ ID NO: 4, or the nucleotide sequence of either (a) or (b), encoding a polypeptide fragment of at least about 25 amino acid residues, wherein the polypeptide fragment has an activity of the encoded polypeptide as set forth in SEQ ID NO: 5, or is antigenic;
(d) a region of the nucleotide sequence of SEQ ID NO: 4, or the nucleotide sequence of any of (a)-(c) comprising a fragment of at least about 16 nucleotides;
(e) a nucleotide sequence that hybridizes under at least moderately stringent conditions to the complement of the nucleotide sequence of any of (a)-(d), wherein the encoded polypeptide has an activity of the polypeptide as set forth in SEQ ID NO: 5; or
(f) a nucleotide sequence complementary to the nucleotide sequence of any of (a)-(e).
3. An isolated nucleic acid molecule comprising:
(a) a nucleotide sequence encoding a polypeptide as set forth in SEQ ID NO: 5 with at least one conservative amino acid substitution, wherein the encoded polypeptide has an activity of the polypeptide set forth in SEQ ID NO: 5;
(b) a nucleotide sequence encoding a polypeptide as set forth in SEQ ID NO: 5 with at least one amino acid insertion, wherein the encoded polypeptide has an activity of the polypeptide set forth in SEQ ID NO: 5;
(c) a nucleotide sequence encoding a polypeptide as set forth in SEQ ID NO: 5 with at least one amino acid deletion, wherein the encoded polypeptide has an activity of the polypeptide set forth in SEQ ID NO: 5;
(d) a nucleotide sequence encoding a polypeptide as set forth in SEQ ID NO: 5 which has a C- and/or N-terminal truncation, wherein the encoded polypeptide has an activity of the polypeptide set forth in SEQ ID NO: 5;
(e) a nucleotide sequence encoding a polypeptide as set forth in SEQ ID NO: 5 with at least one modification that is an amino acid substitution, amino acid insertion, amino acid deletion, C-terminal truncation, or N-terminal truncation, wherein the encoded polypeptide has an activity of the polypeptide set forth in SEQ ID NO: 5;
(f) a nucleotide sequence of any of (a)-(e) comprising a fragment of at least about 16 nucleotides;
(g) a nucleotide sequence that hybridizes under at least moderately stringent conditions to the complement of the nucleotide sequence of any of (a)-(f), wherein the encoded polypeptide has an activity of the polypeptide as set forth in SEQ ID NO: 5; or
(h) a nucleotide sequence complementary to any of (a)-(g).
4. A vector comprising the nucleic acid molecule of any of claims 1, 2, or 3.
5. A host cell comprising the vector of claim 4 .
6. The host cell of claim 5 that is a eukaryotic cell.
7. The host cell of claim 5 that is a prokaryotic cell.
8. A process of producing an FGFR-L polypeptide comprising culturing the host cell of claim 5 under suitable conditions to express the polypeptide, and optionally isolating the polypeptide from the culture.
9. The process of claim 8 , wherein the nucleic acid molecule comprises promoter DNA other than the promoter DNA for the native FGFR-L polypeptide operatively linked to the DNA encoding the FGFR-L polypeptide.
10. The isolated nucleic acid molecule according to claim 2 , wherein the percent identity is determined using a computer program that is GAP, BLASTN, FASTA, BLASTA, BLASTX, BestFit, or the Smith-Waterman algorithm.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/229,584 US20030087384A1 (en) | 2000-03-22 | 2002-08-28 | Fibroblast growth factor receptor-like molecules and uses thereof |
US11/838,136 US20080044859A1 (en) | 2000-03-22 | 2007-08-13 | Fibroblast Growth Factor Receptor-Like Molecules and Uses Thereof |
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US19137900P | 2000-03-22 | 2000-03-22 | |
US09/815,108 US7348162B2 (en) | 2000-03-22 | 2001-03-22 | Nucleic acids encoding fibroblast growth factor receptor-like proteins and uses thereof |
US10/229,584 US20030087384A1 (en) | 2000-03-22 | 2002-08-28 | Fibroblast growth factor receptor-like molecules and uses thereof |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/815,108 Division US7348162B2 (en) | 2000-03-22 | 2001-03-22 | Nucleic acids encoding fibroblast growth factor receptor-like proteins and uses thereof |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US11/838,136 Continuation US20080044859A1 (en) | 2000-03-22 | 2007-08-13 | Fibroblast Growth Factor Receptor-Like Molecules and Uses Thereof |
Publications (1)
Publication Number | Publication Date |
---|---|
US20030087384A1 true US20030087384A1 (en) | 2003-05-08 |
Family
ID=22705247
Family Applications (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/815,108 Expired - Fee Related US7348162B2 (en) | 2000-03-22 | 2001-03-22 | Nucleic acids encoding fibroblast growth factor receptor-like proteins and uses thereof |
US10/229,584 Abandoned US20030087384A1 (en) | 2000-03-22 | 2002-08-28 | Fibroblast growth factor receptor-like molecules and uses thereof |
US11/838,136 Abandoned US20080044859A1 (en) | 2000-03-22 | 2007-08-13 | Fibroblast Growth Factor Receptor-Like Molecules and Uses Thereof |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/815,108 Expired - Fee Related US7348162B2 (en) | 2000-03-22 | 2001-03-22 | Nucleic acids encoding fibroblast growth factor receptor-like proteins and uses thereof |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US11/838,136 Abandoned US20080044859A1 (en) | 2000-03-22 | 2007-08-13 | Fibroblast Growth Factor Receptor-Like Molecules and Uses Thereof |
Country Status (7)
Country | Link |
---|---|
US (3) | US7348162B2 (en) |
EP (1) | EP1268793A2 (en) |
JP (3) | JP2003527858A (en) |
AU (1) | AU2001252940A1 (en) |
CA (1) | CA2403572A1 (en) |
MX (1) | MXPA02009224A (en) |
WO (1) | WO2001070977A2 (en) |
Families Citing this family (27)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030143676A1 (en) * | 1999-03-25 | 2003-07-31 | Genesis Research And Development Corporation Limited | Fibroblast growth factor receptors and methods for their use |
US7083791B2 (en) * | 1999-03-25 | 2006-08-01 | Genesis Research & Development Corporation Limited | Methods for enhancing immune responses by fibroblast growth factor receptor 5 polypeptides |
US6797271B2 (en) | 1999-03-25 | 2004-09-28 | Genesis Research & Development Corporation Limited | Methods for enhancing immune responses by fibroblast growth factor receptor 5 polypeptides |
US20020136720A1 (en) * | 2000-11-22 | 2002-09-26 | Chipman Stewart D. | Methods of using IMXP-888 and IMXP-888 antagonists |
US20060115483A1 (en) * | 2002-05-28 | 2006-06-01 | Genesis Research And Development Corporation Limited | Fibroblast growth factor receptors and methods for their use |
US20070110752A1 (en) * | 2002-05-28 | 2007-05-17 | Genesis Research And Development Corporation Limited | Fibroblast growth factor receptors and methods for their use |
US20050009750A1 (en) * | 2003-07-03 | 2005-01-13 | Genesis Research And Development Corporation Limited | Fibroblast growth factor receptors and methods for their use |
US20050112642A1 (en) * | 2002-05-28 | 2005-05-26 | Genesis Research And Development Corporation Limited | Fibroblast growth factor receptors and methods for their use |
US20050282733A1 (en) * | 2002-06-27 | 2005-12-22 | Prins Johannes B | Differentiation modulating agents and uses therefor |
EP2267117A3 (en) * | 2002-06-27 | 2011-07-13 | Verva Pharmaceuticals Pty Ltd | Differentiation modulating agents and uses therefor |
NZ569957A (en) | 2006-02-10 | 2012-03-30 | Genentech Inc | Anti-FGF19 antibodies and methods using same |
US8236766B2 (en) * | 2006-11-10 | 2012-08-07 | Cara Therapeutics, Inc. | Uses of synthetic peptide amides |
US8906859B2 (en) * | 2006-11-10 | 2014-12-09 | Cera Therapeutics, Inc. | Uses of kappa opioid synthetic peptide amides |
US7842662B2 (en) | 2006-11-10 | 2010-11-30 | Cara Therapeutics, Inc. | Synthetic peptide amide dimers |
US7713937B2 (en) * | 2006-11-10 | 2010-05-11 | Cara Therapeutics, Inc. | Synthetic peptide amides and dimeric forms thereof |
MY148144A (en) | 2006-11-10 | 2013-03-15 | Cara Therapeutics Inc | Synthetic peptide amides |
NZ584827A (en) | 2007-10-01 | 2012-10-26 | Isis Pharmaceuticals Inc | Antisense modulation of fibroblast growth factor receptor 4 expression |
CN102196771A (en) | 2008-09-19 | 2011-09-21 | 拜尔健康护理有限责任公司 | Analyte sensors, systems, testing apparatus and manufacturing methods |
US20120189641A1 (en) | 2009-02-25 | 2012-07-26 | OSI Pharmaceuticals, LLC | Combination anti-cancer therapy |
WO2010099138A2 (en) | 2009-02-27 | 2010-09-02 | Osi Pharmaceuticals, Inc. | Methods for the identification of agents that inhibit mesenchymal-like tumor cells or their formation |
US8642834B2 (en) | 2009-02-27 | 2014-02-04 | OSI Pharmaceuticals, LLC | Methods for the identification of agents that inhibit mesenchymal-like tumor cells or their formation |
JP2012519282A (en) | 2009-02-27 | 2012-08-23 | オーエスアイ・ファーマシューティカルズ,エルエルシー | Methods for identifying mesenchymal tumor cells or agents that inhibit their production |
EP2692358A4 (en) * | 2011-03-31 | 2014-12-10 | Univ Kyoto | THERAPEUTIC AGENT FOR CANCER, AND METHOD FOR THE PROGNOSTIC DETERMINATION OF CANCER |
WO2012149014A1 (en) | 2011-04-25 | 2012-11-01 | OSI Pharmaceuticals, LLC | Use of emt gene signatures in cancer drug discovery, diagnostics, and treatment |
HUE033584T2 (en) * | 2011-05-16 | 2017-12-28 | Hoffmann La Roche | Fgfr1 agonists and methods of use |
CA2839437A1 (en) | 2011-06-16 | 2012-12-20 | Isis Pharmaceuticals, Inc. | Antisense modulation of fibroblast growth factor receptor 4 expression |
WO2013152252A1 (en) | 2012-04-06 | 2013-10-10 | OSI Pharmaceuticals, LLC | Combination anti-cancer therapy |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6242419B1 (en) * | 1999-03-25 | 2001-06-05 | Genesis Research & Development Corporation Ltd. | Compositions isolated from stromal cells and methods for their use |
US6812339B1 (en) * | 2000-09-08 | 2004-11-02 | Applera Corporation | Polymorphisms in known genes associated with human disease, methods of detection and uses thereof |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1999063088A2 (en) | 1998-06-02 | 1999-12-09 | Genentech, Inc. | Membrane-bound proteins and nucleic acids encoding the same |
US20030054987A1 (en) * | 1997-06-16 | 2003-03-20 | Genentech, Inc. | Secreted and transmembrane polypeptides and nucleic acids encoding the same |
WO2000024756A1 (en) * | 1998-10-23 | 2000-05-04 | Human Genome Sciences, Inc. | Fibroblast growth factor receptor-5 |
-
2001
- 2001-03-22 WO PCT/US2001/009073 patent/WO2001070977A2/en active Application Filing
- 2001-03-22 JP JP2001569360A patent/JP2003527858A/en not_active Withdrawn
- 2001-03-22 CA CA002403572A patent/CA2403572A1/en not_active Abandoned
- 2001-03-22 US US09/815,108 patent/US7348162B2/en not_active Expired - Fee Related
- 2001-03-22 EP EP01926402A patent/EP1268793A2/en not_active Withdrawn
- 2001-03-22 AU AU2001252940A patent/AU2001252940A1/en not_active Abandoned
- 2001-03-22 MX MXPA02009224A patent/MXPA02009224A/en not_active Application Discontinuation
-
2002
- 2002-08-28 US US10/229,584 patent/US20030087384A1/en not_active Abandoned
-
2006
- 2006-07-18 JP JP2006196258A patent/JP2006340720A/en active Pending
-
2007
- 2007-08-13 US US11/838,136 patent/US20080044859A1/en not_active Abandoned
-
2009
- 2009-05-22 JP JP2009124657A patent/JP2009278975A/en not_active Withdrawn
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6242419B1 (en) * | 1999-03-25 | 2001-06-05 | Genesis Research & Development Corporation Ltd. | Compositions isolated from stromal cells and methods for their use |
US6812339B1 (en) * | 2000-09-08 | 2004-11-02 | Applera Corporation | Polymorphisms in known genes associated with human disease, methods of detection and uses thereof |
Also Published As
Publication number | Publication date |
---|---|
WO2001070977A2 (en) | 2001-09-27 |
US20020009776A1 (en) | 2002-01-24 |
JP2006340720A (en) | 2006-12-21 |
AU2001252940A1 (en) | 2001-10-03 |
US20080044859A1 (en) | 2008-02-21 |
US7348162B2 (en) | 2008-03-25 |
WO2001070977A3 (en) | 2002-03-28 |
JP2003527858A (en) | 2003-09-24 |
EP1268793A2 (en) | 2003-01-02 |
CA2403572A1 (en) | 2001-09-27 |
MXPA02009224A (en) | 2003-03-12 |
JP2009278975A (en) | 2009-12-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20030087384A1 (en) | Fibroblast growth factor receptor-like molecules and uses thereof | |
AU783682B2 (en) | Fhm, a novel member of the TNF ligand supergene family | |
AU2001265198A1 (en) | Cystine-knot polypeptides: cloaked-2 molecules and uses thereof | |
JP2004519205A (en) | Thymic stromal lymphopoietin receptor molecule and its use | |
EP1257645A2 (en) | Fibroblast growth factor-23 molecules and uses thereof | |
AU2001279024B2 (en) | C3b/C4b complement receptor-like molecules and uses thereof | |
US20080027000A1 (en) | APO-A-I Regulation of T-Cell Signaling | |
US6599716B1 (en) | Nucleic acids encoding NTR3, a member of the TNF-receptor supergene family | |
JP2006238894A (en) | Fibroblast growth factor-23 molecules and uses thereof | |
JP2008001708A (en) | Tumor endothelial marker 7-alpha molecules and uses thereof | |
EP1354039A2 (en) | Atp-binding cassette transporter-like molecules and uses thereof | |
AU2002219947A1 (en) | Transforming growth factor-beta-related molecules and uses thereof | |
AU2006200794A1 (en) | Fhm, A novel member of the TNF ligand supergene family | |
MXPA02007119A (en) | Chondromodulin i related peptide. | |
CA2402772A1 (en) | Apolipoprotein-a-i regulation of t-cell signalling | |
AU2006202629A1 (en) | Chondromodulin-I Related Peptide | |
EP1268782A2 (en) | Apolipoprotein-a-i regulation of t-cell signalling | |
AU2001297848A1 (en) | ATP-Binding cassette transporter-like molecules and uses thereof | |
WO2005063292A2 (en) | Heh4 molecules and uses thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |