CA2450470C - Method for rapid screening of bacterial transformants and novel simian adenovirus proteins - Google Patents
Method for rapid screening of bacterial transformants and novel simian adenovirus proteins Download PDFInfo
- Publication number
- CA2450470C CA2450470C CA2450470A CA2450470A CA2450470C CA 2450470 C CA2450470 C CA 2450470C CA 2450470 A CA2450470 A CA 2450470A CA 2450470 A CA2450470 A CA 2450470A CA 2450470 C CA2450470 C CA 2450470C
- Authority
- CA
- Canada
- Prior art keywords
- leu
- ala
- gly
- ser
- thr
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 268
- 102000004169 proteins and genes Human genes 0.000 title abstract description 162
- 238000000034 method Methods 0.000 title abstract description 71
- 238000012216 screening Methods 0.000 title abstract description 11
- 230000001580 bacterial effect Effects 0.000 title description 11
- 241000990167 unclassified Simian adenoviruses Species 0.000 title description 3
- 241000701161 unidentified adenovirus Species 0.000 claims abstract description 131
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 40
- 241000700605 Viruses Species 0.000 claims description 73
- 239000012634 fragment Substances 0.000 claims description 70
- 150000001413 amino acids Chemical class 0.000 claims description 50
- 238000012384 transportation and delivery Methods 0.000 claims description 43
- 210000000234 capsid Anatomy 0.000 claims description 31
- 238000004519 manufacturing process Methods 0.000 claims description 25
- NTIZESTWPVYFNL-UHFFFAOYSA-N Methyl isobutyl ketone Chemical compound CC(C)CC(C)=O NTIZESTWPVYFNL-UHFFFAOYSA-N 0.000 claims description 23
- 239000000835 fiber Substances 0.000 claims description 23
- 230000002163 immunogen Effects 0.000 claims description 23
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 21
- 101710094396 Hexon protein Proteins 0.000 claims description 20
- 241000598171 Human adenovirus sp. Species 0.000 claims description 20
- 230000001225 therapeutic effect Effects 0.000 claims description 19
- CXURGFRDGROIKG-UHFFFAOYSA-N 3,3-bis(chloromethyl)oxetane Chemical compound ClCC1(CCl)COC1 CXURGFRDGROIKG-UHFFFAOYSA-N 0.000 claims description 11
- 101710145505 Fiber protein Proteins 0.000 claims description 10
- 101710173835 Penton protein Proteins 0.000 claims description 8
- 108700026758 Adenovirus hexon capsid Proteins 0.000 claims description 3
- 241000193096 Human adenovirus B3 Species 0.000 claims description 2
- 239000003814 drug Substances 0.000 claims description 2
- 239000008194 pharmaceutical composition Substances 0.000 claims 2
- 229940123373 Adenovirus E1A gene Drugs 0.000 claims 1
- 241001135569 Human adenovirus 5 Species 0.000 claims 1
- 241000282577 Pan troglodytes Species 0.000 abstract description 55
- 239000000203 mixture Substances 0.000 abstract description 34
- 102000004196 processed proteins & peptides Human genes 0.000 abstract description 31
- 229920001184 polypeptide Polymers 0.000 abstract description 21
- 230000003053 immunization Effects 0.000 abstract description 10
- 238000002649 immunization Methods 0.000 abstract description 10
- 238000002560 therapeutic procedure Methods 0.000 abstract description 2
- 239000013598 vector Substances 0.000 description 135
- 210000004027 cell Anatomy 0.000 description 111
- 241000282414 Homo sapiens Species 0.000 description 58
- 108700019146 Transgenes Proteins 0.000 description 49
- 230000014509 gene expression Effects 0.000 description 44
- 239000000427 antigen Substances 0.000 description 40
- 108091007433 antigens Proteins 0.000 description 40
- 102000036639 antigens Human genes 0.000 description 40
- 239000013612 plasmid Substances 0.000 description 40
- 230000003612 virological effect Effects 0.000 description 40
- 108090000565 Capsid Proteins Proteins 0.000 description 36
- 102100023321 Ceruloplasmin Human genes 0.000 description 35
- 239000000047 product Substances 0.000 description 35
- 239000013603 viral vector Substances 0.000 description 33
- 108020004414 DNA Proteins 0.000 description 32
- 108010050848 glycylleucine Proteins 0.000 description 31
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 26
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 25
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 25
- 239000005090 green fluorescent protein Substances 0.000 description 24
- 230000037430 deletion Effects 0.000 description 21
- 238000012217 deletion Methods 0.000 description 21
- 230000028993 immune response Effects 0.000 description 21
- 108010061238 threonyl-glycine Proteins 0.000 description 21
- 241001217856 Chimpanzee adenovirus Species 0.000 description 19
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 19
- 108010038633 aspartylglutamate Proteins 0.000 description 18
- 239000002245 particle Substances 0.000 description 18
- 108010078144 glutaminyl-glycine Proteins 0.000 description 17
- 239000003550 marker Substances 0.000 description 17
- 230000003472 neutralizing effect Effects 0.000 description 17
- 241000880493 Leptailurus serval Species 0.000 description 14
- 230000004048 modification Effects 0.000 description 14
- 238000012986 modification Methods 0.000 description 14
- 108010070643 prolylglutamic acid Proteins 0.000 description 14
- 108010026333 seryl-proline Proteins 0.000 description 14
- 230000006801 homologous recombination Effects 0.000 description 13
- 238000002744 homologous recombination Methods 0.000 description 13
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 12
- 108091028043 Nucleic acid sequence Proteins 0.000 description 12
- 108091008874 T cell receptors Proteins 0.000 description 12
- 102000016266 T-Cell Antigen Receptors Human genes 0.000 description 12
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 12
- 230000001939 inductive effect Effects 0.000 description 12
- 208000015181 infectious disease Diseases 0.000 description 12
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 12
- 108010057821 leucylproline Proteins 0.000 description 12
- 150000007523 nucleic acids Chemical class 0.000 description 12
- 108010051242 phenylalanylserine Proteins 0.000 description 12
- 230000037452 priming Effects 0.000 description 12
- 230000008685 targeting Effects 0.000 description 12
- 102000004190 Enzymes Human genes 0.000 description 11
- 108090000790 Enzymes Proteins 0.000 description 11
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 11
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 11
- 108010060035 arginylproline Proteins 0.000 description 11
- 229940088598 enzyme Drugs 0.000 description 11
- -1 for example Proteins 0.000 description 11
- 108010034529 leucyl-lysine Proteins 0.000 description 11
- 238000004806 packaging method and process Methods 0.000 description 11
- 230000001105 regulatory effect Effects 0.000 description 11
- 238000001890 transfection Methods 0.000 description 11
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 10
- 108010047495 alanylglycine Proteins 0.000 description 10
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 10
- 238000010367 cloning Methods 0.000 description 10
- 230000000295 complement effect Effects 0.000 description 10
- 238000010276 construction Methods 0.000 description 10
- 201000010099 disease Diseases 0.000 description 10
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 10
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 10
- 230000001717 pathogenic effect Effects 0.000 description 10
- JSYULGSPLTZDHM-NRPADANISA-N Gln-Ala-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O JSYULGSPLTZDHM-NRPADANISA-N 0.000 description 9
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 9
- 241000725303 Human immunodeficiency virus Species 0.000 description 9
- 241001465754 Metazoa Species 0.000 description 9
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 9
- 108010044940 alanylglutamine Proteins 0.000 description 9
- 108010008355 arginyl-glutamine Proteins 0.000 description 9
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 9
- 230000006870 function Effects 0.000 description 9
- 108010089804 glycyl-threonine Proteins 0.000 description 9
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 8
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 8
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 8
- XYPJXLLXNSAWHZ-SRVKXCTJSA-N Asp-Ser-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XYPJXLLXNSAWHZ-SRVKXCTJSA-N 0.000 description 8
- 241000701022 Cytomegalovirus Species 0.000 description 8
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 8
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 8
- 108700026244 Open Reading Frames Proteins 0.000 description 8
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 8
- 241000713311 Simian immunodeficiency virus Species 0.000 description 8
- 210000001744 T-lymphocyte Anatomy 0.000 description 8
- PFNZJEPSCBAVGX-CYDGBPFRSA-N Val-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N PFNZJEPSCBAVGX-CYDGBPFRSA-N 0.000 description 8
- 108010005233 alanylglutamic acid Proteins 0.000 description 8
- 238000003556 assay Methods 0.000 description 8
- 108010049041 glutamylalanine Proteins 0.000 description 8
- 108010037850 glycylvaline Proteins 0.000 description 8
- 108010003700 lysyl aspartic acid Proteins 0.000 description 8
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 7
- ZTKHZAXGTFXUDD-VEVYYDQMSA-N Arg-Asn-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZTKHZAXGTFXUDD-VEVYYDQMSA-N 0.000 description 7
- AIFHRTPABBBHKU-RCWTZXSCSA-N Arg-Thr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AIFHRTPABBBHKU-RCWTZXSCSA-N 0.000 description 7
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 7
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 7
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 7
- 108060003951 Immunoglobulin Proteins 0.000 description 7
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 7
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 7
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 7
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 7
- WYNIRYZIFZGWQD-BPUTZDHNSA-N Met-Trp-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N WYNIRYZIFZGWQD-BPUTZDHNSA-N 0.000 description 7
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 7
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 7
- NDXSOKGYKCGYKT-VEVYYDQMSA-N Thr-Pro-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O NDXSOKGYKCGYKT-VEVYYDQMSA-N 0.000 description 7
- BVDHHLMIZFCAAU-BZSNNMDCSA-N Tyr-Cys-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BVDHHLMIZFCAAU-BZSNNMDCSA-N 0.000 description 7
- TWAVEIJGFCBWCG-JYJNAYRXSA-N Tyr-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N TWAVEIJGFCBWCG-JYJNAYRXSA-N 0.000 description 7
- NVZVJIUDICCMHZ-BZSNNMDCSA-N Tyr-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O NVZVJIUDICCMHZ-BZSNNMDCSA-N 0.000 description 7
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 7
- 108010047857 aspartylglycine Proteins 0.000 description 7
- 230000029087 digestion Effects 0.000 description 7
- 108010081551 glycylphenylalanine Proteins 0.000 description 7
- 102000018358 immunoglobulin Human genes 0.000 description 7
- 101150066555 lacZ gene Proteins 0.000 description 7
- 108010012058 leucyltyrosine Proteins 0.000 description 7
- 230000035772 mutation Effects 0.000 description 7
- 239000002773 nucleotide Substances 0.000 description 7
- 125000003729 nucleotide group Chemical group 0.000 description 7
- 230000010076 replication Effects 0.000 description 7
- 239000013605 shuttle vector Substances 0.000 description 7
- 229960005486 vaccine Drugs 0.000 description 7
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 6
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 6
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 6
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 6
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 6
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 6
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 6
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 6
- 241000124008 Mammalia Species 0.000 description 6
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 6
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 6
- CSNBWOJOEOPYIJ-UVOCVTCTSA-N Thr-Thr-Lys Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O CSNBWOJOEOPYIJ-UVOCVTCTSA-N 0.000 description 6
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 6
- KAJRRNHOVMZYBL-IRIUXVKKSA-N Thr-Tyr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAJRRNHOVMZYBL-IRIUXVKKSA-N 0.000 description 6
- RLVTVHSDKHBFQP-ULQDDVLXSA-N Val-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 RLVTVHSDKHBFQP-ULQDDVLXSA-N 0.000 description 6
- 108010013835 arginine glutamate Proteins 0.000 description 6
- 108010068380 arginylarginine Proteins 0.000 description 6
- 239000002299 complementary DNA Substances 0.000 description 6
- 108010069495 cysteinyltyrosine Proteins 0.000 description 6
- 230000002068 genetic effect Effects 0.000 description 6
- 108010036413 histidylglycine Proteins 0.000 description 6
- NOESYZHRGYRDHS-UHFFFAOYSA-N insulin Chemical compound N1C(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(NC(=O)CN)C(C)CC)CSSCC(C(NC(CO)C(=O)NC(CC(C)C)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CCC(N)=O)C(=O)NC(CC(C)C)C(=O)NC(CCC(O)=O)C(=O)NC(CC(N)=O)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CSSCC(NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2C=CC(O)=CC=2)NC(=O)C(CC(C)C)NC(=O)C(C)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2NC=NC=2)NC(=O)C(CO)NC(=O)CNC2=O)C(=O)NCC(=O)NC(CCC(O)=O)C(=O)NC(CCCNC(N)=N)C(=O)NCC(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC(O)=CC=3)C(=O)NC(C(C)O)C(=O)N3C(CCC3)C(=O)NC(CCCCN)C(=O)NC(C)C(O)=O)C(=O)NC(CC(N)=O)C(O)=O)=O)NC(=O)C(C(C)CC)NC(=O)C(CO)NC(=O)C(C(C)O)NC(=O)C1CSSCC2NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)C(C)C)CC1=CN=CN1 NOESYZHRGYRDHS-UHFFFAOYSA-N 0.000 description 6
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 6
- 108010064235 lysylglycine Proteins 0.000 description 6
- 108010054155 lysyllysine Proteins 0.000 description 6
- 230000001404 mediated effect Effects 0.000 description 6
- 108010022588 methionyl-lysyl-proline Proteins 0.000 description 6
- 108010056582 methionylglutamic acid Proteins 0.000 description 6
- 108010005942 methionylglycine Proteins 0.000 description 6
- 108010077112 prolyl-proline Proteins 0.000 description 6
- 102000005962 receptors Human genes 0.000 description 6
- 108020003175 receptors Proteins 0.000 description 6
- 238000011285 therapeutic regimen Methods 0.000 description 6
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 6
- 239000003053 toxin Substances 0.000 description 6
- 231100000765 toxin Toxicity 0.000 description 6
- 108700012359 toxins Proteins 0.000 description 6
- 238000012546 transfer Methods 0.000 description 6
- 108010027345 wheylin-1 peptide Proteins 0.000 description 6
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 5
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 5
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 5
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 5
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 5
- DEWWPUNXRNGMQN-LPEHRKFASA-N Ala-Met-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N DEWWPUNXRNGMQN-LPEHRKFASA-N 0.000 description 5
- PEIBBAXIKUAYGN-UBHSHLNASA-N Ala-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 PEIBBAXIKUAYGN-UBHSHLNASA-N 0.000 description 5
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 5
- JJGRJMKUOYXZRA-LPEHRKFASA-N Asn-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O JJGRJMKUOYXZRA-LPEHRKFASA-N 0.000 description 5
- SZNGQSBRHFMZLT-IHRRRGAJSA-N Asn-Pro-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SZNGQSBRHFMZLT-IHRRRGAJSA-N 0.000 description 5
- BEHQTVDBCLSCBY-CFMVVWHZSA-N Asn-Tyr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BEHQTVDBCLSCBY-CFMVVWHZSA-N 0.000 description 5
- HOQGTAIGQSDCHR-SRVKXCTJSA-N Asp-Asn-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HOQGTAIGQSDCHR-SRVKXCTJSA-N 0.000 description 5
- 208000023275 Autoimmune disease Diseases 0.000 description 5
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 5
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 5
- AAJHGGDRKHYSDH-GUBZILKMSA-N Glu-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O AAJHGGDRKHYSDH-GUBZILKMSA-N 0.000 description 5
- FMVLWTYYODVFRG-BQBZGAKWSA-N Gly-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN FMVLWTYYODVFRG-BQBZGAKWSA-N 0.000 description 5
- XLFHCWHXKSFVIB-BQBZGAKWSA-N Gly-Gln-Gln Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLFHCWHXKSFVIB-BQBZGAKWSA-N 0.000 description 5
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 5
- 108010065920 Insulin Lispro Proteins 0.000 description 5
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 5
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 5
- REPPKAMYTOJTFC-DCAQKATOSA-N Leu-Arg-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O REPPKAMYTOJTFC-DCAQKATOSA-N 0.000 description 5
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 5
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 5
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 5
- 241000282560 Macaca mulatta Species 0.000 description 5
- GHQFLTYXGUETFD-UFYCRDLUSA-N Met-Tyr-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N GHQFLTYXGUETFD-UFYCRDLUSA-N 0.000 description 5
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 5
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 5
- LGSANCBHSMDFDY-GARJFASQSA-N Pro-Glu-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O LGSANCBHSMDFDY-GARJFASQSA-N 0.000 description 5
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 5
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 5
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 5
- BMOFUVHDBROBSE-DCAQKATOSA-N Val-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N BMOFUVHDBROBSE-DCAQKATOSA-N 0.000 description 5
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 5
- 108010062796 arginyllysine Proteins 0.000 description 5
- 108010077245 asparaginyl-proline Proteins 0.000 description 5
- 108010093581 aspartyl-proline Proteins 0.000 description 5
- 108010092854 aspartyllysine Proteins 0.000 description 5
- 239000003795 chemical substances by application Substances 0.000 description 5
- 238000007796 conventional method Methods 0.000 description 5
- 108010048367 enhanced green fluorescent protein Proteins 0.000 description 5
- 108020001507 fusion proteins Proteins 0.000 description 5
- 102000037865 fusion proteins Human genes 0.000 description 5
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 5
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 5
- 108010010147 glycylglutamine Proteins 0.000 description 5
- 108010025306 histidylleucine Proteins 0.000 description 5
- 108010024607 phenylalanylalanine Proteins 0.000 description 5
- 238000002360 preparation method Methods 0.000 description 5
- 230000008569 process Effects 0.000 description 5
- 108010004914 prolylarginine Proteins 0.000 description 5
- 108010029020 prolylglycine Proteins 0.000 description 5
- 108010053725 prolylvaline Proteins 0.000 description 5
- 230000001681 protective effect Effects 0.000 description 5
- 238000013519 translation Methods 0.000 description 5
- 238000011282 treatment Methods 0.000 description 5
- 108010020532 tyrosyl-proline Proteins 0.000 description 5
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 5
- 239000003981 vehicle Substances 0.000 description 5
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 4
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 4
- ZXKNLCPUNZPFGY-LEWSCRJBSA-N Ala-Tyr-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N ZXKNLCPUNZPFGY-LEWSCRJBSA-N 0.000 description 4
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 4
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 4
- GHNDBBVSWOWYII-LPEHRKFASA-N Arg-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GHNDBBVSWOWYII-LPEHRKFASA-N 0.000 description 4
- WKPXXXUSUHAXDE-SRVKXCTJSA-N Arg-Pro-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O WKPXXXUSUHAXDE-SRVKXCTJSA-N 0.000 description 4
- SPCONPVIDFMDJI-QSFUFRPTSA-N Asn-Ile-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O SPCONPVIDFMDJI-QSFUFRPTSA-N 0.000 description 4
- LKIYSIYBKYLKPU-BIIVOSGPSA-N Asp-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O LKIYSIYBKYLKPU-BIIVOSGPSA-N 0.000 description 4
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 4
- CUQDCPXNZPDYFQ-ZLUOBGJFSA-N Asp-Ser-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O CUQDCPXNZPDYFQ-ZLUOBGJFSA-N 0.000 description 4
- 241000193738 Bacillus anthracis Species 0.000 description 4
- 108090000994 Catalytic RNA Proteins 0.000 description 4
- 102000053642 Catalytic RNA Human genes 0.000 description 4
- 108091026890 Coding region Proteins 0.000 description 4
- 241000711573 Coronaviridae Species 0.000 description 4
- 108010041986 DNA Vaccines Proteins 0.000 description 4
- 229940021995 DNA vaccine Drugs 0.000 description 4
- 102000052510 DNA-Binding Proteins Human genes 0.000 description 4
- 101710096438 DNA-binding protein Proteins 0.000 description 4
- 102100033295 Glial cell line-derived neurotrophic factor Human genes 0.000 description 4
- PKVWNYGXMNWJSI-CIUDSAMLSA-N Gln-Gln-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O PKVWNYGXMNWJSI-CIUDSAMLSA-N 0.000 description 4
- QGWXAMDECCKGRU-XVKPBYJWSA-N Gln-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(N)=O)C(=O)NCC(O)=O QGWXAMDECCKGRU-XVKPBYJWSA-N 0.000 description 4
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 4
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 4
- HMJULNMJWOZNFI-XHNCKOQMSA-N Glu-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N)C(=O)O HMJULNMJWOZNFI-XHNCKOQMSA-N 0.000 description 4
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 4
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 4
- BDFCIKANUNMFGB-PMVVWTBXSA-N His-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 BDFCIKANUNMFGB-PMVVWTBXSA-N 0.000 description 4
- NTXIJPDAHXSHNL-ONGXEEELSA-N His-Gly-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NTXIJPDAHXSHNL-ONGXEEELSA-N 0.000 description 4
- 241000282412 Homo Species 0.000 description 4
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 4
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 4
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 4
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 4
- 206010028980 Neoplasm Diseases 0.000 description 4
- 101710087110 ORF6 protein Proteins 0.000 description 4
- IDUCUXTUHHIQIP-SOUVJXGZSA-N Phe-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O IDUCUXTUHHIQIP-SOUVJXGZSA-N 0.000 description 4
- YKQNVTOIYFQMLW-IHRRRGAJSA-N Pro-Cys-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 YKQNVTOIYFQMLW-IHRRRGAJSA-N 0.000 description 4
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 4
- WSRWHZRUOCACLJ-UWVGGRQHSA-N Pro-Gly-His Chemical compound C([C@@H](C(=O)O)NC(=O)CNC(=O)[C@H]1NCCC1)C1=CN=CN1 WSRWHZRUOCACLJ-UWVGGRQHSA-N 0.000 description 4
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 4
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 4
- RMJZWERKFFNNNS-XGEHTFHBSA-N Pro-Thr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMJZWERKFFNNNS-XGEHTFHBSA-N 0.000 description 4
- 101710149951 Protein Tat Proteins 0.000 description 4
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 4
- HVKMTOIAYDOJPL-NRPADANISA-N Ser-Gln-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVKMTOIAYDOJPL-NRPADANISA-N 0.000 description 4
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 4
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 4
- NQZFFLBPNDLTPO-DLOVCJGASA-N Ser-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CO)N NQZFFLBPNDLTPO-DLOVCJGASA-N 0.000 description 4
- XOTBWOCSLMBGMF-SUSMZKCASA-N Thr-Glu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOTBWOCSLMBGMF-SUSMZKCASA-N 0.000 description 4
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 4
- WBCCCPZIJIJTSD-TUBUOCAGSA-N Thr-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H]([C@@H](C)O)N WBCCCPZIJIJTSD-TUBUOCAGSA-N 0.000 description 4
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 4
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 4
- 108010009583 Transforming Growth Factors Proteins 0.000 description 4
- 102000009618 Transforming Growth Factors Human genes 0.000 description 4
- CKHQKYHIZCRTAP-SOUVJXGZSA-N Tyr-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O CKHQKYHIZCRTAP-SOUVJXGZSA-N 0.000 description 4
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 4
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 4
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 4
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 4
- XPKCFQZDQGVJCX-RHYQMDGZSA-N Val-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N)O XPKCFQZDQGVJCX-RHYQMDGZSA-N 0.000 description 4
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 4
- 239000002671 adjuvant Substances 0.000 description 4
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 4
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 4
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 4
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 4
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 4
- 150000001875 compounds Chemical class 0.000 description 4
- 108010004073 cysteinylcysteine Proteins 0.000 description 4
- 108010016616 cysteinylglycine Proteins 0.000 description 4
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 4
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 4
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 4
- 210000000987 immune system Anatomy 0.000 description 4
- 230000036039 immunity Effects 0.000 description 4
- 229940072221 immunoglobulins Drugs 0.000 description 4
- 238000000338 in vitro Methods 0.000 description 4
- 238000001727 in vivo Methods 0.000 description 4
- 238000003780 insertion Methods 0.000 description 4
- 230000037431 insertion Effects 0.000 description 4
- 108010027338 isoleucylcysteine Proteins 0.000 description 4
- 108010078274 isoleucylvaline Proteins 0.000 description 4
- 108010053037 kyotorphin Proteins 0.000 description 4
- 239000003446 ligand Substances 0.000 description 4
- 108010038320 lysylphenylalanine Proteins 0.000 description 4
- 201000006417 multiple sclerosis Diseases 0.000 description 4
- 244000052769 pathogen Species 0.000 description 4
- 239000012071 phase Substances 0.000 description 4
- 239000013600 plasmid vector Substances 0.000 description 4
- 108010079317 prolyl-tyrosine Proteins 0.000 description 4
- 108010090894 prolylleucine Proteins 0.000 description 4
- 206010039073 rheumatoid arthritis Diseases 0.000 description 4
- 108091092562 ribozyme Proteins 0.000 description 4
- 108010071207 serylmethionine Proteins 0.000 description 4
- 241000894007 species Species 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- 210000001519 tissue Anatomy 0.000 description 4
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 4
- 108010073969 valyllysine Proteins 0.000 description 4
- 108010009962 valyltyrosine Proteins 0.000 description 4
- CEHZCZCQHUNAJF-AVGNSLFASA-N (2s)-1-[2-[[(2s)-1-[(2s)-2-amino-5-(diaminomethylideneamino)pentanoyl]pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N1[C@H](C(O)=O)CCC1 CEHZCZCQHUNAJF-AVGNSLFASA-N 0.000 description 3
- COEXAQSTZUWMRI-STQMWFEESA-N (2s)-1-[2-[[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound C([C@H](N)C(=O)NCC(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=C(O)C=C1 COEXAQSTZUWMRI-STQMWFEESA-N 0.000 description 3
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 3
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 3
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 3
- UCIYCBSJBQGDGM-LPEHRKFASA-N Ala-Arg-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N UCIYCBSJBQGDGM-LPEHRKFASA-N 0.000 description 3
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 3
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 3
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 3
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 3
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 3
- CHFFHQUVXHEGBY-GARJFASQSA-N Ala-Lys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CHFFHQUVXHEGBY-GARJFASQSA-N 0.000 description 3
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 3
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 description 3
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 3
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 3
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 3
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 3
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 3
- DBKNLHKEVPZVQC-LPEHRKFASA-N Arg-Ala-Pro Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O DBKNLHKEVPZVQC-LPEHRKFASA-N 0.000 description 3
- MUXONAMCEUBVGA-DCAQKATOSA-N Arg-Arg-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O MUXONAMCEUBVGA-DCAQKATOSA-N 0.000 description 3
- CRCCTGPNZUCAHE-DCAQKATOSA-N Arg-His-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CN=CN1 CRCCTGPNZUCAHE-DCAQKATOSA-N 0.000 description 3
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 3
- 108010051330 Arg-Pro-Gly-Pro Proteins 0.000 description 3
- LRPZJPMQGKGHSG-XGEHTFHBSA-N Arg-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)O LRPZJPMQGKGHSG-XGEHTFHBSA-N 0.000 description 3
- NUHQMYUWLUSRJX-BIIVOSGPSA-N Asn-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N NUHQMYUWLUSRJX-BIIVOSGPSA-N 0.000 description 3
- KXEGPPNPXOKKHK-ZLUOBGJFSA-N Asn-Asp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KXEGPPNPXOKKHK-ZLUOBGJFSA-N 0.000 description 3
- MSBDSTRUMZFSEU-PEFMBERDSA-N Asn-Glu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MSBDSTRUMZFSEU-PEFMBERDSA-N 0.000 description 3
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 3
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 3
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 3
- WIDVAWAQBRAKTI-YUMQZZPRSA-N Asn-Leu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O WIDVAWAQBRAKTI-YUMQZZPRSA-N 0.000 description 3
- FBODFHMLALOPHP-GUBZILKMSA-N Asn-Lys-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O FBODFHMLALOPHP-GUBZILKMSA-N 0.000 description 3
- IDUUACUJKUXKKD-VEVYYDQMSA-N Asn-Pro-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O IDUUACUJKUXKKD-VEVYYDQMSA-N 0.000 description 3
- XLILXFRAKOYEJX-GUBZILKMSA-N Asp-Leu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLILXFRAKOYEJX-GUBZILKMSA-N 0.000 description 3
- DONWIPDSZZJHHK-HJGDQZAQSA-N Asp-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)O DONWIPDSZZJHHK-HJGDQZAQSA-N 0.000 description 3
- 241000894006 Bacteria Species 0.000 description 3
- 102000014914 Carrier Proteins Human genes 0.000 description 3
- 108020004705 Codon Proteins 0.000 description 3
- WVLZTXGTNGHPBO-SRVKXCTJSA-N Cys-Leu-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O WVLZTXGTNGHPBO-SRVKXCTJSA-N 0.000 description 3
- BOMGEMDZTNZESV-QWRGUYRKSA-N Cys-Tyr-Gly Chemical compound SC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 BOMGEMDZTNZESV-QWRGUYRKSA-N 0.000 description 3
- 102000004127 Cytokines Human genes 0.000 description 3
- 108090000695 Cytokines Proteins 0.000 description 3
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 3
- 108010069091 Dystrophin Proteins 0.000 description 3
- 102000001039 Dystrophin Human genes 0.000 description 3
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 3
- IGNGBUVODQLMRJ-CIUDSAMLSA-N Gln-Ala-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O IGNGBUVODQLMRJ-CIUDSAMLSA-N 0.000 description 3
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 3
- INFBPLSHYFALDE-ACZMJKKPSA-N Gln-Asn-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O INFBPLSHYFALDE-ACZMJKKPSA-N 0.000 description 3
- JXFLPKSDLDEOQK-JHEQGTHGSA-N Gln-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O JXFLPKSDLDEOQK-JHEQGTHGSA-N 0.000 description 3
- VZRAXPGTUNDIDK-GUBZILKMSA-N Gln-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VZRAXPGTUNDIDK-GUBZILKMSA-N 0.000 description 3
- LJLPOZGRPLORTF-CIUDSAMLSA-N Glu-Asn-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O LJLPOZGRPLORTF-CIUDSAMLSA-N 0.000 description 3
- CKOFNWCLWRYUHK-XHNCKOQMSA-N Glu-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CKOFNWCLWRYUHK-XHNCKOQMSA-N 0.000 description 3
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 3
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 3
- ITBHUUMCJJQUSC-LAEOZQHASA-N Glu-Ile-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O ITBHUUMCJJQUSC-LAEOZQHASA-N 0.000 description 3
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 3
- ZWMYUDZLXAQHCK-CIUDSAMLSA-N Glu-Met-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O ZWMYUDZLXAQHCK-CIUDSAMLSA-N 0.000 description 3
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 3
- MRWYPDWDZSLWJM-ACZMJKKPSA-N Glu-Ser-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O MRWYPDWDZSLWJM-ACZMJKKPSA-N 0.000 description 3
- MWTGQXBHVRTCOR-GLLZPBPUSA-N Glu-Thr-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MWTGQXBHVRTCOR-GLLZPBPUSA-N 0.000 description 3
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 3
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 3
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 3
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 3
- CIMULJZTTOBOPN-WHFBIAKZSA-N Gly-Asn-Asn Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CIMULJZTTOBOPN-WHFBIAKZSA-N 0.000 description 3
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 3
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 3
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 3
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 3
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 3
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 3
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 3
- DBJYVKDPGIFXFO-BQBZGAKWSA-N Gly-Met-Ala Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O DBJYVKDPGIFXFO-BQBZGAKWSA-N 0.000 description 3
- FXLVSYVJDPCIHH-STQMWFEESA-N Gly-Phe-Arg Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FXLVSYVJDPCIHH-STQMWFEESA-N 0.000 description 3
- NGRPGJGKJMUGDM-XVKPBYJWSA-N Gly-Val-Gln Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NGRPGJGKJMUGDM-XVKPBYJWSA-N 0.000 description 3
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 3
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 3
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- PDSUIXMZYNURGI-AVGNSLFASA-N His-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC1=CN=CN1 PDSUIXMZYNURGI-AVGNSLFASA-N 0.000 description 3
- ALPXXNRQBMRCPZ-MEYUZBJRSA-N His-Thr-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ALPXXNRQBMRCPZ-MEYUZBJRSA-N 0.000 description 3
- 101150032643 IVa2 gene Proteins 0.000 description 3
- YPQDTQJBOFOTJQ-SXTJYALSSA-N Ile-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N YPQDTQJBOFOTJQ-SXTJYALSSA-N 0.000 description 3
- WIZPFZKOFZXDQG-HTFCKZLJSA-N Ile-Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O WIZPFZKOFZXDQG-HTFCKZLJSA-N 0.000 description 3
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 3
- QGXQHJQPAPMACW-PPCPHDFISA-N Ile-Thr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QGXQHJQPAPMACW-PPCPHDFISA-N 0.000 description 3
- 108090001061 Insulin Proteins 0.000 description 3
- 102000004877 Insulin Human genes 0.000 description 3
- 102000015696 Interleukins Human genes 0.000 description 3
- 108010063738 Interleukins Proteins 0.000 description 3
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 3
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 3
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 3
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 3
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 3
- WXHFZJFZWNCDNB-KKUMJFAQSA-N Leu-Asn-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXHFZJFZWNCDNB-KKUMJFAQSA-N 0.000 description 3
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 3
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 3
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 3
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 3
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 3
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 3
- GNRPTBRHRRZCMA-RWMBFGLXSA-N Leu-Met-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N GNRPTBRHRRZCMA-RWMBFGLXSA-N 0.000 description 3
- HGUUMQWGYCVPKG-DCAQKATOSA-N Leu-Pro-Cys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N HGUUMQWGYCVPKG-DCAQKATOSA-N 0.000 description 3
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 3
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 3
- BGGTYDNTOYRTTR-MEYUZBJRSA-N Leu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(C)C)N)O BGGTYDNTOYRTTR-MEYUZBJRSA-N 0.000 description 3
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 3
- 108060001084 Luciferase Proteins 0.000 description 3
- 239000005089 Luciferase Substances 0.000 description 3
- LOGFVTREOLYCPF-RHYQMDGZSA-N Lys-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN LOGFVTREOLYCPF-RHYQMDGZSA-N 0.000 description 3
- RQILLQOQXLZTCK-KBPBESRZSA-N Lys-Tyr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O RQILLQOQXLZTCK-KBPBESRZSA-N 0.000 description 3
- GAELMDJMQDUDLJ-BQBZGAKWSA-N Met-Ala-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O GAELMDJMQDUDLJ-BQBZGAKWSA-N 0.000 description 3
- VIZLHGTVGKBBKO-AVGNSLFASA-N Met-Arg-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N VIZLHGTVGKBBKO-AVGNSLFASA-N 0.000 description 3
- HLQWFLJOJRFXHO-CIUDSAMLSA-N Met-Glu-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O HLQWFLJOJRFXHO-CIUDSAMLSA-N 0.000 description 3
- DSZFTPCSFVWMKP-DCAQKATOSA-N Met-Ser-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN DSZFTPCSFVWMKP-DCAQKATOSA-N 0.000 description 3
- 241000699670 Mus sp. Species 0.000 description 3
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 3
- 108010066427 N-valyltryptophan Proteins 0.000 description 3
- 102400000058 Neuregulin-1 Human genes 0.000 description 3
- 108091034117 Oligonucleotide Proteins 0.000 description 3
- 241000282579 Pan Species 0.000 description 3
- 108010038512 Platelet-Derived Growth Factor Proteins 0.000 description 3
- 102000010780 Platelet-Derived Growth Factor Human genes 0.000 description 3
- MLQVJYMFASXBGZ-IHRRRGAJSA-N Pro-Asn-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O MLQVJYMFASXBGZ-IHRRRGAJSA-N 0.000 description 3
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 3
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 3
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 3
- FYKUEXMZYFIZKA-DCAQKATOSA-N Pro-Pro-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FYKUEXMZYFIZKA-DCAQKATOSA-N 0.000 description 3
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 3
- KBUAPZAZPWNYSW-SRVKXCTJSA-N Pro-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KBUAPZAZPWNYSW-SRVKXCTJSA-N 0.000 description 3
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 3
- 108700008625 Reporter Genes Proteins 0.000 description 3
- 241000714474 Rous sarcoma virus Species 0.000 description 3
- 206010039710 Scleroderma Diseases 0.000 description 3
- NLQUOHDCLSFABG-GUBZILKMSA-N Ser-Arg-Arg Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NLQUOHDCLSFABG-GUBZILKMSA-N 0.000 description 3
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 3
- OCWWJBZQXGYQCA-DCAQKATOSA-N Ser-Lys-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O OCWWJBZQXGYQCA-DCAQKATOSA-N 0.000 description 3
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 3
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 3
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 3
- HKHCTNFKZXAMIF-KKUMJFAQSA-N Ser-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=C(O)C=C1 HKHCTNFKZXAMIF-KKUMJFAQSA-N 0.000 description 3
- 241000011102 Thera Species 0.000 description 3
- NAXBBCLCEOTAIG-RHYQMDGZSA-N Thr-Arg-Lys Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O NAXBBCLCEOTAIG-RHYQMDGZSA-N 0.000 description 3
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 3
- XDARBNMYXKUFOJ-GSSVUCPTSA-N Thr-Asp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDARBNMYXKUFOJ-GSSVUCPTSA-N 0.000 description 3
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 3
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 3
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 3
- ZSPQUTWLWGWTPS-HJGDQZAQSA-N Thr-Lys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZSPQUTWLWGWTPS-HJGDQZAQSA-N 0.000 description 3
- WFAUDCSNCWJJAA-KXNHARMFSA-N Thr-Lys-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(O)=O WFAUDCSNCWJJAA-KXNHARMFSA-N 0.000 description 3
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 3
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 3
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 3
- QYSBJAUCUKHSLU-JYJNAYRXSA-N Tyr-Arg-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O QYSBJAUCUKHSLU-JYJNAYRXSA-N 0.000 description 3
- 101710095001 Uncharacterized protein in nifU 5'region Proteins 0.000 description 3
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 3
- CVUDMNSZAIZFAE-TUAOUCFPSA-N Val-Arg-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N CVUDMNSZAIZFAE-TUAOUCFPSA-N 0.000 description 3
- CVUDMNSZAIZFAE-UHFFFAOYSA-N Val-Arg-Pro Natural products NC(N)=NCCCC(NC(=O)C(N)C(C)C)C(=O)N1CCCC1C(O)=O CVUDMNSZAIZFAE-UHFFFAOYSA-N 0.000 description 3
- BZWUSZGQOILYEU-STECZYCISA-N Val-Ile-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BZWUSZGQOILYEU-STECZYCISA-N 0.000 description 3
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 3
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 3
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 3
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 3
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 3
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 3
- 108020005202 Viral DNA Proteins 0.000 description 3
- 108010070944 alanylhistidine Proteins 0.000 description 3
- 108010087924 alanylproline Proteins 0.000 description 3
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 3
- 108010068265 aspartyltyrosine Proteins 0.000 description 3
- 230000027455 binding Effects 0.000 description 3
- 108091008324 binding proteins Proteins 0.000 description 3
- 210000000988 bone and bone Anatomy 0.000 description 3
- 210000004899 c-terminal region Anatomy 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 238000002716 delivery method Methods 0.000 description 3
- 230000004069 differentiation Effects 0.000 description 3
- 239000003623 enhancer Substances 0.000 description 3
- 238000011049 filling Methods 0.000 description 3
- 238000009472 formulation Methods 0.000 description 3
- 230000004927 fusion Effects 0.000 description 3
- 238000001415 gene therapy Methods 0.000 description 3
- 102000034356 gene-regulatory proteins Human genes 0.000 description 3
- 108091006104 gene-regulatory proteins Proteins 0.000 description 3
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 3
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 3
- 108010001064 glycyl-glycyl-glycyl-glycine Proteins 0.000 description 3
- 108010015792 glycyllysine Proteins 0.000 description 3
- 239000003102 growth factor Substances 0.000 description 3
- 108010092114 histidylphenylalanine Proteins 0.000 description 3
- 108010085325 histidylproline Proteins 0.000 description 3
- 230000006698 induction Effects 0.000 description 3
- 230000002458 infectious effect Effects 0.000 description 3
- 238000002347 injection Methods 0.000 description 3
- 239000007924 injection Substances 0.000 description 3
- 229940125396 insulin Drugs 0.000 description 3
- 230000003993 interaction Effects 0.000 description 3
- 238000002955 isolation Methods 0.000 description 3
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 3
- 108010060857 isoleucyl-valyl-tyrosine Proteins 0.000 description 3
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 3
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 3
- 108010091871 leucylmethionine Proteins 0.000 description 3
- 210000004962 mammalian cell Anatomy 0.000 description 3
- 108010085203 methionylmethionine Proteins 0.000 description 3
- 102000039446 nucleic acids Human genes 0.000 description 3
- 108020004707 nucleic acids Proteins 0.000 description 3
- 108010018625 phenylalanylarginine Proteins 0.000 description 3
- 238000003752 polymerase chain reaction Methods 0.000 description 3
- 102000040430 polynucleotide Human genes 0.000 description 3
- 108091033319 polynucleotide Proteins 0.000 description 3
- 239000002157 polynucleotide Substances 0.000 description 3
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 3
- 108010031719 prolyl-serine Proteins 0.000 description 3
- 238000000746 purification Methods 0.000 description 3
- 230000003362 replicative effect Effects 0.000 description 3
- 108091008146 restriction endonucleases Proteins 0.000 description 3
- 238000012163 sequencing technique Methods 0.000 description 3
- 210000002966 serum Anatomy 0.000 description 3
- 239000000243 solution Substances 0.000 description 3
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 3
- 238000013518 transcription Methods 0.000 description 3
- 230000035897 transcription Effects 0.000 description 3
- 108010038745 tryptophylglycine Proteins 0.000 description 3
- 210000004881 tumor cell Anatomy 0.000 description 3
- 108010003137 tyrosyltyrosine Proteins 0.000 description 3
- 210000002845 virion Anatomy 0.000 description 3
- CWFMWBHMIMNZLN-NAKRPEOUSA-N (2s)-1-[(2s)-2-[[(2s,3s)-2-amino-3-methylpentanoyl]amino]propanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CWFMWBHMIMNZLN-NAKRPEOUSA-N 0.000 description 2
- JRMDFAKCPRMZKA-UHFFFAOYSA-N 6-n,6-n,2-trimethylacridin-10-ium-3,6-diamine;chloride Chemical compound [Cl-].C1=C(C)C(N)=CC2=NC3=CC([NH+](C)C)=CC=C3C=C21 JRMDFAKCPRMZKA-UHFFFAOYSA-N 0.000 description 2
- 229920001817 Agar Polymers 0.000 description 2
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 2
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 2
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 2
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 2
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 2
- QHASENCZLDHBGX-ONGXEEELSA-N Ala-Gly-Phe Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QHASENCZLDHBGX-ONGXEEELSA-N 0.000 description 2
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 2
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 2
- KMGOBAQSCKTBGD-DLOVCJGASA-N Ala-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CN=CN1 KMGOBAQSCKTBGD-DLOVCJGASA-N 0.000 description 2
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 2
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 2
- LBYMZCVBOKYZNS-CIUDSAMLSA-N Ala-Leu-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O LBYMZCVBOKYZNS-CIUDSAMLSA-N 0.000 description 2
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 2
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 2
- UWIQWPWWZUHBAO-ZLIFDBKOSA-N Ala-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)CC(C)C)C(O)=O)=CNC2=C1 UWIQWPWWZUHBAO-ZLIFDBKOSA-N 0.000 description 2
- AJBVYEYZVYPFCF-CIUDSAMLSA-N Ala-Lys-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O AJBVYEYZVYPFCF-CIUDSAMLSA-N 0.000 description 2
- FUKFQILQFQKHLE-DCAQKATOSA-N Ala-Lys-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O FUKFQILQFQKHLE-DCAQKATOSA-N 0.000 description 2
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 2
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 2
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 2
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 2
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 2
- XQNRANMFRPCFFW-GCJQMDKQSA-N Ala-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C)N)O XQNRANMFRPCFFW-GCJQMDKQSA-N 0.000 description 2
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 2
- AOAKQKVICDWCLB-UWJYBYFXSA-N Ala-Tyr-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AOAKQKVICDWCLB-UWJYBYFXSA-N 0.000 description 2
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 2
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 2
- 102100027211 Albumin Human genes 0.000 description 2
- 108010088751 Albumins Proteins 0.000 description 2
- 241000710929 Alphavirus Species 0.000 description 2
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 2
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 2
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 2
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 2
- XPSGESXVBSQZPL-SRVKXCTJSA-N Arg-Arg-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XPSGESXVBSQZPL-SRVKXCTJSA-N 0.000 description 2
- JGDGLDNAQJJGJI-AVGNSLFASA-N Arg-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N JGDGLDNAQJJGJI-AVGNSLFASA-N 0.000 description 2
- BHSYMWWMVRPCPA-CYDGBPFRSA-N Arg-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCN=C(N)N BHSYMWWMVRPCPA-CYDGBPFRSA-N 0.000 description 2
- XEPSCVXTCUUHDT-AVGNSLFASA-N Arg-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCN=C(N)N XEPSCVXTCUUHDT-AVGNSLFASA-N 0.000 description 2
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 2
- NABSCJGZKWSNHX-RCWTZXSCSA-N Arg-Arg-Thr Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NABSCJGZKWSNHX-RCWTZXSCSA-N 0.000 description 2
- OCOZPTHLDVSFCZ-BPUTZDHNSA-N Arg-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N OCOZPTHLDVSFCZ-BPUTZDHNSA-N 0.000 description 2
- XVLLUZMFSAYKJV-GUBZILKMSA-N Arg-Asp-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XVLLUZMFSAYKJV-GUBZILKMSA-N 0.000 description 2
- DXQIQUIQYAGRCC-CIUDSAMLSA-N Arg-Asp-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)CN=C(N)N DXQIQUIQYAGRCC-CIUDSAMLSA-N 0.000 description 2
- DQNLFLGFZAUIOW-FXQIFTODSA-N Arg-Cys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O DQNLFLGFZAUIOW-FXQIFTODSA-N 0.000 description 2
- AHPWQERCDZTTNB-FXQIFTODSA-N Arg-Cys-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N AHPWQERCDZTTNB-FXQIFTODSA-N 0.000 description 2
- OBFTYSPXDRROQO-SRVKXCTJSA-N Arg-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCN=C(N)N OBFTYSPXDRROQO-SRVKXCTJSA-N 0.000 description 2
- QAODJPUKWNNNRP-DCAQKATOSA-N Arg-Glu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QAODJPUKWNNNRP-DCAQKATOSA-N 0.000 description 2
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 description 2
- YNSGXDWWPCGGQS-YUMQZZPRSA-N Arg-Gly-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O YNSGXDWWPCGGQS-YUMQZZPRSA-N 0.000 description 2
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 2
- MMGCRPZQZWTZTA-IHRRRGAJSA-N Arg-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N MMGCRPZQZWTZTA-IHRRRGAJSA-N 0.000 description 2
- OOIMKQRCPJBGPD-XUXIUFHCSA-N Arg-Ile-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O OOIMKQRCPJBGPD-XUXIUFHCSA-N 0.000 description 2
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 2
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 2
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 2
- LXMKTIZAGIBQRX-HRCADAONSA-N Arg-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O LXMKTIZAGIBQRX-HRCADAONSA-N 0.000 description 2
- BSYKSCBTTQKOJG-GUBZILKMSA-N Arg-Pro-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BSYKSCBTTQKOJG-GUBZILKMSA-N 0.000 description 2
- HGKHPCFTRQDHCU-IUCAKERBSA-N Arg-Pro-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HGKHPCFTRQDHCU-IUCAKERBSA-N 0.000 description 2
- AWMAZIIEFPFHCP-RCWTZXSCSA-N Arg-Pro-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWMAZIIEFPFHCP-RCWTZXSCSA-N 0.000 description 2
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 description 2
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 2
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 2
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 2
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 2
- ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N Asn-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N 0.000 description 2
- ACRYGQFHAQHDSF-ZLUOBGJFSA-N Asn-Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ACRYGQFHAQHDSF-ZLUOBGJFSA-N 0.000 description 2
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 2
- VKCOHFFSTKCXEQ-OLHMAJIHSA-N Asn-Asn-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VKCOHFFSTKCXEQ-OLHMAJIHSA-N 0.000 description 2
- BZMWJLLUAKSIMH-FXQIFTODSA-N Asn-Glu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BZMWJLLUAKSIMH-FXQIFTODSA-N 0.000 description 2
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 2
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 description 2
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 2
- OROMFUQQTSWUTI-IHRRRGAJSA-N Asn-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OROMFUQQTSWUTI-IHRRRGAJSA-N 0.000 description 2
- OOXUBGLNDRGOKT-FXQIFTODSA-N Asn-Ser-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OOXUBGLNDRGOKT-FXQIFTODSA-N 0.000 description 2
- MYTHOBCLNIOFBL-SRVKXCTJSA-N Asn-Ser-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYTHOBCLNIOFBL-SRVKXCTJSA-N 0.000 description 2
- GOPFMQJUQDLUFW-LKXGYXEUSA-N Asn-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O GOPFMQJUQDLUFW-LKXGYXEUSA-N 0.000 description 2
- PIABYSIYPGLLDQ-XVSYOHENSA-N Asn-Thr-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PIABYSIYPGLLDQ-XVSYOHENSA-N 0.000 description 2
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 2
- UXHYOWXTJLBEPG-GSSVUCPTSA-N Asn-Thr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UXHYOWXTJLBEPG-GSSVUCPTSA-N 0.000 description 2
- KZYSHAMXEBPJBD-JRQIVUDYSA-N Asn-Thr-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZYSHAMXEBPJBD-JRQIVUDYSA-N 0.000 description 2
- BLQBMRNMBAYREH-UWJYBYFXSA-N Asp-Ala-Tyr Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O BLQBMRNMBAYREH-UWJYBYFXSA-N 0.000 description 2
- ZLGKHJHFYSRUBH-FXQIFTODSA-N Asp-Arg-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLGKHJHFYSRUBH-FXQIFTODSA-N 0.000 description 2
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 2
- CASGONAXMZPHCK-FXQIFTODSA-N Asp-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N CASGONAXMZPHCK-FXQIFTODSA-N 0.000 description 2
- ZELQAFZSJOBEQS-ACZMJKKPSA-N Asp-Asn-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZELQAFZSJOBEQS-ACZMJKKPSA-N 0.000 description 2
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 2
- CSEJMKNZDCJYGJ-XHNCKOQMSA-N Asp-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O CSEJMKNZDCJYGJ-XHNCKOQMSA-N 0.000 description 2
- HSWYMWGDMPLTTH-FXQIFTODSA-N Asp-Glu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HSWYMWGDMPLTTH-FXQIFTODSA-N 0.000 description 2
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 2
- ZEDBMCPXPIYJLW-XHNCKOQMSA-N Asp-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O ZEDBMCPXPIYJLW-XHNCKOQMSA-N 0.000 description 2
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 2
- JUWZKMBALYLZCK-WHFBIAKZSA-N Asp-Gly-Asn Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O JUWZKMBALYLZCK-WHFBIAKZSA-N 0.000 description 2
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 2
- KHGPWGKPYHPOIK-QWRGUYRKSA-N Asp-Gly-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KHGPWGKPYHPOIK-QWRGUYRKSA-N 0.000 description 2
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 2
- ILQCHXURSRRIRY-YUMQZZPRSA-N Asp-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)O)N ILQCHXURSRRIRY-YUMQZZPRSA-N 0.000 description 2
- TZOZNVLBTAFJRW-UGYAYLCHSA-N Asp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N TZOZNVLBTAFJRW-UGYAYLCHSA-N 0.000 description 2
- AITKTFCQOBRJTG-CIUDSAMLSA-N Asp-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N AITKTFCQOBRJTG-CIUDSAMLSA-N 0.000 description 2
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 2
- JTRDJYIZIKCIRC-AJNGGQMLSA-N Asp-Leu-Leu-Gln Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JTRDJYIZIKCIRC-AJNGGQMLSA-N 0.000 description 2
- JXGJJQJHXHXJQF-CIUDSAMLSA-N Asp-Met-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O JXGJJQJHXHXJQF-CIUDSAMLSA-N 0.000 description 2
- UCHSVZYJKJLPHF-BZSNNMDCSA-N Asp-Phe-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O UCHSVZYJKJLPHF-BZSNNMDCSA-N 0.000 description 2
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 2
- HICVMZCGVFKTPM-BQBZGAKWSA-N Asp-Pro-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HICVMZCGVFKTPM-BQBZGAKWSA-N 0.000 description 2
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 2
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 2
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 2
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 2
- RKXVTTIQNKPCHU-KKHAAJSZSA-N Asp-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O RKXVTTIQNKPCHU-KKHAAJSZSA-N 0.000 description 2
- 241000304886 Bacilli Species 0.000 description 2
- 208000003508 Botulism Diseases 0.000 description 2
- 102000004219 Brain-derived neurotrophic factor Human genes 0.000 description 2
- 108090000715 Brain-derived neurotrophic factor Proteins 0.000 description 2
- 241000589562 Brucella Species 0.000 description 2
- 102100031168 CCN family member 2 Human genes 0.000 description 2
- 108010009575 CD55 Antigens Proteins 0.000 description 2
- 101710132601 Capsid protein Proteins 0.000 description 2
- 206010061041 Chlamydial infection Diseases 0.000 description 2
- 108010035563 Chloramphenicol O-acetyltransferase Proteins 0.000 description 2
- 102000011022 Chorionic Gonadotropin Human genes 0.000 description 2
- 108010062540 Chorionic Gonadotropin Proteins 0.000 description 2
- 108010005939 Ciliary Neurotrophic Factor Proteins 0.000 description 2
- 102100031614 Ciliary neurotrophic factor Human genes 0.000 description 2
- 108010039419 Connective Tissue Growth Factor Proteins 0.000 description 2
- 108091035707 Consensus sequence Proteins 0.000 description 2
- YRJICXCOIBUCRP-CIUDSAMLSA-N Cys-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N YRJICXCOIBUCRP-CIUDSAMLSA-N 0.000 description 2
- UPJGYXRAPJWIHD-CIUDSAMLSA-N Cys-Asn-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UPJGYXRAPJWIHD-CIUDSAMLSA-N 0.000 description 2
- RRJOQIBQVZDVCW-SRVKXCTJSA-N Cys-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N RRJOQIBQVZDVCW-SRVKXCTJSA-N 0.000 description 2
- BLGNLNRBABWDST-CIUDSAMLSA-N Cys-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N BLGNLNRBABWDST-CIUDSAMLSA-N 0.000 description 2
- HBHMVBGGHDMPBF-GARJFASQSA-N Cys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N HBHMVBGGHDMPBF-GARJFASQSA-N 0.000 description 2
- HJXSYJVCMUOUNY-SRVKXCTJSA-N Cys-Ser-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N HJXSYJVCMUOUNY-SRVKXCTJSA-N 0.000 description 2
- ALNKNYKSZPSLBD-ZDLURKLDSA-N Cys-Thr-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ALNKNYKSZPSLBD-ZDLURKLDSA-N 0.000 description 2
- QQAYIVHVRFJICE-AEJSXWLSSA-N Cys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N QQAYIVHVRFJICE-AEJSXWLSSA-N 0.000 description 2
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 2
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 2
- 241000702421 Dependoparvovirus Species 0.000 description 2
- 101150066038 E4 gene Proteins 0.000 description 2
- 238000002965 ELISA Methods 0.000 description 2
- 208000004232 Enteritis Diseases 0.000 description 2
- 241000709661 Enterovirus Species 0.000 description 2
- 102000003951 Erythropoietin Human genes 0.000 description 2
- 108090000394 Erythropoietin Proteins 0.000 description 2
- 241000713800 Feline immunodeficiency virus Species 0.000 description 2
- 241000282326 Felis catus Species 0.000 description 2
- 102000003971 Fibroblast Growth Factor 1 Human genes 0.000 description 2
- 108090000386 Fibroblast Growth Factor 1 Proteins 0.000 description 2
- 102000003974 Fibroblast growth factor 2 Human genes 0.000 description 2
- 108090000379 Fibroblast growth factor 2 Proteins 0.000 description 2
- 102000012673 Follicle Stimulating Hormone Human genes 0.000 description 2
- 108010079345 Follicle Stimulating Hormone Proteins 0.000 description 2
- 101150066002 GFP gene Proteins 0.000 description 2
- 208000005577 Gastroenteritis Diseases 0.000 description 2
- 108091010837 Glial cell line-derived neurotrophic factor Proteins 0.000 description 2
- WUAYFMZULZDSLB-ACZMJKKPSA-N Gln-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O WUAYFMZULZDSLB-ACZMJKKPSA-N 0.000 description 2
- LKUWAWGNJYJODH-KBIXCLLPSA-N Gln-Ala-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKUWAWGNJYJODH-KBIXCLLPSA-N 0.000 description 2
- OIIIRRTWYLCQNW-ACZMJKKPSA-N Gln-Cys-Asn Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O OIIIRRTWYLCQNW-ACZMJKKPSA-N 0.000 description 2
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 2
- MLSKFHLRFVGNLL-WDCWCFNPSA-N Gln-Leu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MLSKFHLRFVGNLL-WDCWCFNPSA-N 0.000 description 2
- LUGUNEGJNDEBLU-DCAQKATOSA-N Gln-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LUGUNEGJNDEBLU-DCAQKATOSA-N 0.000 description 2
- FNAJNWPDTIXYJN-CIUDSAMLSA-N Gln-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O FNAJNWPDTIXYJN-CIUDSAMLSA-N 0.000 description 2
- VNTGPISAOMAXRK-CIUDSAMLSA-N Gln-Pro-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O VNTGPISAOMAXRK-CIUDSAMLSA-N 0.000 description 2
- SYZZMPFLOLSMHL-XHNCKOQMSA-N Gln-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SYZZMPFLOLSMHL-XHNCKOQMSA-N 0.000 description 2
- YMCPEHDGTRUOHO-SXNHZJKMSA-N Gln-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)N)N YMCPEHDGTRUOHO-SXNHZJKMSA-N 0.000 description 2
- BETSEXMYBWCDAE-SZMVWBNQSA-N Gln-Trp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N BETSEXMYBWCDAE-SZMVWBNQSA-N 0.000 description 2
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 2
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 2
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 2
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 2
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 2
- WOSRKEJQESVHGA-CIUDSAMLSA-N Glu-Arg-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O WOSRKEJQESVHGA-CIUDSAMLSA-N 0.000 description 2
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 2
- PAQUJCSYVIBPLC-AVGNSLFASA-N Glu-Asp-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PAQUJCSYVIBPLC-AVGNSLFASA-N 0.000 description 2
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 2
- KLJMRPIBBLTDGE-ACZMJKKPSA-N Glu-Cys-Asn Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O KLJMRPIBBLTDGE-ACZMJKKPSA-N 0.000 description 2
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 2
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 2
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 2
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 2
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 2
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 2
- XMPAXPSENRSOSV-RYUDHWBXSA-N Glu-Gly-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XMPAXPSENRSOSV-RYUDHWBXSA-N 0.000 description 2
- DVLZZEPUNFEUBW-AVGNSLFASA-N Glu-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N DVLZZEPUNFEUBW-AVGNSLFASA-N 0.000 description 2
- GXMXPCXXKVWOSM-KQXIARHKSA-N Glu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N GXMXPCXXKVWOSM-KQXIARHKSA-N 0.000 description 2
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 2
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 2
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 2
- UMHRCVCZUPBBQW-GARJFASQSA-N Glu-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UMHRCVCZUPBBQW-GARJFASQSA-N 0.000 description 2
- JDUKCSSHWNIQQZ-IHRRRGAJSA-N Glu-Phe-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JDUKCSSHWNIQQZ-IHRRRGAJSA-N 0.000 description 2
- YRMZCZIRHYCNHX-RYUDHWBXSA-N Glu-Phe-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O YRMZCZIRHYCNHX-RYUDHWBXSA-N 0.000 description 2
- DXVOKNVIKORTHQ-GUBZILKMSA-N Glu-Pro-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O DXVOKNVIKORTHQ-GUBZILKMSA-N 0.000 description 2
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 2
- UMZHHILWZBFPGL-LOKLDPHHSA-N Glu-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O UMZHHILWZBFPGL-LOKLDPHHSA-N 0.000 description 2
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 2
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 2
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 2
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 2
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 2
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 2
- CLODWIOAKCSBAN-BQBZGAKWSA-N Gly-Arg-Asp Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O CLODWIOAKCSBAN-BQBZGAKWSA-N 0.000 description 2
- OGCIHJPYKVSMTE-YUMQZZPRSA-N Gly-Arg-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O OGCIHJPYKVSMTE-YUMQZZPRSA-N 0.000 description 2
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 2
- UXJHNZODTMHWRD-WHFBIAKZSA-N Gly-Asn-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O UXJHNZODTMHWRD-WHFBIAKZSA-N 0.000 description 2
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 description 2
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 2
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 2
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 2
- FIQQRCFQXGLOSZ-WDSKDSINSA-N Gly-Glu-Asp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FIQQRCFQXGLOSZ-WDSKDSINSA-N 0.000 description 2
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 2
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 2
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 2
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 2
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 2
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 description 2
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 2
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 2
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 2
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 2
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 2
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 2
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 2
- FKYQEVBRZSFAMJ-QWRGUYRKSA-N Gly-Ser-Tyr Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FKYQEVBRZSFAMJ-QWRGUYRKSA-N 0.000 description 2
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 2
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 2
- GWNIGUKSRJBIHX-STQMWFEESA-N Gly-Tyr-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)CN)O GWNIGUKSRJBIHX-STQMWFEESA-N 0.000 description 2
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 2
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 2
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 2
- 102000004269 Granulocyte Colony-Stimulating Factor Human genes 0.000 description 2
- 108010017080 Granulocyte Colony-Stimulating Factor Proteins 0.000 description 2
- 206010018693 Granuloma inguinale Diseases 0.000 description 2
- 108010051696 Growth Hormone Proteins 0.000 description 2
- 239000000095 Growth Hormone-Releasing Hormone Substances 0.000 description 2
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 2
- 108010010234 HDL Lipoproteins Proteins 0.000 description 2
- 102000015779 HDL Lipoproteins Human genes 0.000 description 2
- 206010061192 Haemorrhagic fever Diseases 0.000 description 2
- 101710154606 Hemagglutinin Proteins 0.000 description 2
- 241000700721 Hepatitis B virus Species 0.000 description 2
- 102000003745 Hepatocyte Growth Factor Human genes 0.000 description 2
- 108090000100 Hepatocyte Growth Factor Proteins 0.000 description 2
- 241000238631 Hexapoda Species 0.000 description 2
- 101710155188 Hexon-interlacing protein Proteins 0.000 description 2
- UOAVQQRILDGZEN-SRVKXCTJSA-N His-Asp-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UOAVQQRILDGZEN-SRVKXCTJSA-N 0.000 description 2
- FDQYIRHBVVUTJF-ZETCQYMHSA-N His-Gly-Gly Chemical compound [O-]C(=O)CNC(=O)CNC(=O)[C@@H]([NH3+])CC1=CN=CN1 FDQYIRHBVVUTJF-ZETCQYMHSA-N 0.000 description 2
- PGTISAJTWZPFGN-PEXQALLHSA-N His-Gly-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O PGTISAJTWZPFGN-PEXQALLHSA-N 0.000 description 2
- BXOLYFJYQQRQDJ-MXAVVETBSA-N His-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CN=CN1)N BXOLYFJYQQRQDJ-MXAVVETBSA-N 0.000 description 2
- UPJODPVSKKWGDQ-KLHWPWHYSA-N His-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O UPJODPVSKKWGDQ-KLHWPWHYSA-N 0.000 description 2
- WYKXJGWSJUULSL-AVGNSLFASA-N His-Val-Arg Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](CCCNC(=N)N)C(=O)O WYKXJGWSJUULSL-AVGNSLFASA-N 0.000 description 2
- 108010001336 Horseradish Peroxidase Proteins 0.000 description 2
- QICVAHODWHIWIS-HTFCKZLJSA-N Ile-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N QICVAHODWHIWIS-HTFCKZLJSA-N 0.000 description 2
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 2
- BOTVMTSMOUSDRW-GMOBBJLQSA-N Ile-Arg-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O BOTVMTSMOUSDRW-GMOBBJLQSA-N 0.000 description 2
- DCQMJRSOGCYKTR-GHCJXIJMSA-N Ile-Asp-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DCQMJRSOGCYKTR-GHCJXIJMSA-N 0.000 description 2
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 2
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 2
- LPFBXFILACZHIB-LAEOZQHASA-N Ile-Gly-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)O)N LPFBXFILACZHIB-LAEOZQHASA-N 0.000 description 2
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 2
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 2
- PMMMQRVUMVURGJ-XUXIUFHCSA-N Ile-Leu-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O PMMMQRVUMVURGJ-XUXIUFHCSA-N 0.000 description 2
- IVXJIMGDOYRLQU-XUXIUFHCSA-N Ile-Pro-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O IVXJIMGDOYRLQU-XUXIUFHCSA-N 0.000 description 2
- KTNGVMMGIQWIDV-OSUNSFLBSA-N Ile-Pro-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O KTNGVMMGIQWIDV-OSUNSFLBSA-N 0.000 description 2
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 2
- WXLYNEHOGRYNFU-URLPEUOOSA-N Ile-Thr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N WXLYNEHOGRYNFU-URLPEUOOSA-N 0.000 description 2
- NURNJECQNNCRBK-FLBSBUHZSA-N Ile-Thr-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NURNJECQNNCRBK-FLBSBUHZSA-N 0.000 description 2
- DTPGSUQHUMELQB-GVARAGBVSA-N Ile-Tyr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 DTPGSUQHUMELQB-GVARAGBVSA-N 0.000 description 2
- 102000014150 Interferons Human genes 0.000 description 2
- 108010050904 Interferons Proteins 0.000 description 2
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 2
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 2
- 108010007622 LDL Lipoproteins Proteins 0.000 description 2
- 102000007330 LDL Lipoproteins Human genes 0.000 description 2
- 101710192606 Latent membrane protein 2 Proteins 0.000 description 2
- 241000713666 Lentivirus Species 0.000 description 2
- XIRYQRLFHWWWTC-QEJZJMRPSA-N Leu-Ala-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XIRYQRLFHWWWTC-QEJZJMRPSA-N 0.000 description 2
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 2
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 2
- CNNQBZRGQATKNY-DCAQKATOSA-N Leu-Arg-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N CNNQBZRGQATKNY-DCAQKATOSA-N 0.000 description 2
- FJUKMPUELVROGK-IHRRRGAJSA-N Leu-Arg-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N FJUKMPUELVROGK-IHRRRGAJSA-N 0.000 description 2
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 2
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 2
- WGNOPSQMIQERPK-GARJFASQSA-N Leu-Asn-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N WGNOPSQMIQERPK-GARJFASQSA-N 0.000 description 2
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 2
- FGNQZXKVAZIMCI-CIUDSAMLSA-N Leu-Asp-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N FGNQZXKVAZIMCI-CIUDSAMLSA-N 0.000 description 2
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 2
- IASQBRJGRVXNJI-YUMQZZPRSA-N Leu-Cys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)NCC(O)=O IASQBRJGRVXNJI-YUMQZZPRSA-N 0.000 description 2
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 2
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 2
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 2
- VZBIUJURDLFFOE-IHRRRGAJSA-N Leu-His-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VZBIUJURDLFFOE-IHRRRGAJSA-N 0.000 description 2
- KXODZBLFVFSLAI-AVGNSLFASA-N Leu-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KXODZBLFVFSLAI-AVGNSLFASA-N 0.000 description 2
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 2
- WRLPVDVHNWSSCL-MELADBBJSA-N Leu-His-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N WRLPVDVHNWSSCL-MELADBBJSA-N 0.000 description 2
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 2
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 2
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 2
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 2
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 2
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 2
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 2
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 2
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 2
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 2
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 2
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 2
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 2
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 2
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 2
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 2
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 2
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 2
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 2
- 102000003960 Ligases Human genes 0.000 description 2
- 108090000364 Ligases Proteins 0.000 description 2
- 102000009151 Luteinizing Hormone Human genes 0.000 description 2
- 108010073521 Luteinizing Hormone Proteins 0.000 description 2
- 102000008072 Lymphokines Human genes 0.000 description 2
- 108010074338 Lymphokines Proteins 0.000 description 2
- NTEVEUCLFMWSND-SRVKXCTJSA-N Lys-Arg-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O NTEVEUCLFMWSND-SRVKXCTJSA-N 0.000 description 2
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 2
- NRQRKMYZONPCTM-CIUDSAMLSA-N Lys-Asp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O NRQRKMYZONPCTM-CIUDSAMLSA-N 0.000 description 2
- RLZDUFRBMQNYIJ-YUMQZZPRSA-N Lys-Cys-Gly Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N RLZDUFRBMQNYIJ-YUMQZZPRSA-N 0.000 description 2
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 description 2
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 2
- ISHNZELVUVPCHY-ZETCQYMHSA-N Lys-Gly-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O ISHNZELVUVPCHY-ZETCQYMHSA-N 0.000 description 2
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 2
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 2
- MTBLFIQZECOEBY-IHRRRGAJSA-N Lys-Met-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O MTBLFIQZECOEBY-IHRRRGAJSA-N 0.000 description 2
- PIXVFCBYEGPZPA-JYJNAYRXSA-N Lys-Phe-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N PIXVFCBYEGPZPA-JYJNAYRXSA-N 0.000 description 2
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 2
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 2
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 2
- 101150078498 MYB gene Proteins 0.000 description 2
- 102000050019 Membrane Cofactor Human genes 0.000 description 2
- 101710146216 Membrane cofactor protein Proteins 0.000 description 2
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 2
- DLAFCQWUMFMZSN-GUBZILKMSA-N Met-Arg-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N DLAFCQWUMFMZSN-GUBZILKMSA-N 0.000 description 2
- DBOMZJOESVYERT-GUBZILKMSA-N Met-Asn-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N DBOMZJOESVYERT-GUBZILKMSA-N 0.000 description 2
- FJVJLMZUIGMFFU-BQBZGAKWSA-N Met-Asp-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FJVJLMZUIGMFFU-BQBZGAKWSA-N 0.000 description 2
- OOSPRDCGTLQLBP-NHCYSSNCSA-N Met-Glu-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OOSPRDCGTLQLBP-NHCYSSNCSA-N 0.000 description 2
- IUYCGMNKIZDRQI-BQBZGAKWSA-N Met-Gly-Ala Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O IUYCGMNKIZDRQI-BQBZGAKWSA-N 0.000 description 2
- ULLIQRYQNMAAHC-RWMBFGLXSA-N Met-His-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N ULLIQRYQNMAAHC-RWMBFGLXSA-N 0.000 description 2
- HZLSUXCMSIBCRV-RVMXOQNASA-N Met-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N HZLSUXCMSIBCRV-RVMXOQNASA-N 0.000 description 2
- IRVONVRHHJXWTK-RWMBFGLXSA-N Met-Lys-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N IRVONVRHHJXWTK-RWMBFGLXSA-N 0.000 description 2
- WTHGNAAQXISJHP-AVGNSLFASA-N Met-Lys-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WTHGNAAQXISJHP-AVGNSLFASA-N 0.000 description 2
- OXIWIYOJVNOKOV-SRVKXCTJSA-N Met-Met-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CCCNC(N)=N OXIWIYOJVNOKOV-SRVKXCTJSA-N 0.000 description 2
- JOYFULUKJRJCSX-IUCAKERBSA-N Met-Met-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O JOYFULUKJRJCSX-IUCAKERBSA-N 0.000 description 2
- MPCKIRSXNKACRF-GUBZILKMSA-N Met-Pro-Asn Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O MPCKIRSXNKACRF-GUBZILKMSA-N 0.000 description 2
- JHVNNUIQXOGAHI-KJEVXHAQSA-N Met-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCSC)N)O JHVNNUIQXOGAHI-KJEVXHAQSA-N 0.000 description 2
- QAVZUKIPOMBLMC-AVGNSLFASA-N Met-Val-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C QAVZUKIPOMBLMC-AVGNSLFASA-N 0.000 description 2
- 101710081079 Minor spike protein H Proteins 0.000 description 2
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 2
- 108010079364 N-glycylalanine Proteins 0.000 description 2
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 2
- 108010047562 NGR peptide Proteins 0.000 description 2
- 108010025020 Nerve Growth Factor Proteins 0.000 description 2
- 108090000556 Neuregulin-1 Proteins 0.000 description 2
- 108700020796 Oncogene Proteins 0.000 description 2
- 102000043276 Oncogene Human genes 0.000 description 2
- 241000713112 Orthobunyavirus Species 0.000 description 2
- 241000150452 Orthohantavirus Species 0.000 description 2
- 101710093908 Outer capsid protein VP4 Proteins 0.000 description 2
- 101710135467 Outer capsid protein sigma-1 Proteins 0.000 description 2
- 102000003982 Parathyroid hormone Human genes 0.000 description 2
- 108090000445 Parathyroid hormone Proteins 0.000 description 2
- UHRNIXJAGGLKHP-DLOVCJGASA-N Phe-Ala-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O UHRNIXJAGGLKHP-DLOVCJGASA-N 0.000 description 2
- AYPMIIKUMNADSU-IHRRRGAJSA-N Phe-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AYPMIIKUMNADSU-IHRRRGAJSA-N 0.000 description 2
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 2
- IILUKIJNFMUBNF-IHRRRGAJSA-N Phe-Gln-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O IILUKIJNFMUBNF-IHRRRGAJSA-N 0.000 description 2
- BFYHIHGIHGROAT-HTUGSXCWSA-N Phe-Glu-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFYHIHGIHGROAT-HTUGSXCWSA-N 0.000 description 2
- YCCUXNNKXDGMAM-KKUMJFAQSA-N Phe-Leu-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YCCUXNNKXDGMAM-KKUMJFAQSA-N 0.000 description 2
- SRILZRSXIKRGBF-HRCADAONSA-N Phe-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N SRILZRSXIKRGBF-HRCADAONSA-N 0.000 description 2
- KAJLHCWRWDSROH-BZSNNMDCSA-N Phe-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 KAJLHCWRWDSROH-BZSNNMDCSA-N 0.000 description 2
- GPLWGAYGROGDEN-BZSNNMDCSA-N Phe-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GPLWGAYGROGDEN-BZSNNMDCSA-N 0.000 description 2
- QARPMYDMYVLFMW-KKUMJFAQSA-N Phe-Pro-Glu Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 QARPMYDMYVLFMW-KKUMJFAQSA-N 0.000 description 2
- XNMYNGDKJNOKHH-BZSNNMDCSA-N Phe-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XNMYNGDKJNOKHH-BZSNNMDCSA-N 0.000 description 2
- 102000012288 Phosphopyruvate Hydratase Human genes 0.000 description 2
- 108010022181 Phosphopyruvate Hydratase Proteins 0.000 description 2
- 108010004729 Phycoerythrin Proteins 0.000 description 2
- 101000777480 Phyllodiscus semoni DELTA-alicitoxin-Pse1b Proteins 0.000 description 2
- 241000709664 Picornaviridae Species 0.000 description 2
- VXCHGLYSIOOZIS-GUBZILKMSA-N Pro-Ala-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 VXCHGLYSIOOZIS-GUBZILKMSA-N 0.000 description 2
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 2
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 2
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 2
- HFZNNDWPHBRNPV-KZVJFYERSA-N Pro-Ala-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HFZNNDWPHBRNPV-KZVJFYERSA-N 0.000 description 2
- LNLNHXIQPGKRJQ-SRVKXCTJSA-N Pro-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 LNLNHXIQPGKRJQ-SRVKXCTJSA-N 0.000 description 2
- WECYCNFPGZLOOU-FXQIFTODSA-N Pro-Asn-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O WECYCNFPGZLOOU-FXQIFTODSA-N 0.000 description 2
- UTAUEDINXUMHLG-FXQIFTODSA-N Pro-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 UTAUEDINXUMHLG-FXQIFTODSA-N 0.000 description 2
- GDXZRWYXJSGWIV-GMOBBJLQSA-N Pro-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 GDXZRWYXJSGWIV-GMOBBJLQSA-N 0.000 description 2
- SFECXGVELZFBFJ-VEVYYDQMSA-N Pro-Asp-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFECXGVELZFBFJ-VEVYYDQMSA-N 0.000 description 2
- LANQLYHLMYDWJP-SRVKXCTJSA-N Pro-Gln-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O LANQLYHLMYDWJP-SRVKXCTJSA-N 0.000 description 2
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 2
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 2
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 2
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 2
- FKVNLUZHSFCNGY-RVMXOQNASA-N Pro-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 FKVNLUZHSFCNGY-RVMXOQNASA-N 0.000 description 2
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 2
- CLJLVCYFABNTHP-DCAQKATOSA-N Pro-Leu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O CLJLVCYFABNTHP-DCAQKATOSA-N 0.000 description 2
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 2
- QCMYJBKTMIWZAP-AVGNSLFASA-N Pro-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 QCMYJBKTMIWZAP-AVGNSLFASA-N 0.000 description 2
- KDBHVPXBQADZKY-GUBZILKMSA-N Pro-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KDBHVPXBQADZKY-GUBZILKMSA-N 0.000 description 2
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 2
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 description 2
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 2
- 101710118538 Protease Proteins 0.000 description 2
- 101710176177 Protein A56 Proteins 0.000 description 2
- 108010001267 Protein Subunits Proteins 0.000 description 2
- 102000002067 Protein Subunits Human genes 0.000 description 2
- 101710149136 Protein Vpr Proteins 0.000 description 2
- 201000004681 Psoriasis Diseases 0.000 description 2
- 206010037688 Q fever Diseases 0.000 description 2
- 241000702263 Reovirus sp. Species 0.000 description 2
- 101100368917 Schizosaccharomyces pombe (strain 972 / ATCC 24843) taz1 gene Proteins 0.000 description 2
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 2
- HBOABDXGTMMDSE-GUBZILKMSA-N Ser-Arg-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O HBOABDXGTMMDSE-GUBZILKMSA-N 0.000 description 2
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 2
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 2
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 2
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 2
- YPUSXTWURJANKF-KBIXCLLPSA-N Ser-Gln-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YPUSXTWURJANKF-KBIXCLLPSA-N 0.000 description 2
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 2
- QKQDTEYDEIJPNK-GUBZILKMSA-N Ser-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CO QKQDTEYDEIJPNK-GUBZILKMSA-N 0.000 description 2
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 2
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 2
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 2
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 2
- ZUDXUJSYCCNZQJ-DCAQKATOSA-N Ser-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N ZUDXUJSYCCNZQJ-DCAQKATOSA-N 0.000 description 2
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 2
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 2
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 2
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 2
- IAORETPTUDBBGV-CIUDSAMLSA-N Ser-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N IAORETPTUDBBGV-CIUDSAMLSA-N 0.000 description 2
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 2
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 2
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 2
- ZSLFCBHEINFXRS-LPEHRKFASA-N Ser-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ZSLFCBHEINFXRS-LPEHRKFASA-N 0.000 description 2
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 2
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 2
- DINQYZRMXGWWTG-GUBZILKMSA-N Ser-Pro-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DINQYZRMXGWWTG-GUBZILKMSA-N 0.000 description 2
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 2
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 2
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 2
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 2
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 2
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 2
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 2
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 2
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 2
- STIAINRLUUKYKM-WFBYXXMGSA-N Ser-Trp-Ala Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CO)=CNC2=C1 STIAINRLUUKYKM-WFBYXXMGSA-N 0.000 description 2
- PLQWGQUNUPMNOD-KKUMJFAQSA-N Ser-Tyr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PLQWGQUNUPMNOD-KKUMJFAQSA-N 0.000 description 2
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 2
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 2
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 2
- 102000004446 Serum Response Factor Human genes 0.000 description 2
- 108010042291 Serum Response Factor Proteins 0.000 description 2
- 208000001203 Smallpox Diseases 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 102100022831 Somatoliberin Human genes 0.000 description 2
- 101710142969 Somatoliberin Proteins 0.000 description 2
- 102100038803 Somatotropin Human genes 0.000 description 2
- 101710109576 Terminal protein Proteins 0.000 description 2
- MQCPGOZXFSYJPS-KZVJFYERSA-N Thr-Ala-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MQCPGOZXFSYJPS-KZVJFYERSA-N 0.000 description 2
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 2
- JMZKMSTYXHFYAK-VEVYYDQMSA-N Thr-Arg-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O JMZKMSTYXHFYAK-VEVYYDQMSA-N 0.000 description 2
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 2
- SWIKDOUVROTZCW-GCJQMDKQSA-N Thr-Asn-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O SWIKDOUVROTZCW-GCJQMDKQSA-N 0.000 description 2
- VIBXMCZWVUOZLA-OLHMAJIHSA-N Thr-Asn-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O VIBXMCZWVUOZLA-OLHMAJIHSA-N 0.000 description 2
- QGXCWPNQVCYJEL-NUMRIWBASA-N Thr-Asn-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGXCWPNQVCYJEL-NUMRIWBASA-N 0.000 description 2
- PQLXHSACXPGWPD-GSSVUCPTSA-N Thr-Asn-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PQLXHSACXPGWPD-GSSVUCPTSA-N 0.000 description 2
- JXKMXEBNZCKSDY-JIOCBJNQSA-N Thr-Asp-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O JXKMXEBNZCKSDY-JIOCBJNQSA-N 0.000 description 2
- LGNBRHZANHMZHK-NUMRIWBASA-N Thr-Glu-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O LGNBRHZANHMZHK-NUMRIWBASA-N 0.000 description 2
- KBLYJPQSNGTDIU-LOKLDPHHSA-N Thr-Glu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O KBLYJPQSNGTDIU-LOKLDPHHSA-N 0.000 description 2
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 2
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 2
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 2
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 2
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 2
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 2
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 2
- ISLDRLHVPXABBC-IEGACIPQSA-N Thr-Leu-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ISLDRLHVPXABBC-IEGACIPQSA-N 0.000 description 2
- PZSDPRBZINDEJV-HTUGSXCWSA-N Thr-Phe-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O PZSDPRBZINDEJV-HTUGSXCWSA-N 0.000 description 2
- ABWNZPOIUJMNKT-IXOXFDKPSA-N Thr-Phe-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O ABWNZPOIUJMNKT-IXOXFDKPSA-N 0.000 description 2
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 2
- YGZWVPBHYABGLT-KJEVXHAQSA-N Thr-Pro-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YGZWVPBHYABGLT-KJEVXHAQSA-N 0.000 description 2
- XHWCDRUPDNSDAZ-XKBZYTNZSA-N Thr-Ser-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O XHWCDRUPDNSDAZ-XKBZYTNZSA-N 0.000 description 2
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 2
- CSZFFQBUTMGHAH-UAXMHLISSA-N Thr-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O CSZFFQBUTMGHAH-UAXMHLISSA-N 0.000 description 2
- PELIQFPESHBTMA-WLTAIBSBSA-N Thr-Tyr-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 PELIQFPESHBTMA-WLTAIBSBSA-N 0.000 description 2
- XVHAUVJXBFGUPC-RPTUDFQQSA-N Thr-Tyr-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XVHAUVJXBFGUPC-RPTUDFQQSA-N 0.000 description 2
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 2
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 2
- 102000036693 Thrombopoietin Human genes 0.000 description 2
- 108010041111 Thrombopoietin Proteins 0.000 description 2
- 239000007983 Tris buffer Substances 0.000 description 2
- UKINEYBQXPMOJO-UBHSHLNASA-N Trp-Asn-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N UKINEYBQXPMOJO-UBHSHLNASA-N 0.000 description 2
- OBAMASZCXDIXSS-SZMVWBNQSA-N Trp-Glu-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N OBAMASZCXDIXSS-SZMVWBNQSA-N 0.000 description 2
- DDHFMBDACJYSKW-AQZXSJQPSA-N Trp-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O DDHFMBDACJYSKW-AQZXSJQPSA-N 0.000 description 2
- RKISDJMICOREEL-QRTARXTBSA-N Trp-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RKISDJMICOREEL-QRTARXTBSA-N 0.000 description 2
- 208000034784 Tularaemia Diseases 0.000 description 2
- 206010067584 Type 1 diabetes mellitus Diseases 0.000 description 2
- JONPRIHUYSPIMA-UWJYBYFXSA-N Tyr-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JONPRIHUYSPIMA-UWJYBYFXSA-N 0.000 description 2
- RCLOWEZASFJFEX-KKUMJFAQSA-N Tyr-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RCLOWEZASFJFEX-KKUMJFAQSA-N 0.000 description 2
- NRFTYDWKWGJLAR-MELADBBJSA-N Tyr-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O NRFTYDWKWGJLAR-MELADBBJSA-N 0.000 description 2
- WVRUKYLYMFGKAN-IHRRRGAJSA-N Tyr-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 WVRUKYLYMFGKAN-IHRRRGAJSA-N 0.000 description 2
- HIINQLBHPIQYHN-JTQLQIEISA-N Tyr-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HIINQLBHPIQYHN-JTQLQIEISA-N 0.000 description 2
- USYGMBIIUDLYHJ-GVARAGBVSA-N Tyr-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 USYGMBIIUDLYHJ-GVARAGBVSA-N 0.000 description 2
- MVFQLSPDMMFCMW-KKUMJFAQSA-N Tyr-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O MVFQLSPDMMFCMW-KKUMJFAQSA-N 0.000 description 2
- BBSPTGPYIPGTKH-JYJNAYRXSA-N Tyr-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N BBSPTGPYIPGTKH-JYJNAYRXSA-N 0.000 description 2
- VPEFOFYNHBWFNQ-UFYCRDLUSA-N Tyr-Pro-Tyr Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 VPEFOFYNHBWFNQ-UFYCRDLUSA-N 0.000 description 2
- XYBNMHRFAUKPAW-IHRRRGAJSA-N Tyr-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XYBNMHRFAUKPAW-IHRRRGAJSA-N 0.000 description 2
- BIVIUZRBCAUNPW-JRQIVUDYSA-N Tyr-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O BIVIUZRBCAUNPW-JRQIVUDYSA-N 0.000 description 2
- CLEGSEJVGBYZBJ-MEYUZBJRSA-N Tyr-Thr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CLEGSEJVGBYZBJ-MEYUZBJRSA-N 0.000 description 2
- AOIZTZRWMSPPAY-KAOXEZKKSA-N Tyr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)O AOIZTZRWMSPPAY-KAOXEZKKSA-N 0.000 description 2
- OBKOPLHSRDATFO-XHSDSOJGSA-N Tyr-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OBKOPLHSRDATFO-XHSDSOJGSA-N 0.000 description 2
- 108010062497 VLDL Lipoproteins Proteins 0.000 description 2
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 2
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 2
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 2
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 2
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 2
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 2
- XEYUMGGWQCIWAR-XVKPBYJWSA-N Val-Gln-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N XEYUMGGWQCIWAR-XVKPBYJWSA-N 0.000 description 2
- AGKDVLSDNSTLFA-UMNHJUIQSA-N Val-Gln-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N AGKDVLSDNSTLFA-UMNHJUIQSA-N 0.000 description 2
- PWRITNSESKQTPW-NRPADANISA-N Val-Gln-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N PWRITNSESKQTPW-NRPADANISA-N 0.000 description 2
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 2
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 2
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 2
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 2
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 2
- WBAJDGWKRIHOAC-GVXVVHGQSA-N Val-Lys-Gln Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O WBAJDGWKRIHOAC-GVXVVHGQSA-N 0.000 description 2
- WMRWZYSRQUORHJ-YDHLFZDLSA-N Val-Phe-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WMRWZYSRQUORHJ-YDHLFZDLSA-N 0.000 description 2
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 2
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 2
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 2
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 2
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 2
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 2
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 2
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 2
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 2
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 2
- 241000700647 Variola virus Species 0.000 description 2
- 108010073929 Vascular Endothelial Growth Factor A Proteins 0.000 description 2
- 102000005789 Vascular Endothelial Growth Factors Human genes 0.000 description 2
- 108010019530 Vascular Endothelial Growth Factors Proteins 0.000 description 2
- 101710201961 Virion infectivity factor Proteins 0.000 description 2
- 208000003152 Yellow Fever Diseases 0.000 description 2
- 241000607479 Yersinia pestis Species 0.000 description 2
- 239000008272 agar Substances 0.000 description 2
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 2
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 2
- 108010004469 allophycocyanin Proteins 0.000 description 2
- 102000013529 alpha-Fetoproteins Human genes 0.000 description 2
- 108010026331 alpha-Fetoproteins Proteins 0.000 description 2
- 125000000539 amino acid group Chemical group 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 230000003302 anti-idiotype Effects 0.000 description 2
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 2
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 2
- 108010084758 arginyl-tyrosyl-aspartic acid Proteins 0.000 description 2
- 108010036533 arginylvaline Proteins 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 239000003124 biologic agent Substances 0.000 description 2
- 230000033228 biological regulation Effects 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 229940077737 brain-derived neurotrophic factor Drugs 0.000 description 2
- 210000004900 c-terminal fragment Anatomy 0.000 description 2
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 description 2
- 238000012411 cloning technique Methods 0.000 description 2
- 238000004590 computer program Methods 0.000 description 2
- 238000010168 coupling process Methods 0.000 description 2
- 230000009260 cross reactivity Effects 0.000 description 2
- 108010060199 cysteinylproline Proteins 0.000 description 2
- 230000000120 cytopathologic effect Effects 0.000 description 2
- 210000001151 cytotoxic T lymphocyte Anatomy 0.000 description 2
- 230000003111 delayed effect Effects 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 2
- 206010013023 diphtheria Diseases 0.000 description 2
- 239000003937 drug carrier Substances 0.000 description 2
- 239000000975 dye Substances 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 230000008030 elimination Effects 0.000 description 2
- 238000003379 elimination reaction Methods 0.000 description 2
- 108010078428 env Gene Products Proteins 0.000 description 2
- 229940105423 erythropoietin Drugs 0.000 description 2
- 210000003527 eukaryotic cell Anatomy 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- 210000002950 fibroblast Anatomy 0.000 description 2
- MHMNJMPURVTYEJ-UHFFFAOYSA-N fluorescein-5-isothiocyanate Chemical compound O1C(=O)C2=CC(N=C=S)=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 MHMNJMPURVTYEJ-UHFFFAOYSA-N 0.000 description 2
- 239000007850 fluorescent dye Substances 0.000 description 2
- 229940028334 follicle stimulating hormone Drugs 0.000 description 2
- BRZYSWJRSDMWLG-CAXSIQPQSA-N geneticin Chemical compound O1C[C@@](O)(C)[C@H](NC)[C@@H](O)[C@H]1O[C@@H]1[C@@H](O)[C@H](O[C@@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](C(C)O)O2)N)[C@@H](N)C[C@H]1N BRZYSWJRSDMWLG-CAXSIQPQSA-N 0.000 description 2
- 239000008103 glucose Substances 0.000 description 2
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 2
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 2
- 108010077515 glycylproline Proteins 0.000 description 2
- 239000000122 growth hormone Substances 0.000 description 2
- 239000001963 growth medium Substances 0.000 description 2
- 210000003494 hepatocyte Anatomy 0.000 description 2
- 108010040030 histidinoalanine Proteins 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- 229940088597 hormone Drugs 0.000 description 2
- 239000005556 hormone Substances 0.000 description 2
- 229940084986 human chorionic gonadotropin Drugs 0.000 description 2
- 230000003463 hyperproliferative effect Effects 0.000 description 2
- 230000002401 inhibitory effect Effects 0.000 description 2
- 229940047122 interleukins Drugs 0.000 description 2
- 108010076756 leucyl-alanyl-phenylalanine Proteins 0.000 description 2
- 108010000761 leucylarginine Proteins 0.000 description 2
- 239000002502 liposome Substances 0.000 description 2
- 210000004185 liver Anatomy 0.000 description 2
- 229940040129 luteinizing hormone Drugs 0.000 description 2
- 108010009298 lysylglutamic acid Proteins 0.000 description 2
- 108010017391 lysylvaline Proteins 0.000 description 2
- 230000035800 maturation Effects 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 2
- 108010063431 methionyl-aspartyl-glycine Proteins 0.000 description 2
- 238000000386 microscopy Methods 0.000 description 2
- 230000003278 mimic effect Effects 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 210000003205 muscle Anatomy 0.000 description 2
- 210000004898 n-terminal fragment Anatomy 0.000 description 2
- 201000009240 nasopharyngitis Diseases 0.000 description 2
- 229940053128 nerve growth factor Drugs 0.000 description 2
- 210000002569 neuron Anatomy 0.000 description 2
- 239000000199 parathyroid hormone Substances 0.000 description 2
- 229960001319 parathyroid hormone Drugs 0.000 description 2
- 238000010647 peptide synthesis reaction Methods 0.000 description 2
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 2
- 108010047079 phenylalanyl-leucyl-arginyl-phenylalanine Proteins 0.000 description 2
- 108010012581 phenylalanylglutamate Proteins 0.000 description 2
- 108010073101 phenylalanylleucine Proteins 0.000 description 2
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 2
- 108010079892 phosphoglycerol kinase Proteins 0.000 description 2
- 230000035790 physiological processes and functions Effects 0.000 description 2
- 230000008488 polyadenylation Effects 0.000 description 2
- OXCMYAYHXIHQOA-UHFFFAOYSA-N potassium;[2-butyl-5-chloro-3-[[4-[2-(1,2,4-triaza-3-azanidacyclopenta-1,4-dien-5-yl)phenyl]phenyl]methyl]imidazol-4-yl]methanol Chemical compound [K+].CCCCC1=NC(Cl)=C(CO)N1CC1=CC=C(C=2C(=CC=CC=2)C2=N[N-]N=N2)C=C1 OXCMYAYHXIHQOA-UHFFFAOYSA-N 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 108010015796 prolylisoleucine Proteins 0.000 description 2
- 230000000644 propagated effect Effects 0.000 description 2
- 229940021993 prophylactic vaccine Drugs 0.000 description 2
- 238000003127 radioimmunoassay Methods 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 230000004044 response Effects 0.000 description 2
- 238000010187 selection method Methods 0.000 description 2
- 238000002864 sequence alignment Methods 0.000 description 2
- 239000007790 solid phase Substances 0.000 description 2
- 238000010561 standard procedure Methods 0.000 description 2
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 2
- 241001147422 tick-borne encephalitis virus group Species 0.000 description 2
- 230000002103 transcriptional effect Effects 0.000 description 2
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 2
- 230000010415 tropism Effects 0.000 description 2
- 108010058119 tryptophyl-glycyl-glycine Proteins 0.000 description 2
- 108010029384 tryptophyl-histidine Proteins 0.000 description 2
- 108010044292 tryptophyltyrosine Proteins 0.000 description 2
- 201000008827 tuberculosis Diseases 0.000 description 2
- 208000035408 type 1 diabetes mellitus 1 Diseases 0.000 description 2
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 2
- 108010051110 tyrosyl-lysine Proteins 0.000 description 2
- 241001430294 unidentified retrovirus Species 0.000 description 2
- UBWXUGDQUBIEIZ-UHFFFAOYSA-N (13-methyl-3-oxo-2,6,7,8,9,10,11,12,14,15,16,17-dodecahydro-1h-cyclopenta[a]phenanthren-17-yl) 3-phenylpropanoate Chemical compound CC12CCC(C3CCC(=O)C=C3CC3)C3C1CCC2OC(=O)CCC1=CC=CC=C1 UBWXUGDQUBIEIZ-UHFFFAOYSA-N 0.000 description 1
- ZEIYPKQQLSUPOT-QORCZRPOSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-3-phenylpropanoic acid Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 ZEIYPKQQLSUPOT-QORCZRPOSA-N 0.000 description 1
- XYWBPLHHAZLXAI-ASHKBJFXSA-N (2s)-2-[[(2s)-2-[[(2s)-4-amino-2-[[(2s)-2-amino-3-methylbutanoyl]amino]-4-oxobutanoyl]amino]-3-carboxypropanoyl]amino]-4-methylpentanoic acid Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)C(C)C XYWBPLHHAZLXAI-ASHKBJFXSA-N 0.000 description 1
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 1
- PIDRBUDUWHBYSR-UHFFFAOYSA-N 1-[2-[[2-[(2-amino-4-methylpentanoyl)amino]-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O PIDRBUDUWHBYSR-UHFFFAOYSA-N 0.000 description 1
- XKZQKPRCPNGNFR-UHFFFAOYSA-N 2-(3-hydroxyphenyl)phenol Chemical compound OC1=CC=CC(C=2C(=CC=CC=2)O)=C1 XKZQKPRCPNGNFR-UHFFFAOYSA-N 0.000 description 1
- PPINMSZPTPRQQB-NHCYSSNCSA-N 2-[[(2s)-1-[(2s)-2-[[(2s)-2-amino-3-methylbutanoyl]amino]propanoyl]pyrrolidine-2-carbonyl]amino]acetic acid Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PPINMSZPTPRQQB-NHCYSSNCSA-N 0.000 description 1
- YRNWIFYIFSBPAU-UHFFFAOYSA-N 4-[4-(dimethylamino)phenyl]-n,n-dimethylaniline Chemical compound C1=CC(N(C)C)=CC=C1C1=CC=C(N(C)C)C=C1 YRNWIFYIFSBPAU-UHFFFAOYSA-N 0.000 description 1
- 102100031126 6-phosphogluconolactonase Human genes 0.000 description 1
- 108010029731 6-phosphogluconolactonase Proteins 0.000 description 1
- 101150079978 AGRN gene Proteins 0.000 description 1
- 101000621943 Acholeplasma phage L2 Probable integrase/recombinase Proteins 0.000 description 1
- 108010059616 Activins Proteins 0.000 description 1
- 102000005606 Activins Human genes 0.000 description 1
- ZKHQWZAMYRWXGA-KQYNXXCUSA-N Adenosine triphosphate Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)[C@H]1O ZKHQWZAMYRWXGA-KQYNXXCUSA-N 0.000 description 1
- 108010027410 Adenovirus E3 Proteins Proteins 0.000 description 1
- 206010067484 Adverse reaction Diseases 0.000 description 1
- 102100040026 Agrin Human genes 0.000 description 1
- 108700019743 Agrin Proteins 0.000 description 1
- SBGXWWCLHIOABR-UHFFFAOYSA-N Ala Ala Gly Ala Chemical compound CC(N)C(=O)NC(C)C(=O)NCC(=O)NC(C)C(O)=O SBGXWWCLHIOABR-UHFFFAOYSA-N 0.000 description 1
- WQVFQXXBNHHPLX-ZKWXMUAHSA-N Ala-Ala-His Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O WQVFQXXBNHHPLX-ZKWXMUAHSA-N 0.000 description 1
- SSSROGPPPVTHLX-FXQIFTODSA-N Ala-Arg-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSROGPPPVTHLX-FXQIFTODSA-N 0.000 description 1
- QDRGPQWIVZNJQD-CIUDSAMLSA-N Ala-Arg-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QDRGPQWIVZNJQD-CIUDSAMLSA-N 0.000 description 1
- IMMKUCQIKKXKNP-DCAQKATOSA-N Ala-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCN=C(N)N IMMKUCQIKKXKNP-DCAQKATOSA-N 0.000 description 1
- JAMAWBXXKFGFGX-KZVJFYERSA-N Ala-Arg-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JAMAWBXXKFGFGX-KZVJFYERSA-N 0.000 description 1
- YAXNATKKPOWVCP-ZLUOBGJFSA-N Ala-Asn-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O YAXNATKKPOWVCP-ZLUOBGJFSA-N 0.000 description 1
- PXKLCFFSVLKOJM-ACZMJKKPSA-N Ala-Asn-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXKLCFFSVLKOJM-ACZMJKKPSA-N 0.000 description 1
- STACJSVFHSEZJV-GHCJXIJMSA-N Ala-Asn-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STACJSVFHSEZJV-GHCJXIJMSA-N 0.000 description 1
- FXKNPWNXPQZLES-ZLUOBGJFSA-N Ala-Asn-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FXKNPWNXPQZLES-ZLUOBGJFSA-N 0.000 description 1
- GORKKVHIBWAQHM-GCJQMDKQSA-N Ala-Asn-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GORKKVHIBWAQHM-GCJQMDKQSA-N 0.000 description 1
- ZIBWKCRKNFYTPT-ZKWXMUAHSA-N Ala-Asn-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZIBWKCRKNFYTPT-ZKWXMUAHSA-N 0.000 description 1
- MCKSLROAGSDNFC-ACZMJKKPSA-N Ala-Asp-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MCKSLROAGSDNFC-ACZMJKKPSA-N 0.000 description 1
- VIGKUFXFTPWYER-BIIVOSGPSA-N Ala-Cys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N VIGKUFXFTPWYER-BIIVOSGPSA-N 0.000 description 1
- YEELWQSXYBJVSV-UWJYBYFXSA-N Ala-Cys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YEELWQSXYBJVSV-UWJYBYFXSA-N 0.000 description 1
- NKJBKNVQHBZUIX-ACZMJKKPSA-N Ala-Gln-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKJBKNVQHBZUIX-ACZMJKKPSA-N 0.000 description 1
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 1
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 1
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- BGNLUHXLSAQYRQ-FXQIFTODSA-N Ala-Glu-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BGNLUHXLSAQYRQ-FXQIFTODSA-N 0.000 description 1
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 1
- YEVZMOUUZINZCK-LKTVYLICSA-N Ala-Glu-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O YEVZMOUUZINZCK-LKTVYLICSA-N 0.000 description 1
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 1
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 1
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 1
- LJFNNUBZSZCZFN-WHFBIAKZSA-N Ala-Gly-Cys Chemical compound N[C@@H](C)C(=O)NCC(=O)N[C@@H](CS)C(=O)O LJFNNUBZSZCZFN-WHFBIAKZSA-N 0.000 description 1
- JDIQCVUDDFENPU-ZKWXMUAHSA-N Ala-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CNC=N1 JDIQCVUDDFENPU-ZKWXMUAHSA-N 0.000 description 1
- SHKGHIFSEAGTNL-DLOVCJGASA-N Ala-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 SHKGHIFSEAGTNL-DLOVCJGASA-N 0.000 description 1
- HUUOZYZWNCXTFK-INTQDDNPSA-N Ala-His-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N HUUOZYZWNCXTFK-INTQDDNPSA-N 0.000 description 1
- CBCCCLMNOBLBSC-XVYDVKMFSA-N Ala-His-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CBCCCLMNOBLBSC-XVYDVKMFSA-N 0.000 description 1
- GRIFPSOFWFIICX-GOPGUHFVSA-N Ala-His-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O GRIFPSOFWFIICX-GOPGUHFVSA-N 0.000 description 1
- HJGZVLLLBJLXFC-LSJOCFKGSA-N Ala-His-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O HJGZVLLLBJLXFC-LSJOCFKGSA-N 0.000 description 1
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 1
- IFKQPMZRDQZSHI-GHCJXIJMSA-N Ala-Ile-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O IFKQPMZRDQZSHI-GHCJXIJMSA-N 0.000 description 1
- GSHKMNKPMLXSQW-KBIXCLLPSA-N Ala-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C)N GSHKMNKPMLXSQW-KBIXCLLPSA-N 0.000 description 1
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 1
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 1
- NMXKFWOEASXOGB-QSFUFRPTSA-N Ala-Ile-His Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NMXKFWOEASXOGB-QSFUFRPTSA-N 0.000 description 1
- CFPQUJZTLUQUTJ-HTFCKZLJSA-N Ala-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@H](C)N CFPQUJZTLUQUTJ-HTFCKZLJSA-N 0.000 description 1
- QCTFKEJEIMPOLW-JURCDPSOSA-N Ala-Ile-Phe Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QCTFKEJEIMPOLW-JURCDPSOSA-N 0.000 description 1
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 1
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 1
- RUQBGIMJQUWXPP-CYDGBPFRSA-N Ala-Leu-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O RUQBGIMJQUWXPP-CYDGBPFRSA-N 0.000 description 1
- NOGFDULFCFXBHB-CIUDSAMLSA-N Ala-Leu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NOGFDULFCFXBHB-CIUDSAMLSA-N 0.000 description 1
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 1
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 1
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 1
- RGQCNKIDEQJEBT-CQDKDKBSSA-N Ala-Leu-Tyr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 RGQCNKIDEQJEBT-CQDKDKBSSA-N 0.000 description 1
- XHNLCGXYBXNRIS-BJDJZHNGSA-N Ala-Lys-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XHNLCGXYBXNRIS-BJDJZHNGSA-N 0.000 description 1
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 1
- NLOMBWNGESDVJU-GUBZILKMSA-N Ala-Met-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLOMBWNGESDVJU-GUBZILKMSA-N 0.000 description 1
- PVQLRJRPUTXFFX-CIUDSAMLSA-N Ala-Met-Gln Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O PVQLRJRPUTXFFX-CIUDSAMLSA-N 0.000 description 1
- DWYROCSXOOMOEU-CIUDSAMLSA-N Ala-Met-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DWYROCSXOOMOEU-CIUDSAMLSA-N 0.000 description 1
- IHRGVZXPTIQNIP-NAKRPEOUSA-N Ala-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C)N IHRGVZXPTIQNIP-NAKRPEOUSA-N 0.000 description 1
- XSTZMVAYYCJTNR-DCAQKATOSA-N Ala-Met-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XSTZMVAYYCJTNR-DCAQKATOSA-N 0.000 description 1
- AWNAEZICPNGAJK-FXQIFTODSA-N Ala-Met-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O AWNAEZICPNGAJK-FXQIFTODSA-N 0.000 description 1
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 1
- JAQNUEWEJWBVAY-WBAXXEDZSA-N Ala-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 JAQNUEWEJWBVAY-WBAXXEDZSA-N 0.000 description 1
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 1
- CYBJZLQSUJEMAS-LFSVMHDDSA-N Ala-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C)N)O CYBJZLQSUJEMAS-LFSVMHDDSA-N 0.000 description 1
- CUOMGDPDITUMIJ-HZZBMVKVSA-N Ala-Phe-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 CUOMGDPDITUMIJ-HZZBMVKVSA-N 0.000 description 1
- VQAVBBCZFQAAED-FXQIFTODSA-N Ala-Pro-Asn Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N VQAVBBCZFQAAED-FXQIFTODSA-N 0.000 description 1
- WQLDNOCHHRISMS-NAKRPEOUSA-N Ala-Pro-Ile Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WQLDNOCHHRISMS-NAKRPEOUSA-N 0.000 description 1
- BTRULDJUUVGRNE-DCAQKATOSA-N Ala-Pro-Lys Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O BTRULDJUUVGRNE-DCAQKATOSA-N 0.000 description 1
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 1
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 1
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 1
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 1
- NHWYNIZWLJYZAG-XVYDVKMFSA-N Ala-Ser-His Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N NHWYNIZWLJYZAG-XVYDVKMFSA-N 0.000 description 1
- OMCKWYSDUQBYCN-FXQIFTODSA-N Ala-Ser-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O OMCKWYSDUQBYCN-FXQIFTODSA-N 0.000 description 1
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- SYIFFFHSXBNPMC-UWJYBYFXSA-N Ala-Ser-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N SYIFFFHSXBNPMC-UWJYBYFXSA-N 0.000 description 1
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 1
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 1
- YNOCMHZSWJMGBB-GCJQMDKQSA-N Ala-Thr-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YNOCMHZSWJMGBB-GCJQMDKQSA-N 0.000 description 1
- SAHQGRZIQVEJPF-JXUBOQSCSA-N Ala-Thr-Lys Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN SAHQGRZIQVEJPF-JXUBOQSCSA-N 0.000 description 1
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- PXAFZDXYEIIUTF-LKTVYLICSA-N Ala-Trp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXAFZDXYEIIUTF-LKTVYLICSA-N 0.000 description 1
- LFFOJBOTZUWINF-ZANVPECISA-N Ala-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O)=CNC2=C1 LFFOJBOTZUWINF-ZANVPECISA-N 0.000 description 1
- MTDDMSUUXNQMKK-BPNCWPANSA-N Ala-Tyr-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N MTDDMSUUXNQMKK-BPNCWPANSA-N 0.000 description 1
- BGGAIXWIZCIFSG-XDTLVQLUSA-N Ala-Tyr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O BGGAIXWIZCIFSG-XDTLVQLUSA-N 0.000 description 1
- ZJLORAAXDAJLDC-CQDKDKBSSA-N Ala-Tyr-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O ZJLORAAXDAJLDC-CQDKDKBSSA-N 0.000 description 1
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 1
- YEBZNKPPOHFZJM-BPNCWPANSA-N Ala-Tyr-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O YEBZNKPPOHFZJM-BPNCWPANSA-N 0.000 description 1
- BVLPIIBTWIYOML-ZKWXMUAHSA-N Ala-Val-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BVLPIIBTWIYOML-ZKWXMUAHSA-N 0.000 description 1
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 1
- LYILPUNCKACNGF-NAKRPEOUSA-N Ala-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)N LYILPUNCKACNGF-NAKRPEOUSA-N 0.000 description 1
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 1
- DHONNEYAZPNGSG-UBHSHLNASA-N Ala-Val-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DHONNEYAZPNGSG-UBHSHLNASA-N 0.000 description 1
- 108010080691 Alcohol O-acetyltransferase Proteins 0.000 description 1
- 101000618348 Allochromatium vinosum (strain ATCC 17899 / DSM 180 / NBRC 103801 / NCIMB 10441 / D) Uncharacterized protein Alvin_0065 Proteins 0.000 description 1
- 241000700587 Alphaherpesvirinae Species 0.000 description 1
- 208000004881 Amebiasis Diseases 0.000 description 1
- 206010001980 Amoebiasis Diseases 0.000 description 1
- 102000009840 Angiopoietins Human genes 0.000 description 1
- 108010009906 Angiopoietins Proteins 0.000 description 1
- 102400000068 Angiostatin Human genes 0.000 description 1
- 108010079709 Angiostatins Proteins 0.000 description 1
- 206010002556 Ankylosing Spondylitis Diseases 0.000 description 1
- 108020005544 Antisense RNA Proteins 0.000 description 1
- 241000712891 Arenavirus Species 0.000 description 1
- HULHGJZIZXCPLD-FXQIFTODSA-N Arg-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N HULHGJZIZXCPLD-FXQIFTODSA-N 0.000 description 1
- YFWTXMRJJDNTLM-LSJOCFKGSA-N Arg-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFWTXMRJJDNTLM-LSJOCFKGSA-N 0.000 description 1
- YYOVLDPHIJAOSY-DCAQKATOSA-N Arg-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N YYOVLDPHIJAOSY-DCAQKATOSA-N 0.000 description 1
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 1
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 1
- YUIGJDNAGKJLDO-JYJNAYRXSA-N Arg-Arg-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YUIGJDNAGKJLDO-JYJNAYRXSA-N 0.000 description 1
- DCGLNNVKIZXQOJ-FXQIFTODSA-N Arg-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N DCGLNNVKIZXQOJ-FXQIFTODSA-N 0.000 description 1
- DPXDVGDLWJYZBH-GUBZILKMSA-N Arg-Asn-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DPXDVGDLWJYZBH-GUBZILKMSA-N 0.000 description 1
- WESHVRNMNFMVBE-FXQIFTODSA-N Arg-Asn-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)CN=C(N)N WESHVRNMNFMVBE-FXQIFTODSA-N 0.000 description 1
- CPSHGRGUPZBMOK-CIUDSAMLSA-N Arg-Asn-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CPSHGRGUPZBMOK-CIUDSAMLSA-N 0.000 description 1
- NUBPTCMEOCKWDO-DCAQKATOSA-N Arg-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N NUBPTCMEOCKWDO-DCAQKATOSA-N 0.000 description 1
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 1
- IIABBYGHLYWVOS-FXQIFTODSA-N Arg-Asn-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O IIABBYGHLYWVOS-FXQIFTODSA-N 0.000 description 1
- ITVINTQUZMQWJR-QXEWZRGKSA-N Arg-Asn-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ITVINTQUZMQWJR-QXEWZRGKSA-N 0.000 description 1
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 1
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 1
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 1
- SQKPKIJVWHAWNF-DCAQKATOSA-N Arg-Asp-Lys Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(O)=O SQKPKIJVWHAWNF-DCAQKATOSA-N 0.000 description 1
- FBLMOFHNVQBKRR-IHRRRGAJSA-N Arg-Asp-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FBLMOFHNVQBKRR-IHRRRGAJSA-N 0.000 description 1
- IGULQRCJLQQPSM-DCAQKATOSA-N Arg-Cys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IGULQRCJLQQPSM-DCAQKATOSA-N 0.000 description 1
- VSPLYCLMFAUZRF-GUBZILKMSA-N Arg-Cys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N VSPLYCLMFAUZRF-GUBZILKMSA-N 0.000 description 1
- FEZJJKXNPSEYEV-CIUDSAMLSA-N Arg-Gln-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FEZJJKXNPSEYEV-CIUDSAMLSA-N 0.000 description 1
- SNBHMYQRNCJSOJ-CIUDSAMLSA-N Arg-Gln-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SNBHMYQRNCJSOJ-CIUDSAMLSA-N 0.000 description 1
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 1
- LLZXKVAAEWBUPB-KKUMJFAQSA-N Arg-Gln-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLZXKVAAEWBUPB-KKUMJFAQSA-N 0.000 description 1
- YHQGEARSFILVHL-HJGDQZAQSA-N Arg-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)O YHQGEARSFILVHL-HJGDQZAQSA-N 0.000 description 1
- LMPKCSXZJSXBBL-NHCYSSNCSA-N Arg-Gln-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O LMPKCSXZJSXBBL-NHCYSSNCSA-N 0.000 description 1
- XLWSGICNBZGYTA-CIUDSAMLSA-N Arg-Glu-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XLWSGICNBZGYTA-CIUDSAMLSA-N 0.000 description 1
- MZRBYBIQTIKERR-GUBZILKMSA-N Arg-Glu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MZRBYBIQTIKERR-GUBZILKMSA-N 0.000 description 1
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 1
- UFBURHXMKFQVLM-CIUDSAMLSA-N Arg-Glu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UFBURHXMKFQVLM-CIUDSAMLSA-N 0.000 description 1
- IYMAXBFPHPZYIK-BQBZGAKWSA-N Arg-Gly-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IYMAXBFPHPZYIK-BQBZGAKWSA-N 0.000 description 1
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 1
- QKSAZKCRVQYYGS-UWVGGRQHSA-N Arg-Gly-His Chemical compound N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O QKSAZKCRVQYYGS-UWVGGRQHSA-N 0.000 description 1
- RFXXUWGNVRJTNQ-QXEWZRGKSA-N Arg-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N RFXXUWGNVRJTNQ-QXEWZRGKSA-N 0.000 description 1
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 1
- HAVKMRGWNXMCDR-STQMWFEESA-N Arg-Gly-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HAVKMRGWNXMCDR-STQMWFEESA-N 0.000 description 1
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 1
- ZZZWQALDSQQBEW-STQMWFEESA-N Arg-Gly-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZZZWQALDSQQBEW-STQMWFEESA-N 0.000 description 1
- NVCIXQYNWYTLDO-IHRRRGAJSA-N Arg-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N NVCIXQYNWYTLDO-IHRRRGAJSA-N 0.000 description 1
- GFMWTFHOZGLTLC-AVGNSLFASA-N Arg-His-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(O)=O GFMWTFHOZGLTLC-AVGNSLFASA-N 0.000 description 1
- NVUIWHJLPSZZQC-CYDGBPFRSA-N Arg-Ile-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NVUIWHJLPSZZQC-CYDGBPFRSA-N 0.000 description 1
- HCIUUZGFTDTEGM-NAKRPEOUSA-N Arg-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N HCIUUZGFTDTEGM-NAKRPEOUSA-N 0.000 description 1
- FLYANDHDFRGGTM-PYJNHQTQSA-N Arg-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FLYANDHDFRGGTM-PYJNHQTQSA-N 0.000 description 1
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 1
- HJDNZFIYILEIKR-OSUNSFLBSA-N Arg-Ile-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HJDNZFIYILEIKR-OSUNSFLBSA-N 0.000 description 1
- LLUGJARLJCGLAR-CYDGBPFRSA-N Arg-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LLUGJARLJCGLAR-CYDGBPFRSA-N 0.000 description 1
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 1
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 1
- GIMTZGADWZTZGV-DCAQKATOSA-N Arg-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GIMTZGADWZTZGV-DCAQKATOSA-N 0.000 description 1
- DIIGDGJKTMLQQW-IHRRRGAJSA-N Arg-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N DIIGDGJKTMLQQW-IHRRRGAJSA-N 0.000 description 1
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 1
- XUGATJVGQUGQKY-ULQDDVLXSA-N Arg-Lys-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XUGATJVGQUGQKY-ULQDDVLXSA-N 0.000 description 1
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 1
- RIQBRKVTFBWEDY-RHYQMDGZSA-N Arg-Lys-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RIQBRKVTFBWEDY-RHYQMDGZSA-N 0.000 description 1
- PYZPXCZNQSEHDT-GUBZILKMSA-N Arg-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N PYZPXCZNQSEHDT-GUBZILKMSA-N 0.000 description 1
- GITAWLWBTMJPKH-AVGNSLFASA-N Arg-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GITAWLWBTMJPKH-AVGNSLFASA-N 0.000 description 1
- INXWADWANGLMPJ-JYJNAYRXSA-N Arg-Phe-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)CC1=CC=CC=C1 INXWADWANGLMPJ-JYJNAYRXSA-N 0.000 description 1
- FKQITMVNILRUCQ-IHRRRGAJSA-N Arg-Phe-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O FKQITMVNILRUCQ-IHRRRGAJSA-N 0.000 description 1
- BSGSDLYGGHGMND-IHRRRGAJSA-N Arg-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N BSGSDLYGGHGMND-IHRRRGAJSA-N 0.000 description 1
- VEAIMHJZTIDCIH-KKUMJFAQSA-N Arg-Phe-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VEAIMHJZTIDCIH-KKUMJFAQSA-N 0.000 description 1
- NIELFHOLFTUZME-HJWJTTGWSA-N Arg-Phe-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NIELFHOLFTUZME-HJWJTTGWSA-N 0.000 description 1
- SLQQPJBDBVPVQV-JYJNAYRXSA-N Arg-Phe-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O SLQQPJBDBVPVQV-JYJNAYRXSA-N 0.000 description 1
- HNJNAMGZQZPSRE-GUBZILKMSA-N Arg-Pro-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O HNJNAMGZQZPSRE-GUBZILKMSA-N 0.000 description 1
- OVQJAKFLFTZDNC-GUBZILKMSA-N Arg-Pro-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O OVQJAKFLFTZDNC-GUBZILKMSA-N 0.000 description 1
- DNBMCNQKNOKOSD-DCAQKATOSA-N Arg-Pro-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O DNBMCNQKNOKOSD-DCAQKATOSA-N 0.000 description 1
- UULLJGQFCDXVTQ-CYDGBPFRSA-N Arg-Pro-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UULLJGQFCDXVTQ-CYDGBPFRSA-N 0.000 description 1
- JJIBHAOBNIFUEL-SRVKXCTJSA-N Arg-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCCN=C(N)N)N JJIBHAOBNIFUEL-SRVKXCTJSA-N 0.000 description 1
- YFHATWYGAAXQCF-JYJNAYRXSA-N Arg-Pro-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YFHATWYGAAXQCF-JYJNAYRXSA-N 0.000 description 1
- YCYXHLZRUSJITQ-SRVKXCTJSA-N Arg-Pro-Pro Chemical compound NC(=N)NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 YCYXHLZRUSJITQ-SRVKXCTJSA-N 0.000 description 1
- ATABBWFGOHKROJ-GUBZILKMSA-N Arg-Pro-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O ATABBWFGOHKROJ-GUBZILKMSA-N 0.000 description 1
- VUGWHBXPMAHEGZ-SRVKXCTJSA-N Arg-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N VUGWHBXPMAHEGZ-SRVKXCTJSA-N 0.000 description 1
- AUIJUTGLPVHIRT-FXQIFTODSA-N Arg-Ser-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N AUIJUTGLPVHIRT-FXQIFTODSA-N 0.000 description 1
- JQHASVQBAKRJKD-GUBZILKMSA-N Arg-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JQHASVQBAKRJKD-GUBZILKMSA-N 0.000 description 1
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 1
- FBXMCPLCVYUWBO-BPUTZDHNSA-N Arg-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N FBXMCPLCVYUWBO-BPUTZDHNSA-N 0.000 description 1
- ASQKVGRCKOFKIU-KZVJFYERSA-N Arg-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ASQKVGRCKOFKIU-KZVJFYERSA-N 0.000 description 1
- WCZXPVPHUMYLMS-VEVYYDQMSA-N Arg-Thr-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O WCZXPVPHUMYLMS-VEVYYDQMSA-N 0.000 description 1
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 1
- DDBMKOCQWNFDBH-RHYQMDGZSA-N Arg-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O DDBMKOCQWNFDBH-RHYQMDGZSA-N 0.000 description 1
- MOGMYRUNTKYZFB-UNQGMJICSA-N Arg-Thr-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MOGMYRUNTKYZFB-UNQGMJICSA-N 0.000 description 1
- ZJBUILVYSXQNSW-YTWAJWBKSA-N Arg-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ZJBUILVYSXQNSW-YTWAJWBKSA-N 0.000 description 1
- INOIAEUXVVNJKA-XGEHTFHBSA-N Arg-Thr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O INOIAEUXVVNJKA-XGEHTFHBSA-N 0.000 description 1
- ZPWMEWYQBWSGAO-ZJDVBMNYSA-N Arg-Thr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZPWMEWYQBWSGAO-ZJDVBMNYSA-N 0.000 description 1
- WTFIFQWLQXZLIZ-UMPQAUOISA-N Arg-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O WTFIFQWLQXZLIZ-UMPQAUOISA-N 0.000 description 1
- ZUVDFJXRAICIAJ-BPUTZDHNSA-N Arg-Trp-Asp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC(O)=O)C(O)=O)=CNC2=C1 ZUVDFJXRAICIAJ-BPUTZDHNSA-N 0.000 description 1
- UGJLILSJKSBVIR-ZFWWWQNUSA-N Arg-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)NCC(O)=O)=CNC2=C1 UGJLILSJKSBVIR-ZFWWWQNUSA-N 0.000 description 1
- XMGVWQWEWWULNS-BPUTZDHNSA-N Arg-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N XMGVWQWEWWULNS-BPUTZDHNSA-N 0.000 description 1
- ZCSHHTFOZULVLN-SZMVWBNQSA-N Arg-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N)=CNC2=C1 ZCSHHTFOZULVLN-SZMVWBNQSA-N 0.000 description 1
- AOJYORNRFWWEIV-IHRRRGAJSA-N Arg-Tyr-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 AOJYORNRFWWEIV-IHRRRGAJSA-N 0.000 description 1
- IZSMEUDYADKZTJ-KJEVXHAQSA-N Arg-Tyr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IZSMEUDYADKZTJ-KJEVXHAQSA-N 0.000 description 1
- XMZZGVGKGXRIGJ-JYJNAYRXSA-N Arg-Tyr-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O XMZZGVGKGXRIGJ-JYJNAYRXSA-N 0.000 description 1
- QTAIIXQCOPUNBQ-QXEWZRGKSA-N Arg-Val-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QTAIIXQCOPUNBQ-QXEWZRGKSA-N 0.000 description 1
- LLQIAIUAKGNOSE-NHCYSSNCSA-N Arg-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N LLQIAIUAKGNOSE-NHCYSSNCSA-N 0.000 description 1
- WOZDCBHUGJVJPL-AVGNSLFASA-N Arg-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WOZDCBHUGJVJPL-AVGNSLFASA-N 0.000 description 1
- WTUZDHWWGUQEKN-SRVKXCTJSA-N Arg-Val-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O WTUZDHWWGUQEKN-SRVKXCTJSA-N 0.000 description 1
- SUMJNGAMIQSNGX-TUAOUCFPSA-N Arg-Val-Pro Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N1CCC[C@@H]1C(O)=O SUMJNGAMIQSNGX-TUAOUCFPSA-N 0.000 description 1
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 1
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 1
- 102000004452 Arginase Human genes 0.000 description 1
- 108700024123 Arginases Proteins 0.000 description 1
- 206010003267 Arthritis reactive Diseases 0.000 description 1
- SWLOHUMCUDRTCL-ZLUOBGJFSA-N Asn-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N SWLOHUMCUDRTCL-ZLUOBGJFSA-N 0.000 description 1
- LEFKSBYHUGUWLP-ACZMJKKPSA-N Asn-Ala-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LEFKSBYHUGUWLP-ACZMJKKPSA-N 0.000 description 1
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 1
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 1
- DQTIWTULBGLJBL-DCAQKATOSA-N Asn-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N DQTIWTULBGLJBL-DCAQKATOSA-N 0.000 description 1
- HUZGPXBILPMCHM-IHRRRGAJSA-N Asn-Arg-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HUZGPXBILPMCHM-IHRRRGAJSA-N 0.000 description 1
- POOCJCRBHHMAOS-FXQIFTODSA-N Asn-Arg-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O POOCJCRBHHMAOS-FXQIFTODSA-N 0.000 description 1
- CQMQJWRCRQSBAF-BPUTZDHNSA-N Asn-Arg-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N CQMQJWRCRQSBAF-BPUTZDHNSA-N 0.000 description 1
- APHUDFFMXFYRKP-CIUDSAMLSA-N Asn-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N APHUDFFMXFYRKP-CIUDSAMLSA-N 0.000 description 1
- NVGWESORMHFISY-SRVKXCTJSA-N Asn-Asn-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NVGWESORMHFISY-SRVKXCTJSA-N 0.000 description 1
- WVCJSDCHTUTONA-FXQIFTODSA-N Asn-Asp-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WVCJSDCHTUTONA-FXQIFTODSA-N 0.000 description 1
- BHQQRVARKXWXPP-ACZMJKKPSA-N Asn-Asp-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N BHQQRVARKXWXPP-ACZMJKKPSA-N 0.000 description 1
- HUAOKVVEVHACHR-CIUDSAMLSA-N Asn-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N HUAOKVVEVHACHR-CIUDSAMLSA-N 0.000 description 1
- ZWASIOHRQWRWAS-UGYAYLCHSA-N Asn-Asp-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZWASIOHRQWRWAS-UGYAYLCHSA-N 0.000 description 1
- VYLVOMUVLMGCRF-ZLUOBGJFSA-N Asn-Asp-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VYLVOMUVLMGCRF-ZLUOBGJFSA-N 0.000 description 1
- FAEFJTCTNZTPHX-ACZMJKKPSA-N Asn-Gln-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FAEFJTCTNZTPHX-ACZMJKKPSA-N 0.000 description 1
- XWFPGQVLOVGSLU-CIUDSAMLSA-N Asn-Gln-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XWFPGQVLOVGSLU-CIUDSAMLSA-N 0.000 description 1
- ULRPXVNMIIYDDJ-ACZMJKKPSA-N Asn-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N ULRPXVNMIIYDDJ-ACZMJKKPSA-N 0.000 description 1
- XVAPVJNJGLWGCS-ACZMJKKPSA-N Asn-Glu-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVAPVJNJGLWGCS-ACZMJKKPSA-N 0.000 description 1
- QYXNFROWLZPWPC-FXQIFTODSA-N Asn-Glu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QYXNFROWLZPWPC-FXQIFTODSA-N 0.000 description 1
- OLGCWMNDJTWQAG-GUBZILKMSA-N Asn-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(N)=O OLGCWMNDJTWQAG-GUBZILKMSA-N 0.000 description 1
- BKDDABUWNKGZCK-XHNCKOQMSA-N Asn-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O BKDDABUWNKGZCK-XHNCKOQMSA-N 0.000 description 1
- DMLSCRJBWUEALP-LAEOZQHASA-N Asn-Glu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O DMLSCRJBWUEALP-LAEOZQHASA-N 0.000 description 1
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 1
- DDPXDCKYWDGZAL-BQBZGAKWSA-N Asn-Gly-Arg Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N DDPXDCKYWDGZAL-BQBZGAKWSA-N 0.000 description 1
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 1
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 1
- GURLOFOJBHRPJN-AAEUAGOBSA-N Asn-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N GURLOFOJBHRPJN-AAEUAGOBSA-N 0.000 description 1
- RAKKBBHMTJSXOY-XVYDVKMFSA-N Asn-His-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O RAKKBBHMTJSXOY-XVYDVKMFSA-N 0.000 description 1
- ZKDGORKGHPCZOV-DCAQKATOSA-N Asn-His-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZKDGORKGHPCZOV-DCAQKATOSA-N 0.000 description 1
- ZTRJUKDEALVRMW-SRVKXCTJSA-N Asn-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZTRJUKDEALVRMW-SRVKXCTJSA-N 0.000 description 1
- SUEIIIFUBHDCCS-PBCZWWQYSA-N Asn-His-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SUEIIIFUBHDCCS-PBCZWWQYSA-N 0.000 description 1
- PHJPKNUWWHRAOC-PEFMBERDSA-N Asn-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PHJPKNUWWHRAOC-PEFMBERDSA-N 0.000 description 1
- NVWJMQNYLYWVNQ-BYULHYEWSA-N Asn-Ile-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O NVWJMQNYLYWVNQ-BYULHYEWSA-N 0.000 description 1
- ACKNRKFVYUVWAC-ZPFDUUQYSA-N Asn-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ACKNRKFVYUVWAC-ZPFDUUQYSA-N 0.000 description 1
- XLZCLJRGGMBKLR-PCBIJLKTSA-N Asn-Ile-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XLZCLJRGGMBKLR-PCBIJLKTSA-N 0.000 description 1
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 1
- HFPXZWPUVFVNLL-GUBZILKMSA-N Asn-Leu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFPXZWPUVFVNLL-GUBZILKMSA-N 0.000 description 1
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 1
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 1
- JEEFEQCRXKPQHC-KKUMJFAQSA-N Asn-Leu-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JEEFEQCRXKPQHC-KKUMJFAQSA-N 0.000 description 1
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 1
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 1
- FODVBOKTYKYRFJ-CIUDSAMLSA-N Asn-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N FODVBOKTYKYRFJ-CIUDSAMLSA-N 0.000 description 1
- WXVGISRWSYGEDK-KKUMJFAQSA-N Asn-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N WXVGISRWSYGEDK-KKUMJFAQSA-N 0.000 description 1
- GIQCDTKOIPUDSG-GARJFASQSA-N Asn-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N)C(=O)O GIQCDTKOIPUDSG-GARJFASQSA-N 0.000 description 1
- KNENKKKUYGEZIO-FXQIFTODSA-N Asn-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N KNENKKKUYGEZIO-FXQIFTODSA-N 0.000 description 1
- MDDXKBHIMYYJLW-FXQIFTODSA-N Asn-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N MDDXKBHIMYYJLW-FXQIFTODSA-N 0.000 description 1
- HMUKKNAMNSXDBB-CIUDSAMLSA-N Asn-Met-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HMUKKNAMNSXDBB-CIUDSAMLSA-N 0.000 description 1
- MYVBTYXSWILFCG-BQBZGAKWSA-N Asn-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N MYVBTYXSWILFCG-BQBZGAKWSA-N 0.000 description 1
- KSGAFDTYQPKUAP-GMOBBJLQSA-N Asn-Met-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KSGAFDTYQPKUAP-GMOBBJLQSA-N 0.000 description 1
- AEZCCDMZZJOGII-DCAQKATOSA-N Asn-Met-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O AEZCCDMZZJOGII-DCAQKATOSA-N 0.000 description 1
- WCRQQIPFSXFIRN-LPEHRKFASA-N Asn-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N WCRQQIPFSXFIRN-LPEHRKFASA-N 0.000 description 1
- LSJQOMAZIKQMTJ-SRVKXCTJSA-N Asn-Phe-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LSJQOMAZIKQMTJ-SRVKXCTJSA-N 0.000 description 1
- ZVUMKOMKQCANOM-AVGNSLFASA-N Asn-Phe-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVUMKOMKQCANOM-AVGNSLFASA-N 0.000 description 1
- HZZIFFOVHLWGCS-KKUMJFAQSA-N Asn-Phe-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O HZZIFFOVHLWGCS-KKUMJFAQSA-N 0.000 description 1
- UYCPJVYQYARFGB-YDHLFZDLSA-N Asn-Phe-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O UYCPJVYQYARFGB-YDHLFZDLSA-N 0.000 description 1
- YRTOMUMWSTUQAX-FXQIFTODSA-N Asn-Pro-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O YRTOMUMWSTUQAX-FXQIFTODSA-N 0.000 description 1
- OSZBYGVKAFZWKC-FXQIFTODSA-N Asn-Pro-Cys Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(O)=O OSZBYGVKAFZWKC-FXQIFTODSA-N 0.000 description 1
- XMHFCUKJRCQXGI-CIUDSAMLSA-N Asn-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O XMHFCUKJRCQXGI-CIUDSAMLSA-N 0.000 description 1
- UWFOMGUWGPRVBW-GUBZILKMSA-N Asn-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)N)N UWFOMGUWGPRVBW-GUBZILKMSA-N 0.000 description 1
- SUIJFTJDTJKSRK-IHRRRGAJSA-N Asn-Pro-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUIJFTJDTJKSRK-IHRRRGAJSA-N 0.000 description 1
- VHQSGALUSWIYOD-QXEWZRGKSA-N Asn-Pro-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O VHQSGALUSWIYOD-QXEWZRGKSA-N 0.000 description 1
- KYQJHBWHRASMKG-ZLUOBGJFSA-N Asn-Ser-Cys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O KYQJHBWHRASMKG-ZLUOBGJFSA-N 0.000 description 1
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 1
- DOURAOODTFJRIC-CIUDSAMLSA-N Asn-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N DOURAOODTFJRIC-CIUDSAMLSA-N 0.000 description 1
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 1
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 description 1
- WLVLIYYBPPONRJ-GCJQMDKQSA-N Asn-Thr-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O WLVLIYYBPPONRJ-GCJQMDKQSA-N 0.000 description 1
- ZUFPUBYQYWCMDB-NUMRIWBASA-N Asn-Thr-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZUFPUBYQYWCMDB-NUMRIWBASA-N 0.000 description 1
- FMNBYVSGRCXWEK-FOHZUACHSA-N Asn-Thr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O FMNBYVSGRCXWEK-FOHZUACHSA-N 0.000 description 1
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 1
- FHCRKXCTKSHNOE-QEJZJMRPSA-N Asn-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N FHCRKXCTKSHNOE-QEJZJMRPSA-N 0.000 description 1
- JPPLRQVZMZFOSX-UWJYBYFXSA-N Asn-Tyr-Ala Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 JPPLRQVZMZFOSX-UWJYBYFXSA-N 0.000 description 1
- YSYTWUMRHSFODC-QWRGUYRKSA-N Asn-Tyr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O YSYTWUMRHSFODC-QWRGUYRKSA-N 0.000 description 1
- DATSKXOXPUAOLK-KKUMJFAQSA-N Asn-Tyr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DATSKXOXPUAOLK-KKUMJFAQSA-N 0.000 description 1
- CBWCQCANJSGUOH-ZKWXMUAHSA-N Asn-Val-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O CBWCQCANJSGUOH-ZKWXMUAHSA-N 0.000 description 1
- XLDMSQYOYXINSZ-QXEWZRGKSA-N Asn-Val-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XLDMSQYOYXINSZ-QXEWZRGKSA-N 0.000 description 1
- MJIJBEYEHBKTIM-BYULHYEWSA-N Asn-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MJIJBEYEHBKTIM-BYULHYEWSA-N 0.000 description 1
- GHWWTICYPDKPTE-NGZCFLSTSA-N Asn-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N GHWWTICYPDKPTE-NGZCFLSTSA-N 0.000 description 1
- QXNGSPZMGFEZNO-QRTARXTBSA-N Asn-Val-Trp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O QXNGSPZMGFEZNO-QRTARXTBSA-N 0.000 description 1
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 1
- KDFQZBWWPYQBEN-ZLUOBGJFSA-N Asp-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N KDFQZBWWPYQBEN-ZLUOBGJFSA-N 0.000 description 1
- WSWYMRLTJVKRCE-ZLUOBGJFSA-N Asp-Ala-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O WSWYMRLTJVKRCE-ZLUOBGJFSA-N 0.000 description 1
- VTYQAQFKMQTKQD-ACZMJKKPSA-N Asp-Ala-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O VTYQAQFKMQTKQD-ACZMJKKPSA-N 0.000 description 1
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 1
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 1
- VPPXTHJNTYDNFJ-CIUDSAMLSA-N Asp-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N VPPXTHJNTYDNFJ-CIUDSAMLSA-N 0.000 description 1
- XPGVTUBABLRGHY-BIIVOSGPSA-N Asp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N XPGVTUBABLRGHY-BIIVOSGPSA-N 0.000 description 1
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 1
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 1
- GVPSCJQLUGIKAM-GUBZILKMSA-N Asp-Arg-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GVPSCJQLUGIKAM-GUBZILKMSA-N 0.000 description 1
- SOYOSFXLXYZNRG-CIUDSAMLSA-N Asp-Arg-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O SOYOSFXLXYZNRG-CIUDSAMLSA-N 0.000 description 1
- AXXCUABIFZPKPM-BQBZGAKWSA-N Asp-Arg-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O AXXCUABIFZPKPM-BQBZGAKWSA-N 0.000 description 1
- XYBJLTKSGFBLCS-QXEWZRGKSA-N Asp-Arg-Val Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC(O)=O XYBJLTKSGFBLCS-QXEWZRGKSA-N 0.000 description 1
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 1
- JDHOJQJMWBKHDB-CIUDSAMLSA-N Asp-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N JDHOJQJMWBKHDB-CIUDSAMLSA-N 0.000 description 1
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 1
- QOVWVLLHMMCFFY-ZLUOBGJFSA-N Asp-Asp-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QOVWVLLHMMCFFY-ZLUOBGJFSA-N 0.000 description 1
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 description 1
- WCFCYFDBMNFSPA-ACZMJKKPSA-N Asp-Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O WCFCYFDBMNFSPA-ACZMJKKPSA-N 0.000 description 1
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 1
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 1
- SVFOIXMRMLROHO-SRVKXCTJSA-N Asp-Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SVFOIXMRMLROHO-SRVKXCTJSA-N 0.000 description 1
- BFOYULZBKYOKAN-OLHMAJIHSA-N Asp-Asp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFOYULZBKYOKAN-OLHMAJIHSA-N 0.000 description 1
- KGAJCJXBEWLQDZ-UBHSHLNASA-N Asp-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N KGAJCJXBEWLQDZ-UBHSHLNASA-N 0.000 description 1
- MJKBOVWWADWLHV-ZLUOBGJFSA-N Asp-Cys-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)C(=O)O MJKBOVWWADWLHV-ZLUOBGJFSA-N 0.000 description 1
- WJHYGGVCWREQMO-GHCJXIJMSA-N Asp-Cys-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WJHYGGVCWREQMO-GHCJXIJMSA-N 0.000 description 1
- ACEDJCOOPZFUBU-CIUDSAMLSA-N Asp-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N ACEDJCOOPZFUBU-CIUDSAMLSA-N 0.000 description 1
- PJERDVUTUDZPGX-ZKWXMUAHSA-N Asp-Cys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC(O)=O PJERDVUTUDZPGX-ZKWXMUAHSA-N 0.000 description 1
- WLKVEEODTPQPLI-ACZMJKKPSA-N Asp-Gln-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O WLKVEEODTPQPLI-ACZMJKKPSA-N 0.000 description 1
- RYKWOUUZJFSJOH-FXQIFTODSA-N Asp-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N RYKWOUUZJFSJOH-FXQIFTODSA-N 0.000 description 1
- HRGGPWBIMIQANI-GUBZILKMSA-N Asp-Gln-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HRGGPWBIMIQANI-GUBZILKMSA-N 0.000 description 1
- DXQOQMCLWWADMU-ACZMJKKPSA-N Asp-Gln-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DXQOQMCLWWADMU-ACZMJKKPSA-N 0.000 description 1
- ZSJFGGSPCCHMNE-LAEOZQHASA-N Asp-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N ZSJFGGSPCCHMNE-LAEOZQHASA-N 0.000 description 1
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 1
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 1
- XDGBFDYXZCMYEX-NUMRIWBASA-N Asp-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)O XDGBFDYXZCMYEX-NUMRIWBASA-N 0.000 description 1
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 1
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 1
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 1
- ZSVJVIOVABDTTL-YUMQZZPRSA-N Asp-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N ZSVJVIOVABDTTL-YUMQZZPRSA-N 0.000 description 1
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 1
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 1
- PGUYEUCYVNZGGV-QWRGUYRKSA-N Asp-Gly-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PGUYEUCYVNZGGV-QWRGUYRKSA-N 0.000 description 1
- JOCQXVJCTCEFAZ-CIUDSAMLSA-N Asp-His-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O JOCQXVJCTCEFAZ-CIUDSAMLSA-N 0.000 description 1
- WSXDIZFNQYTUJB-SRVKXCTJSA-N Asp-His-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O WSXDIZFNQYTUJB-SRVKXCTJSA-N 0.000 description 1
- UBPMOJLRVMGTOQ-GARJFASQSA-N Asp-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N)C(=O)O UBPMOJLRVMGTOQ-GARJFASQSA-N 0.000 description 1
- GBSUGIXJAAKZOW-GMOBBJLQSA-N Asp-Ile-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GBSUGIXJAAKZOW-GMOBBJLQSA-N 0.000 description 1
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 1
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 1
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 1
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 1
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 1
- KFAFUJMGHVVYRC-DCAQKATOSA-N Asp-Leu-Met Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O KFAFUJMGHVVYRC-DCAQKATOSA-N 0.000 description 1
- HKEZZWQWXWGASX-KKUMJFAQSA-N Asp-Leu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HKEZZWQWXWGASX-KKUMJFAQSA-N 0.000 description 1
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 1
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 1
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 1
- UZFHNLYQWMGUHU-DCAQKATOSA-N Asp-Lys-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UZFHNLYQWMGUHU-DCAQKATOSA-N 0.000 description 1
- VSMYBNPOHYAXSD-GUBZILKMSA-N Asp-Lys-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O VSMYBNPOHYAXSD-GUBZILKMSA-N 0.000 description 1
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 1
- VMVUDJUXJKDGNR-FXQIFTODSA-N Asp-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N VMVUDJUXJKDGNR-FXQIFTODSA-N 0.000 description 1
- WDMNFNXKGSLIOB-GUBZILKMSA-N Asp-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N WDMNFNXKGSLIOB-GUBZILKMSA-N 0.000 description 1
- YZQCXOFQZKCETR-UWVGGRQHSA-N Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YZQCXOFQZKCETR-UWVGGRQHSA-N 0.000 description 1
- LIJXJYGRSRWLCJ-IHRRRGAJSA-N Asp-Phe-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LIJXJYGRSRWLCJ-IHRRRGAJSA-N 0.000 description 1
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 1
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 1
- ZKAOJVJQGVUIIU-GUBZILKMSA-N Asp-Pro-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZKAOJVJQGVUIIU-GUBZILKMSA-N 0.000 description 1
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 1
- YFGUZQQCSDZRBN-DCAQKATOSA-N Asp-Pro-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YFGUZQQCSDZRBN-DCAQKATOSA-N 0.000 description 1
- LGGHQRZIJSYRHA-GUBZILKMSA-N Asp-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)N LGGHQRZIJSYRHA-GUBZILKMSA-N 0.000 description 1
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- VNXQRBXEQXLERQ-CIUDSAMLSA-N Asp-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N VNXQRBXEQXLERQ-CIUDSAMLSA-N 0.000 description 1
- HRVQDZOWMLFAOD-BIIVOSGPSA-N Asp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N)C(=O)O HRVQDZOWMLFAOD-BIIVOSGPSA-N 0.000 description 1
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 1
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 description 1
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 1
- UTLCRGFJFSZWAW-OLHMAJIHSA-N Asp-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UTLCRGFJFSZWAW-OLHMAJIHSA-N 0.000 description 1
- IWLZBRTUIVXZJD-OLHMAJIHSA-N Asp-Thr-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O IWLZBRTUIVXZJD-OLHMAJIHSA-N 0.000 description 1
- KBJVTFWQWXCYCQ-IUKAMOBKSA-N Asp-Thr-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KBJVTFWQWXCYCQ-IUKAMOBKSA-N 0.000 description 1
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 1
- PDIYGFYAMZZFCW-JIOCBJNQSA-N Asp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N)O PDIYGFYAMZZFCW-JIOCBJNQSA-N 0.000 description 1
- JDDYEZGPYBBPBN-JRQIVUDYSA-N Asp-Thr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JDDYEZGPYBBPBN-JRQIVUDYSA-N 0.000 description 1
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 description 1
- CXEFNHOVIIDHFU-IHPCNDPISA-N Asp-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC(=O)O)N CXEFNHOVIIDHFU-IHPCNDPISA-N 0.000 description 1
- BOXNGMVEVOGXOJ-UBHSHLNASA-N Asp-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N BOXNGMVEVOGXOJ-UBHSHLNASA-N 0.000 description 1
- KNOGLZBISUBTFW-QRTARXTBSA-N Asp-Trp-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O KNOGLZBISUBTFW-QRTARXTBSA-N 0.000 description 1
- WOKXEQLPBLLWHC-IHRRRGAJSA-N Asp-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 WOKXEQLPBLLWHC-IHRRRGAJSA-N 0.000 description 1
- CZIVKMOEXPILDK-SRVKXCTJSA-N Asp-Tyr-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O CZIVKMOEXPILDK-SRVKXCTJSA-N 0.000 description 1
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 1
- GFYOIYJJMSHLSN-QXEWZRGKSA-N Asp-Val-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GFYOIYJJMSHLSN-QXEWZRGKSA-N 0.000 description 1
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 description 1
- 201000002909 Aspergillosis Diseases 0.000 description 1
- 208000036641 Aspergillus infections Diseases 0.000 description 1
- 101000781117 Autographa californica nuclear polyhedrosis virus Uncharacterized 12.4 kDa protein in CTL-LEF2 intergenic region Proteins 0.000 description 1
- 241000711404 Avian avulavirus 1 Species 0.000 description 1
- 241000700663 Avipoxvirus Species 0.000 description 1
- 101000708323 Azospirillum brasilense Uncharacterized 28.8 kDa protein in nifR3-like 5'region Proteins 0.000 description 1
- 101000770311 Azotobacter chroococcum mcd 1 Uncharacterized 19.8 kDa protein in nifW 5'region Proteins 0.000 description 1
- 208000003950 B-cell lymphoma Diseases 0.000 description 1
- 101000748761 Bacillus subtilis (strain 168) Uncharacterized MFS-type transporter YcxA Proteins 0.000 description 1
- 101000765620 Bacillus subtilis (strain 168) Uncharacterized protein YlxP Proteins 0.000 description 1
- 101000916134 Bacillus subtilis (strain 168) Uncharacterized protein YqxJ Proteins 0.000 description 1
- 101000805768 Banna virus (strain Indonesia/JKT-6423/1980) mRNA (guanine-N(7))-methyltransferase Proteins 0.000 description 1
- 206010044583 Bartonella Infections Diseases 0.000 description 1
- 101000742334 Bdellovibrio phage phiMH2K Replication-associated protein VP4 Proteins 0.000 description 1
- 102100026189 Beta-galactosidase Human genes 0.000 description 1
- 241000701021 Betaherpesvirinae Species 0.000 description 1
- 206010005098 Blastomycosis Diseases 0.000 description 1
- 101000754349 Bordetella pertussis (strain Tohama I / ATCC BAA-589 / NCTC 13251) UPF0065 protein BP0148 Proteins 0.000 description 1
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 1
- 241000712005 Bovine respirovirus 3 Species 0.000 description 1
- 206010006500 Brucellosis Diseases 0.000 description 1
- 241000722910 Burkholderia mallei Species 0.000 description 1
- 206010069747 Burkholderia mallei infection Diseases 0.000 description 1
- 206010069748 Burkholderia pseudomallei infection Diseases 0.000 description 1
- 208000011691 Burkitt lymphomas Diseases 0.000 description 1
- 101710186200 CCAAT/enhancer-binding protein Proteins 0.000 description 1
- 108010063916 CD40 Antigens Proteins 0.000 description 1
- 102100022002 CD59 glycoprotein Human genes 0.000 description 1
- 101000827633 Caldicellulosiruptor sp. (strain Rt8B.4) Uncharacterized 23.9 kDa protein in xynA 3'region Proteins 0.000 description 1
- 208000008889 California Encephalitis Diseases 0.000 description 1
- 241000222122 Candida albicans Species 0.000 description 1
- 206010007134 Candida infections Diseases 0.000 description 1
- 241000711506 Canine coronavirus Species 0.000 description 1
- 241000712083 Canine morbillivirus Species 0.000 description 1
- 241000701931 Canine parvovirus Species 0.000 description 1
- 241000700664 Capripoxvirus Species 0.000 description 1
- 101710167800 Capsid assembly scaffolding protein Proteins 0.000 description 1
- 101710197658 Capsid protein VP1 Proteins 0.000 description 1
- 108090000489 Carboxy-Lyases Proteins 0.000 description 1
- 102000004031 Carboxy-Lyases Human genes 0.000 description 1
- 102000000844 Cell Surface Receptors Human genes 0.000 description 1
- 108010001857 Cell Surface Receptors Proteins 0.000 description 1
- 241000282693 Cercopithecidae Species 0.000 description 1
- 241000242722 Cestoda Species 0.000 description 1
- 101000686790 Chaetoceros protobacilladnavirus 2 Replication-associated protein Proteins 0.000 description 1
- 102000019034 Chemokines Human genes 0.000 description 1
- 108010012236 Chemokines Proteins 0.000 description 1
- 108010019670 Chimeric Antigen Receptors Proteins 0.000 description 1
- 101000864475 Chlamydia phage 1 Internal scaffolding protein VP3 Proteins 0.000 description 1
- 241000282552 Chlorocebus aethiops Species 0.000 description 1
- 206010008631 Cholera Diseases 0.000 description 1
- 241000700628 Chordopoxvirinae Species 0.000 description 1
- 206010008803 Chromoblastomycosis Diseases 0.000 description 1
- 208000015116 Chromomycosis Diseases 0.000 description 1
- 101000947628 Claviceps purpurea Uncharacterized 11.8 kDa protein Proteins 0.000 description 1
- 241001112696 Clostridia Species 0.000 description 1
- 241000193155 Clostridium botulinum Species 0.000 description 1
- 241000193468 Clostridium perfringens Species 0.000 description 1
- 101000686796 Clostridium perfringens Replication protein Proteins 0.000 description 1
- 102100022641 Coagulation factor IX Human genes 0.000 description 1
- 241000223205 Coccidioides immitis Species 0.000 description 1
- 206010009900 Colitis ulcerative Diseases 0.000 description 1
- 208000009802 Colorado tick fever Diseases 0.000 description 1
- 108010047041 Complementarity Determining Regions Proteins 0.000 description 1
- 101710139375 Corneodesmosin Proteins 0.000 description 1
- 102100031725 Cortactin-binding protein 2 Human genes 0.000 description 1
- 241000186216 Corynebacterium Species 0.000 description 1
- 241001445332 Coxiella <snail> Species 0.000 description 1
- 241000709687 Coxsackievirus Species 0.000 description 1
- 241000699800 Cricetinae Species 0.000 description 1
- 208000011231 Crohn disease Diseases 0.000 description 1
- 201000007336 Cryptococcosis Diseases 0.000 description 1
- 241000221204 Cryptococcus neoformans Species 0.000 description 1
- 108010045171 Cyclic AMP Response Element-Binding Protein Proteins 0.000 description 1
- 102000005636 Cyclic AMP Response Element-Binding Protein Human genes 0.000 description 1
- 102100023580 Cyclic AMP-dependent transcription factor ATF-4 Human genes 0.000 description 1
- AEJSNWMRPXAKCW-WHFBIAKZSA-N Cys-Ala-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AEJSNWMRPXAKCW-WHFBIAKZSA-N 0.000 description 1
- NOCCABSVTRONIN-CIUDSAMLSA-N Cys-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N NOCCABSVTRONIN-CIUDSAMLSA-N 0.000 description 1
- PRXCTTWKGJAPMT-ZLUOBGJFSA-N Cys-Ala-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O PRXCTTWKGJAPMT-ZLUOBGJFSA-N 0.000 description 1
- SZQCDCKIGWQAQN-FXQIFTODSA-N Cys-Arg-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O SZQCDCKIGWQAQN-FXQIFTODSA-N 0.000 description 1
- UKVGHFORADMBEN-GUBZILKMSA-N Cys-Arg-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UKVGHFORADMBEN-GUBZILKMSA-N 0.000 description 1
- LRZPRGJXAZFXCR-DCAQKATOSA-N Cys-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N LRZPRGJXAZFXCR-DCAQKATOSA-N 0.000 description 1
- SQJSYLDKQBZQTG-FXQIFTODSA-N Cys-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N SQJSYLDKQBZQTG-FXQIFTODSA-N 0.000 description 1
- IIGHQOPGMGKDMT-SRVKXCTJSA-N Cys-Asp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N IIGHQOPGMGKDMT-SRVKXCTJSA-N 0.000 description 1
- SMYXEYRYCLIPIL-ZLUOBGJFSA-N Cys-Cys-Cys Chemical compound SC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(O)=O SMYXEYRYCLIPIL-ZLUOBGJFSA-N 0.000 description 1
- WYZLWZNAWQNLGQ-FXQIFTODSA-N Cys-Cys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CS)N WYZLWZNAWQNLGQ-FXQIFTODSA-N 0.000 description 1
- HNNGTYHNYDOSKV-FXQIFTODSA-N Cys-Cys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CS)N HNNGTYHNYDOSKV-FXQIFTODSA-N 0.000 description 1
- RFHGRMMADHHQSA-KBIXCLLPSA-N Cys-Gln-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RFHGRMMADHHQSA-KBIXCLLPSA-N 0.000 description 1
- BCSYBBMFGLHCOA-ACZMJKKPSA-N Cys-Glu-Cys Chemical compound SC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O BCSYBBMFGLHCOA-ACZMJKKPSA-N 0.000 description 1
- UDPSLLFHOLGXBY-FXQIFTODSA-N Cys-Glu-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDPSLLFHOLGXBY-FXQIFTODSA-N 0.000 description 1
- VIRYODQIWJNWNU-NRPADANISA-N Cys-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N VIRYODQIWJNWNU-NRPADANISA-N 0.000 description 1
- CVLIHKBUPSFRQP-WHFBIAKZSA-N Cys-Gly-Ala Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](C)C(O)=O CVLIHKBUPSFRQP-WHFBIAKZSA-N 0.000 description 1
- XTHUKRLJRUVVBF-WHFBIAKZSA-N Cys-Gly-Ser Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O XTHUKRLJRUVVBF-WHFBIAKZSA-N 0.000 description 1
- JDHMXPSXWMPYQZ-AAEUAGOBSA-N Cys-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N JDHMXPSXWMPYQZ-AAEUAGOBSA-N 0.000 description 1
- UXIYYUMGFNSGBK-XPUUQOCRSA-N Cys-Gly-Val Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O UXIYYUMGFNSGBK-XPUUQOCRSA-N 0.000 description 1
- OWAFTBLVZNSIFO-SRVKXCTJSA-N Cys-His-His Chemical compound N[C@@H](CS)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O OWAFTBLVZNSIFO-SRVKXCTJSA-N 0.000 description 1
- UQHYQYXOLIYNSR-CUJWVEQBSA-N Cys-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N)O UQHYQYXOLIYNSR-CUJWVEQBSA-N 0.000 description 1
- OTXLNICGSXPGQF-KBIXCLLPSA-N Cys-Ile-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTXLNICGSXPGQF-KBIXCLLPSA-N 0.000 description 1
- PRHGYQOSEHLDRW-VGDYDELISA-N Cys-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CS)N PRHGYQOSEHLDRW-VGDYDELISA-N 0.000 description 1
- YFAFBAPQHGULQT-HJPIBITLSA-N Cys-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CS)N YFAFBAPQHGULQT-HJPIBITLSA-N 0.000 description 1
- ABLJDBFJPUWQQB-DCAQKATOSA-N Cys-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CS)N ABLJDBFJPUWQQB-DCAQKATOSA-N 0.000 description 1
- DIHCYBRLTVEPBW-SRVKXCTJSA-N Cys-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CS)N DIHCYBRLTVEPBW-SRVKXCTJSA-N 0.000 description 1
- KCPOQGRVVXYLAC-KKUMJFAQSA-N Cys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CS)N KCPOQGRVVXYLAC-KKUMJFAQSA-N 0.000 description 1
- SRIRHERUAMYIOQ-CIUDSAMLSA-N Cys-Leu-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SRIRHERUAMYIOQ-CIUDSAMLSA-N 0.000 description 1
- LBSKYJOZIIOZIO-DCAQKATOSA-N Cys-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N LBSKYJOZIIOZIO-DCAQKATOSA-N 0.000 description 1
- AFYGNOJUTMXQIG-FXQIFTODSA-N Cys-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CS)N AFYGNOJUTMXQIG-FXQIFTODSA-N 0.000 description 1
- POSRGGKLRWCUBE-CIUDSAMLSA-N Cys-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N POSRGGKLRWCUBE-CIUDSAMLSA-N 0.000 description 1
- CNBIWHCVAZHRBI-IHRRRGAJSA-N Cys-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CS)N CNBIWHCVAZHRBI-IHRRRGAJSA-N 0.000 description 1
- RWVBNRYBHAGYSG-GUBZILKMSA-N Cys-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CS)N RWVBNRYBHAGYSG-GUBZILKMSA-N 0.000 description 1
- HEPLXMBVMCXTBP-QWRGUYRKSA-N Cys-Phe-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O HEPLXMBVMCXTBP-QWRGUYRKSA-N 0.000 description 1
- JUNZLDGUJZIUCO-IHRRRGAJSA-N Cys-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O JUNZLDGUJZIUCO-IHRRRGAJSA-N 0.000 description 1
- RJPKQCFHEPPTGL-ZLUOBGJFSA-N Cys-Ser-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RJPKQCFHEPPTGL-ZLUOBGJFSA-N 0.000 description 1
- BCWIFCLVCRAIQK-ZLUOBGJFSA-N Cys-Ser-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)O BCWIFCLVCRAIQK-ZLUOBGJFSA-N 0.000 description 1
- NXQCSPVUPLUTJH-WHFBIAKZSA-N Cys-Ser-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O NXQCSPVUPLUTJH-WHFBIAKZSA-N 0.000 description 1
- IXPSSIBVVKSOIE-SRVKXCTJSA-N Cys-Ser-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N)O IXPSSIBVVKSOIE-SRVKXCTJSA-N 0.000 description 1
- JLZCAZJGWNRXCI-XKBZYTNZSA-N Cys-Thr-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O JLZCAZJGWNRXCI-XKBZYTNZSA-N 0.000 description 1
- SAEVTQWAYDPXMU-KATARQTJSA-N Cys-Thr-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O SAEVTQWAYDPXMU-KATARQTJSA-N 0.000 description 1
- DRXOWZZHCSBUOI-YJRXYDGGSA-N Cys-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CS)N)O DRXOWZZHCSBUOI-YJRXYDGGSA-N 0.000 description 1
- KFYPRIGJTICABD-XGEHTFHBSA-N Cys-Thr-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N)O KFYPRIGJTICABD-XGEHTFHBSA-N 0.000 description 1
- MSWBLPLBSLQVME-XIRDDKMYSA-N Cys-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CS)=CNC2=C1 MSWBLPLBSLQVME-XIRDDKMYSA-N 0.000 description 1
- NMPSRDYYNIYOSJ-IHPCNDPISA-N Cys-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CS)N NMPSRDYYNIYOSJ-IHPCNDPISA-N 0.000 description 1
- ZXGDAZLSOSYSBA-IHRRRGAJSA-N Cys-Val-Phe Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZXGDAZLSOSYSBA-IHRRRGAJSA-N 0.000 description 1
- 201000003883 Cystic fibrosis Diseases 0.000 description 1
- 108010090461 DFG peptide Proteins 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 208000001490 Dengue Diseases 0.000 description 1
- 206010012310 Dengue fever Diseases 0.000 description 1
- 241000710829 Dengue virus group Species 0.000 description 1
- 206010012504 Dermatophytosis Diseases 0.000 description 1
- 208000000655 Distemper Diseases 0.000 description 1
- 101100118093 Drosophila melanogaster eEF1alpha2 gene Proteins 0.000 description 1
- 101150029662 E1 gene Proteins 0.000 description 1
- 101150005585 E3 gene Proteins 0.000 description 1
- 241001115402 Ebolavirus Species 0.000 description 1
- UPEZCKBFRMILAV-JNEQICEOSA-N Ecdysone Natural products O=C1[C@H]2[C@@](C)([C@@H]3C([C@@]4(O)[C@@](C)([C@H]([C@H]([C@@H](O)CCC(O)(C)C)C)CC4)CC3)=C1)C[C@H](O)[C@H](O)C2 UPEZCKBFRMILAV-JNEQICEOSA-N 0.000 description 1
- 241001466953 Echovirus Species 0.000 description 1
- 241000588877 Eikenella Species 0.000 description 1
- 206010014596 Encephalitis Japanese B Diseases 0.000 description 1
- 206010014584 Encephalitis california Diseases 0.000 description 1
- 206010014614 Encephalitis western equine Diseases 0.000 description 1
- 206010053025 Endemic syphilis Diseases 0.000 description 1
- 241000588921 Enterobacteriaceae Species 0.000 description 1
- 241000991587 Enterovirus C Species 0.000 description 1
- 241000700572 Entomopoxvirinae Species 0.000 description 1
- 206010066919 Epidemic polyarthritis Diseases 0.000 description 1
- 108050004280 Epsilon toxin Proteins 0.000 description 1
- 241000710803 Equine arteritis virus Species 0.000 description 1
- 241000713730 Equine infectious anemia virus Species 0.000 description 1
- 241000283073 Equus caballus Species 0.000 description 1
- 241000186810 Erysipelothrix rhusiopathiae Species 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- 101000867232 Escherichia coli Heat-stable enterotoxin II Proteins 0.000 description 1
- 101000788129 Escherichia coli Uncharacterized protein in sul1 3'region Proteins 0.000 description 1
- 101000788370 Escherichia phage P2 Uncharacterized 12.9 kDa protein in GpA 3'region Proteins 0.000 description 1
- 108700039887 Essential Genes Proteins 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 101000803553 Eumenes pomiformis Venom peptide 3 Proteins 0.000 description 1
- 108010076282 Factor IX Proteins 0.000 description 1
- 108010054218 Factor VIII Proteins 0.000 description 1
- 102000001690 Factor VIII Human genes 0.000 description 1
- 241000725579 Feline coronavirus Species 0.000 description 1
- 241000711475 Feline infectious peritonitis virus Species 0.000 description 1
- 241000714165 Feline leukemia virus Species 0.000 description 1
- 241000701915 Feline panleukopenia virus Species 0.000 description 1
- 241000701925 Feline parvovirus Species 0.000 description 1
- 241000282324 Felis Species 0.000 description 1
- 201000006353 Filariasis Diseases 0.000 description 1
- 241000711950 Filoviridae Species 0.000 description 1
- 241000710781 Flaviviridae Species 0.000 description 1
- 208000007212 Foot-and-Mouth Disease Diseases 0.000 description 1
- 241000710198 Foot-and-mouth disease virus Species 0.000 description 1
- 241000589602 Francisella tularensis Species 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- 108091006027 G proteins Proteins 0.000 description 1
- 102000030782 GTP binding Human genes 0.000 description 1
- 108091000058 GTP-Binding Proteins 0.000 description 1
- 101710177291 Gag polyprotein Proteins 0.000 description 1
- 241000701047 Gallid alphaherpesvirus 2 Species 0.000 description 1
- 241000287828 Gallus gallus Species 0.000 description 1
- 101001066288 Gallus gallus GATA-binding factor 3 Proteins 0.000 description 1
- 241000701046 Gammaherpesvirinae Species 0.000 description 1
- 101000787096 Geobacillus stearothermophilus Uncharacterized protein in gldA 3'region Proteins 0.000 description 1
- 201000003641 Glanders Diseases 0.000 description 1
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 1
- UWZLBXOBVKRUFE-HGNGGELXSA-N Gln-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N UWZLBXOBVKRUFE-HGNGGELXSA-N 0.000 description 1
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 1
- MLZRSFQRBDNJON-GUBZILKMSA-N Gln-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MLZRSFQRBDNJON-GUBZILKMSA-N 0.000 description 1
- NUMFTVCBONFQIQ-DRZSPHRISA-N Gln-Ala-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NUMFTVCBONFQIQ-DRZSPHRISA-N 0.000 description 1
- OYTPNWYZORARHL-XHNCKOQMSA-N Gln-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N OYTPNWYZORARHL-XHNCKOQMSA-N 0.000 description 1
- RGXXLQWXBFNXTG-CIUDSAMLSA-N Gln-Arg-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O RGXXLQWXBFNXTG-CIUDSAMLSA-N 0.000 description 1
- KWUSGAIFNHQCBY-DCAQKATOSA-N Gln-Arg-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O KWUSGAIFNHQCBY-DCAQKATOSA-N 0.000 description 1
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 1
- DTMLKCYOQKZXKZ-HJGDQZAQSA-N Gln-Arg-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DTMLKCYOQKZXKZ-HJGDQZAQSA-N 0.000 description 1
- SOBBAYVQSNXYPQ-ACZMJKKPSA-N Gln-Asn-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SOBBAYVQSNXYPQ-ACZMJKKPSA-N 0.000 description 1
- RRYLMJWPWBJFPZ-ACZMJKKPSA-N Gln-Asn-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RRYLMJWPWBJFPZ-ACZMJKKPSA-N 0.000 description 1
- MINZLORERLNSPP-ACZMJKKPSA-N Gln-Asn-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N MINZLORERLNSPP-ACZMJKKPSA-N 0.000 description 1
- ZPDVKYLJTOFQJV-WDSKDSINSA-N Gln-Asn-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ZPDVKYLJTOFQJV-WDSKDSINSA-N 0.000 description 1
- PONUFVLSGMQFAI-AVGNSLFASA-N Gln-Asn-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PONUFVLSGMQFAI-AVGNSLFASA-N 0.000 description 1
- LMPBBFWHCRURJD-LAEOZQHASA-N Gln-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N LMPBBFWHCRURJD-LAEOZQHASA-N 0.000 description 1
- IKDOHQHEFPPGJG-FXQIFTODSA-N Gln-Asp-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IKDOHQHEFPPGJG-FXQIFTODSA-N 0.000 description 1
- JFSNBQJNDMXMQF-XHNCKOQMSA-N Gln-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O JFSNBQJNDMXMQF-XHNCKOQMSA-N 0.000 description 1
- WLODHVXYKYHLJD-ACZMJKKPSA-N Gln-Asp-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N WLODHVXYKYHLJD-ACZMJKKPSA-N 0.000 description 1
- GNDJOCGXGLNCKY-ACZMJKKPSA-N Gln-Cys-Cys Chemical compound N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(O)=O GNDJOCGXGLNCKY-ACZMJKKPSA-N 0.000 description 1
- VNCLJDOTEPPBBD-GUBZILKMSA-N Gln-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N VNCLJDOTEPPBBD-GUBZILKMSA-N 0.000 description 1
- ZDJZEGYVKANKED-NRPADANISA-N Gln-Cys-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O ZDJZEGYVKANKED-NRPADANISA-N 0.000 description 1
- LPYPANUXJGFMGV-FXQIFTODSA-N Gln-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LPYPANUXJGFMGV-FXQIFTODSA-N 0.000 description 1
- XFKUFUJECJUQTQ-CIUDSAMLSA-N Gln-Gln-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XFKUFUJECJUQTQ-CIUDSAMLSA-N 0.000 description 1
- MADFVRSKEIEZHZ-DCAQKATOSA-N Gln-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N MADFVRSKEIEZHZ-DCAQKATOSA-N 0.000 description 1
- RBWKVOSARCFSQQ-FXQIFTODSA-N Gln-Gln-Ser Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O RBWKVOSARCFSQQ-FXQIFTODSA-N 0.000 description 1
- NPTGGVQJYRSMCM-GLLZPBPUSA-N Gln-Gln-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPTGGVQJYRSMCM-GLLZPBPUSA-N 0.000 description 1
- BLOXULLYFRGYKZ-GUBZILKMSA-N Gln-Glu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BLOXULLYFRGYKZ-GUBZILKMSA-N 0.000 description 1
- ZQPOVSJFBBETHQ-CIUDSAMLSA-N Gln-Glu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZQPOVSJFBBETHQ-CIUDSAMLSA-N 0.000 description 1
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 1
- MAGNEQBFSBREJL-DCAQKATOSA-N Gln-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N MAGNEQBFSBREJL-DCAQKATOSA-N 0.000 description 1
- DRDSQGHKTLSNEA-GLLZPBPUSA-N Gln-Glu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DRDSQGHKTLSNEA-GLLZPBPUSA-N 0.000 description 1
- NSNUZSPSADIMJQ-WDSKDSINSA-N Gln-Gly-Asp Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NSNUZSPSADIMJQ-WDSKDSINSA-N 0.000 description 1
- GNMQDOGFWYWPNM-LAEOZQHASA-N Gln-Gly-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)CNC(=O)[C@@H](N)CCC(N)=O)C(O)=O GNMQDOGFWYWPNM-LAEOZQHASA-N 0.000 description 1
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 1
- VGTDBGYFVWOQTI-RYUDHWBXSA-N Gln-Gly-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VGTDBGYFVWOQTI-RYUDHWBXSA-N 0.000 description 1
- NSORZJXKUQFEKL-JGVFFNPUSA-N Gln-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)N)N)C(=O)O NSORZJXKUQFEKL-JGVFFNPUSA-N 0.000 description 1
- GFLNKSQHOBOMNM-AVGNSLFASA-N Gln-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GFLNKSQHOBOMNM-AVGNSLFASA-N 0.000 description 1
- SBHVGKBYOQKAEA-SDDRHHMPSA-N Gln-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SBHVGKBYOQKAEA-SDDRHHMPSA-N 0.000 description 1
- KHGGWBRVRPHFMH-PEFMBERDSA-N Gln-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHGGWBRVRPHFMH-PEFMBERDSA-N 0.000 description 1
- DAAUVRPSZRDMBV-KBIXCLLPSA-N Gln-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N DAAUVRPSZRDMBV-KBIXCLLPSA-N 0.000 description 1
- YRWWJCDWLVXTHN-LAEOZQHASA-N Gln-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N YRWWJCDWLVXTHN-LAEOZQHASA-N 0.000 description 1
- FYAULIGIFPPOAA-ZPFDUUQYSA-N Gln-Ile-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O FYAULIGIFPPOAA-ZPFDUUQYSA-N 0.000 description 1
- QBLMTCRYYTVUQY-GUBZILKMSA-N Gln-Leu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QBLMTCRYYTVUQY-GUBZILKMSA-N 0.000 description 1
- HHQCBFGKQDMWSP-GUBZILKMSA-N Gln-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HHQCBFGKQDMWSP-GUBZILKMSA-N 0.000 description 1
- LGIKBBLQVSWUGK-DCAQKATOSA-N Gln-Leu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGIKBBLQVSWUGK-DCAQKATOSA-N 0.000 description 1
- VUVKKXPCKILIBD-AVGNSLFASA-N Gln-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VUVKKXPCKILIBD-AVGNSLFASA-N 0.000 description 1
- PSERKXGRRADTKA-MNXVOIDGSA-N Gln-Leu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PSERKXGRRADTKA-MNXVOIDGSA-N 0.000 description 1
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 1
- SHAUZYVSXAMYAZ-JYJNAYRXSA-N Gln-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SHAUZYVSXAMYAZ-JYJNAYRXSA-N 0.000 description 1
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 1
- QDXMSSWCEVYOLZ-SZMVWBNQSA-N Gln-Leu-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCC(=O)N)N QDXMSSWCEVYOLZ-SZMVWBNQSA-N 0.000 description 1
- HPCOBEHVEHWREJ-DCAQKATOSA-N Gln-Lys-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HPCOBEHVEHWREJ-DCAQKATOSA-N 0.000 description 1
- JRHPEMVLTRADLJ-AVGNSLFASA-N Gln-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JRHPEMVLTRADLJ-AVGNSLFASA-N 0.000 description 1
- ZEEPYMXTJWIMSN-GUBZILKMSA-N Gln-Lys-Ser Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CO)C(O)=O)NC(=O)[C@@H](N)CCC(N)=O ZEEPYMXTJWIMSN-GUBZILKMSA-N 0.000 description 1
- CELXWPDNIGWCJN-WDCWCFNPSA-N Gln-Lys-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CELXWPDNIGWCJN-WDCWCFNPSA-N 0.000 description 1
- BJPPYOMRAVLXBY-YUMQZZPRSA-N Gln-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N BJPPYOMRAVLXBY-YUMQZZPRSA-N 0.000 description 1
- FALJZCPMTGJOHX-SRVKXCTJSA-N Gln-Met-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O FALJZCPMTGJOHX-SRVKXCTJSA-N 0.000 description 1
- RWCBJYUPAUTWJD-NHCYSSNCSA-N Gln-Met-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O RWCBJYUPAUTWJD-NHCYSSNCSA-N 0.000 description 1
- UESYBOXFJWJVSB-AVGNSLFASA-N Gln-Phe-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O UESYBOXFJWJVSB-AVGNSLFASA-N 0.000 description 1
- QFXNFFZTMFHPST-DZKIICNBSA-N Gln-Phe-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)N)N QFXNFFZTMFHPST-DZKIICNBSA-N 0.000 description 1
- PBYFVIQRFLNQCO-GUBZILKMSA-N Gln-Pro-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O PBYFVIQRFLNQCO-GUBZILKMSA-N 0.000 description 1
- XQDGOJPVMSWZSO-SRVKXCTJSA-N Gln-Pro-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N XQDGOJPVMSWZSO-SRVKXCTJSA-N 0.000 description 1
- UWMDGPFFTKDUIY-HJGDQZAQSA-N Gln-Pro-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O UWMDGPFFTKDUIY-HJGDQZAQSA-N 0.000 description 1
- YPFFHGRJCUBXPX-NHCYSSNCSA-N Gln-Pro-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O)C(O)=O YPFFHGRJCUBXPX-NHCYSSNCSA-N 0.000 description 1
- KVQOVQVGVKDZNW-GUBZILKMSA-N Gln-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N KVQOVQVGVKDZNW-GUBZILKMSA-N 0.000 description 1
- SXFPZRRVWSUYII-KBIXCLLPSA-N Gln-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N SXFPZRRVWSUYII-KBIXCLLPSA-N 0.000 description 1
- BYKZWDGMJLNFJY-XKBZYTNZSA-N Gln-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)O BYKZWDGMJLNFJY-XKBZYTNZSA-N 0.000 description 1
- DUGYCMAIAKAQPB-GLLZPBPUSA-N Gln-Thr-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DUGYCMAIAKAQPB-GLLZPBPUSA-N 0.000 description 1
- UEILCTONAMOGBR-RWRJDSDZSA-N Gln-Thr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UEILCTONAMOGBR-RWRJDSDZSA-N 0.000 description 1
- VLOLPWWCNKWRNB-LOKLDPHHSA-N Gln-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O VLOLPWWCNKWRNB-LOKLDPHHSA-N 0.000 description 1
- HLRLXVPRJJITSK-IFFSRLJSSA-N Gln-Thr-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HLRLXVPRJJITSK-IFFSRLJSSA-N 0.000 description 1
- OEIDWQHTRYEYGG-QEJZJMRPSA-N Gln-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N OEIDWQHTRYEYGG-QEJZJMRPSA-N 0.000 description 1
- ZZLDMBMFKZFQMU-NRPADANISA-N Gln-Val-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O ZZLDMBMFKZFQMU-NRPADANISA-N 0.000 description 1
- ICRKQMRFXYDYMK-LAEOZQHASA-N Gln-Val-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ICRKQMRFXYDYMK-LAEOZQHASA-N 0.000 description 1
- VDMABHYXBULDGN-LAEOZQHASA-N Gln-Val-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VDMABHYXBULDGN-LAEOZQHASA-N 0.000 description 1
- KHHDJQRWIFHXHS-NRPADANISA-N Gln-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHHDJQRWIFHXHS-NRPADANISA-N 0.000 description 1
- BBFCMGBMYIAGRS-AUTRQRHGSA-N Gln-Val-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BBFCMGBMYIAGRS-AUTRQRHGSA-N 0.000 description 1
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 1
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 1
- FHPXTPQBODWBIY-CIUDSAMLSA-N Glu-Ala-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHPXTPQBODWBIY-CIUDSAMLSA-N 0.000 description 1
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 1
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 1
- BPDVTFBJZNBHEU-HGNGGELXSA-N Glu-Ala-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 BPDVTFBJZNBHEU-HGNGGELXSA-N 0.000 description 1
- HUWSBFYAGXCXKC-CIUDSAMLSA-N Glu-Ala-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O HUWSBFYAGXCXKC-CIUDSAMLSA-N 0.000 description 1
- ATRHMOJQJWPVBQ-DRZSPHRISA-N Glu-Ala-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ATRHMOJQJWPVBQ-DRZSPHRISA-N 0.000 description 1
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 description 1
- AVZHGSCDKIQZPQ-CIUDSAMLSA-N Glu-Arg-Ala Chemical compound C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AVZHGSCDKIQZPQ-CIUDSAMLSA-N 0.000 description 1
- WOMUDRVDJMHTCV-DCAQKATOSA-N Glu-Arg-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOMUDRVDJMHTCV-DCAQKATOSA-N 0.000 description 1
- PBEQPAZRHDVJQI-SRVKXCTJSA-N Glu-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N PBEQPAZRHDVJQI-SRVKXCTJSA-N 0.000 description 1
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 1
- LTUVYLVIZHJCOQ-KKUMJFAQSA-N Glu-Arg-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LTUVYLVIZHJCOQ-KKUMJFAQSA-N 0.000 description 1
- FLLRAEJOLZPSMN-CIUDSAMLSA-N Glu-Asn-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLLRAEJOLZPSMN-CIUDSAMLSA-N 0.000 description 1
- MLCPTRRNICEKIS-FXQIFTODSA-N Glu-Asn-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLCPTRRNICEKIS-FXQIFTODSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- LXAUHIRMWXQRKI-XHNCKOQMSA-N Glu-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O LXAUHIRMWXQRKI-XHNCKOQMSA-N 0.000 description 1
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 1
- VAIWPXWHWAPYDF-FXQIFTODSA-N Glu-Asp-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O VAIWPXWHWAPYDF-FXQIFTODSA-N 0.000 description 1
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 1
- CYHBMLHCQXXCCT-AVGNSLFASA-N Glu-Asp-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CYHBMLHCQXXCCT-AVGNSLFASA-N 0.000 description 1
- PKYAVRMYTBBRLS-FXQIFTODSA-N Glu-Cys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O PKYAVRMYTBBRLS-FXQIFTODSA-N 0.000 description 1
- ZZIFPJZQHRJERU-WDSKDSINSA-N Glu-Cys-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O ZZIFPJZQHRJERU-WDSKDSINSA-N 0.000 description 1
- OWVURWCRZZMAOZ-XHNCKOQMSA-N Glu-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N)C(=O)O OWVURWCRZZMAOZ-XHNCKOQMSA-N 0.000 description 1
- GFLQTABMFBXRIY-GUBZILKMSA-N Glu-Gln-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GFLQTABMFBXRIY-GUBZILKMSA-N 0.000 description 1
- XHUCVVHRLNPZSZ-CIUDSAMLSA-N Glu-Gln-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XHUCVVHRLNPZSZ-CIUDSAMLSA-N 0.000 description 1
- PXHABOCPJVTGEK-BQBZGAKWSA-N Glu-Gln-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O PXHABOCPJVTGEK-BQBZGAKWSA-N 0.000 description 1
- RFDHKPSHTXZKLL-IHRRRGAJSA-N Glu-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N RFDHKPSHTXZKLL-IHRRRGAJSA-N 0.000 description 1
- WPLGNDORMXTMQS-FXQIFTODSA-N Glu-Gln-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O WPLGNDORMXTMQS-FXQIFTODSA-N 0.000 description 1
- VFZIDQZAEBORGY-GLLZPBPUSA-N Glu-Gln-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VFZIDQZAEBORGY-GLLZPBPUSA-N 0.000 description 1
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 1
- QQLBPVKLJBAXBS-FXQIFTODSA-N Glu-Glu-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QQLBPVKLJBAXBS-FXQIFTODSA-N 0.000 description 1
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 1
- KASDBWKLWJKTLJ-GUBZILKMSA-N Glu-Glu-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O KASDBWKLWJKTLJ-GUBZILKMSA-N 0.000 description 1
- YLJHCWNDBKKOEB-IHRRRGAJSA-N Glu-Glu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YLJHCWNDBKKOEB-IHRRRGAJSA-N 0.000 description 1
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 1
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 1
- UHVIQGKBMXEVGN-WDSKDSINSA-N Glu-Gly-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UHVIQGKBMXEVGN-WDSKDSINSA-N 0.000 description 1
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 1
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 1
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 1
- BRKUZSLQMPNVFN-SRVKXCTJSA-N Glu-His-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BRKUZSLQMPNVFN-SRVKXCTJSA-N 0.000 description 1
- JGHNIWVNCAOVRO-DCAQKATOSA-N Glu-His-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JGHNIWVNCAOVRO-DCAQKATOSA-N 0.000 description 1
- WVTIBGWZUMJBFY-GUBZILKMSA-N Glu-His-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O WVTIBGWZUMJBFY-GUBZILKMSA-N 0.000 description 1
- CXRWMMRLEMVSEH-PEFMBERDSA-N Glu-Ile-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CXRWMMRLEMVSEH-PEFMBERDSA-N 0.000 description 1
- BKRQSECBKKCCKW-HVTMNAMFSA-N Glu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N BKRQSECBKKCCKW-HVTMNAMFSA-N 0.000 description 1
- GRHXUHCFENOCOS-ZPFDUUQYSA-N Glu-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N GRHXUHCFENOCOS-ZPFDUUQYSA-N 0.000 description 1
- WTMZXOPHTIVFCP-QEWYBTABSA-N Glu-Ile-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WTMZXOPHTIVFCP-QEWYBTABSA-N 0.000 description 1
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 1
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 1
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 1
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 1
- DWBBKNPKDHXIAC-SRVKXCTJSA-N Glu-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCC(O)=O DWBBKNPKDHXIAC-SRVKXCTJSA-N 0.000 description 1
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 1
- JJSVALISDCNFCU-SZMVWBNQSA-N Glu-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O JJSVALISDCNFCU-SZMVWBNQSA-N 0.000 description 1
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 1
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 1
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 1
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 1
- AOCARQDSFTWWFT-DCAQKATOSA-N Glu-Met-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AOCARQDSFTWWFT-DCAQKATOSA-N 0.000 description 1
- XNOWYPDMSLSRKP-GUBZILKMSA-N Glu-Met-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(O)=O XNOWYPDMSLSRKP-GUBZILKMSA-N 0.000 description 1
- MCGNJCNXIMQCMN-DCAQKATOSA-N Glu-Met-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCC(O)=O MCGNJCNXIMQCMN-DCAQKATOSA-N 0.000 description 1
- PMSMKNYRZCKVMC-DRZSPHRISA-N Glu-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)O)N PMSMKNYRZCKVMC-DRZSPHRISA-N 0.000 description 1
- JZJGEKDPWVJOLD-QEWYBTABSA-N Glu-Phe-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JZJGEKDPWVJOLD-QEWYBTABSA-N 0.000 description 1
- ZIYGTCDTJJCDDP-JYJNAYRXSA-N Glu-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZIYGTCDTJJCDDP-JYJNAYRXSA-N 0.000 description 1
- CHDWDBPJOZVZSE-KKUMJFAQSA-N Glu-Phe-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O CHDWDBPJOZVZSE-KKUMJFAQSA-N 0.000 description 1
- QJVZSVUYZFYLFQ-CIUDSAMLSA-N Glu-Pro-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O QJVZSVUYZFYLFQ-CIUDSAMLSA-N 0.000 description 1
- ZKONLKQGTNVAPR-DCAQKATOSA-N Glu-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N ZKONLKQGTNVAPR-DCAQKATOSA-N 0.000 description 1
- DCBSZJJHOTXMHY-DCAQKATOSA-N Glu-Pro-Pro Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DCBSZJJHOTXMHY-DCAQKATOSA-N 0.000 description 1
- SWDNPSMMEWRNOH-HJGDQZAQSA-N Glu-Pro-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWDNPSMMEWRNOH-HJGDQZAQSA-N 0.000 description 1
- JPUNZXVHHRZMNL-XIRDDKMYSA-N Glu-Pro-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JPUNZXVHHRZMNL-XIRDDKMYSA-N 0.000 description 1
- ALMBZBOCGSVSAI-ACZMJKKPSA-N Glu-Ser-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ALMBZBOCGSVSAI-ACZMJKKPSA-N 0.000 description 1
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 1
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 1
- QOXDAWODGSIDDI-GUBZILKMSA-N Glu-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N QOXDAWODGSIDDI-GUBZILKMSA-N 0.000 description 1
- BXSZPACYCMNKLS-AVGNSLFASA-N Glu-Ser-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BXSZPACYCMNKLS-AVGNSLFASA-N 0.000 description 1
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 1
- JWNZHMSRZXXGTM-XKBZYTNZSA-N Glu-Ser-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWNZHMSRZXXGTM-XKBZYTNZSA-N 0.000 description 1
- TZXOPHFCAATANZ-QEJZJMRPSA-N Glu-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N TZXOPHFCAATANZ-QEJZJMRPSA-N 0.000 description 1
- WXONSNSSBYQGNN-AVGNSLFASA-N Glu-Ser-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WXONSNSSBYQGNN-AVGNSLFASA-N 0.000 description 1
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 1
- BDISFWMLMNBTGP-NUMRIWBASA-N Glu-Thr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O BDISFWMLMNBTGP-NUMRIWBASA-N 0.000 description 1
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 1
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 1
- CQGBSALYGOXQPE-HTUGSXCWSA-N Glu-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O CQGBSALYGOXQPE-HTUGSXCWSA-N 0.000 description 1
- VJVAQZYGLMJPTK-QEJZJMRPSA-N Glu-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VJVAQZYGLMJPTK-QEJZJMRPSA-N 0.000 description 1
- BPCLDCNZBUYGOD-BPUTZDHNSA-N Glu-Trp-Glu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 BPCLDCNZBUYGOD-BPUTZDHNSA-N 0.000 description 1
- HGJREIGJLUQBTJ-SZMVWBNQSA-N Glu-Trp-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O HGJREIGJLUQBTJ-SZMVWBNQSA-N 0.000 description 1
- CGWHAXBNGYQBBK-JBACZVJFSA-N Glu-Trp-Tyr Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CCC(O)=O)N)C(O)=O)C1=CC=C(O)C=C1 CGWHAXBNGYQBBK-JBACZVJFSA-N 0.000 description 1
- MIWJDJAMMKHUAR-ZVZYQTTQSA-N Glu-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)O)N MIWJDJAMMKHUAR-ZVZYQTTQSA-N 0.000 description 1
- HHSKZJZWQFPSKN-AVGNSLFASA-N Glu-Tyr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O HHSKZJZWQFPSKN-AVGNSLFASA-N 0.000 description 1
- UUTGYDAKPISJAO-JYJNAYRXSA-N Glu-Tyr-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 UUTGYDAKPISJAO-JYJNAYRXSA-N 0.000 description 1
- QLNKFGTZOBVMCS-JBACZVJFSA-N Glu-Tyr-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O QLNKFGTZOBVMCS-JBACZVJFSA-N 0.000 description 1
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 1
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 1
- YQPFCZVKMUVZIN-AUTRQRHGSA-N Glu-Val-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQPFCZVKMUVZIN-AUTRQRHGSA-N 0.000 description 1
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 1
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 1
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 1
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 1
- XIJOPMSILDNVNJ-ZVZYQTTQSA-N Glu-Val-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O XIJOPMSILDNVNJ-ZVZYQTTQSA-N 0.000 description 1
- QRWPTXLWHHTOCO-DZKIICNBSA-N Glu-Val-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QRWPTXLWHHTOCO-DZKIICNBSA-N 0.000 description 1
- 102000051325 Glucagon Human genes 0.000 description 1
- 108060003199 Glucagon Proteins 0.000 description 1
- 102000003676 Glucocorticoid Receptors Human genes 0.000 description 1
- 108090000079 Glucocorticoid Receptors Proteins 0.000 description 1
- 108010015776 Glucose oxidase Proteins 0.000 description 1
- 239000004366 Glucose oxidase Substances 0.000 description 1
- 102000003638 Glucose-6-Phosphatase Human genes 0.000 description 1
- 108010086800 Glucose-6-Phosphatase Proteins 0.000 description 1
- 108010018962 Glucosephosphate Dehydrogenase Proteins 0.000 description 1
- 108010015451 Glutaryl-CoA Dehydrogenase Proteins 0.000 description 1
- 102100028603 Glutaryl-CoA dehydrogenase, mitochondrial Human genes 0.000 description 1
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 1
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 1
- GZUKEVBTYNNUQF-WDSKDSINSA-N Gly-Ala-Gln Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GZUKEVBTYNNUQF-WDSKDSINSA-N 0.000 description 1
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 1
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 1
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 1
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 1
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 1
- VXKCPBPQEKKERH-IUCAKERBSA-N Gly-Arg-Pro Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N1CCC[C@H]1C(O)=O VXKCPBPQEKKERH-IUCAKERBSA-N 0.000 description 1
- AIJAPFVDBFYNKN-WHFBIAKZSA-N Gly-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN)C(=O)N AIJAPFVDBFYNKN-WHFBIAKZSA-N 0.000 description 1
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 description 1
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 1
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 1
- XQHSBNVACKQWAV-WHFBIAKZSA-N Gly-Asp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XQHSBNVACKQWAV-WHFBIAKZSA-N 0.000 description 1
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 1
- GZBZACMXFIPIDX-WHFBIAKZSA-N Gly-Cys-Asp Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN)C(=O)O GZBZACMXFIPIDX-WHFBIAKZSA-N 0.000 description 1
- CEXINUGNTZFNRY-BYPYZUCNSA-N Gly-Cys-Gly Chemical compound [NH3+]CC(=O)N[C@@H](CS)C(=O)NCC([O-])=O CEXINUGNTZFNRY-BYPYZUCNSA-N 0.000 description 1
- LGQZOQRDEUIZJY-YUMQZZPRSA-N Gly-Cys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CS)NC(=O)CN)C(O)=O LGQZOQRDEUIZJY-YUMQZZPRSA-N 0.000 description 1
- SABZDFAAOJATBR-QWRGUYRKSA-N Gly-Cys-Phe Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SABZDFAAOJATBR-QWRGUYRKSA-N 0.000 description 1
- UEGIPZAXNBYCCP-NKWVEPMBSA-N Gly-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)CN)C(=O)O UEGIPZAXNBYCCP-NKWVEPMBSA-N 0.000 description 1
- VNBNZUAPOYGRDB-ZDLURKLDSA-N Gly-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN)O VNBNZUAPOYGRDB-ZDLURKLDSA-N 0.000 description 1
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 1
- KTSZUNRRYXPZTK-BQBZGAKWSA-N Gly-Gln-Glu Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KTSZUNRRYXPZTK-BQBZGAKWSA-N 0.000 description 1
- LXXANCRPFBSSKS-IUCAKERBSA-N Gly-Gln-Leu Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LXXANCRPFBSSKS-IUCAKERBSA-N 0.000 description 1
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 description 1
- NPSWCZIRBAYNSB-JHEQGTHGSA-N Gly-Gln-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPSWCZIRBAYNSB-JHEQGTHGSA-N 0.000 description 1
- QPDUVFSVVAOUHE-XVKPBYJWSA-N Gly-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)CN)C(O)=O QPDUVFSVVAOUHE-XVKPBYJWSA-N 0.000 description 1
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 1
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 1
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 1
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 1
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 1
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 1
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 1
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 1
- FQKKPCWTZZEDIC-XPUUQOCRSA-N Gly-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 FQKKPCWTZZEDIC-XPUUQOCRSA-N 0.000 description 1
- AYBKPDHHVADEDA-YUMQZZPRSA-N Gly-His-Asn Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O AYBKPDHHVADEDA-YUMQZZPRSA-N 0.000 description 1
- MVORZMQFXBLMHM-QWRGUYRKSA-N Gly-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 MVORZMQFXBLMHM-QWRGUYRKSA-N 0.000 description 1
- HPAIKDPJURGQLN-KBPBESRZSA-N Gly-His-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CNC=N1 HPAIKDPJURGQLN-KBPBESRZSA-N 0.000 description 1
- QSVMIMFAAZPCAQ-PMVVWTBXSA-N Gly-His-Thr Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QSVMIMFAAZPCAQ-PMVVWTBXSA-N 0.000 description 1
- ALOBJFDJTMQQPW-ONGXEEELSA-N Gly-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN ALOBJFDJTMQQPW-ONGXEEELSA-N 0.000 description 1
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 1
- SXJHOPPTOJACOA-QXEWZRGKSA-N Gly-Ile-Arg Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SXJHOPPTOJACOA-QXEWZRGKSA-N 0.000 description 1
- VIIBEIQMLJEUJG-LAEOZQHASA-N Gly-Ile-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O VIIBEIQMLJEUJG-LAEOZQHASA-N 0.000 description 1
- HKSNHPVETYYJBK-LAEOZQHASA-N Gly-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN HKSNHPVETYYJBK-LAEOZQHASA-N 0.000 description 1
- AAHSHTLISQUZJL-QSFUFRPTSA-N Gly-Ile-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AAHSHTLISQUZJL-QSFUFRPTSA-N 0.000 description 1
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 1
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 1
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 1
- XVYKMNXXJXQKME-XEGUGMAKSA-N Gly-Ile-Tyr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XVYKMNXXJXQKME-XEGUGMAKSA-N 0.000 description 1
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 1
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 1
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 1
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 1
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 1
- YSDLIYZLOTZZNP-UWVGGRQHSA-N Gly-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN YSDLIYZLOTZZNP-UWVGGRQHSA-N 0.000 description 1
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 1
- CLNSYANKYVMZNM-UWVGGRQHSA-N Gly-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CLNSYANKYVMZNM-UWVGGRQHSA-N 0.000 description 1
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 1
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 1
- IUKIDFVOUHZRAK-QWRGUYRKSA-N Gly-Lys-His Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IUKIDFVOUHZRAK-QWRGUYRKSA-N 0.000 description 1
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 1
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 1
- CVFOYJJOZYYEPE-KBPBESRZSA-N Gly-Lys-Tyr Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CVFOYJJOZYYEPE-KBPBESRZSA-N 0.000 description 1
- ICUTTWWCDIIIEE-BQBZGAKWSA-N Gly-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN ICUTTWWCDIIIEE-BQBZGAKWSA-N 0.000 description 1
- QGDOOCIPHSSADO-STQMWFEESA-N Gly-Met-Phe Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QGDOOCIPHSSADO-STQMWFEESA-N 0.000 description 1
- MDKCBHZLQJZOCJ-STQMWFEESA-N Gly-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)CN MDKCBHZLQJZOCJ-STQMWFEESA-N 0.000 description 1
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 1
- SCJJPCQUJYPHRZ-BQBZGAKWSA-N Gly-Pro-Asn Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O SCJJPCQUJYPHRZ-BQBZGAKWSA-N 0.000 description 1
- SSFWXSNOKDZNHY-QXEWZRGKSA-N Gly-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN SSFWXSNOKDZNHY-QXEWZRGKSA-N 0.000 description 1
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 1
- IXHQLZIWBCQBLQ-STQMWFEESA-N Gly-Pro-Phe Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IXHQLZIWBCQBLQ-STQMWFEESA-N 0.000 description 1
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 1
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 1
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 1
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 1
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 1
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 1
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 1
- YXTFLTJYLIAZQG-FJXKBIBVSA-N Gly-Thr-Arg Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YXTFLTJYLIAZQG-FJXKBIBVSA-N 0.000 description 1
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 1
- CQMFNTVQVLQRLT-JHEQGTHGSA-N Gly-Thr-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CQMFNTVQVLQRLT-JHEQGTHGSA-N 0.000 description 1
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 1
- XHVONGZZVUUORG-WEDXCCLWSA-N Gly-Thr-Lys Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN XHVONGZZVUUORG-WEDXCCLWSA-N 0.000 description 1
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 1
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 1
- FOKISINOENBSDM-WLTAIBSBSA-N Gly-Thr-Tyr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FOKISINOENBSDM-WLTAIBSBSA-N 0.000 description 1
- GULGDABMYTYMJZ-STQMWFEESA-N Gly-Trp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O GULGDABMYTYMJZ-STQMWFEESA-N 0.000 description 1
- LKJCZEPXHOIAIW-HOTGVXAUSA-N Gly-Trp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN LKJCZEPXHOIAIW-HOTGVXAUSA-N 0.000 description 1
- ONSARSFSJHTMFJ-STQMWFEESA-N Gly-Trp-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ONSARSFSJHTMFJ-STQMWFEESA-N 0.000 description 1
- UVTSZKIATYSKIR-RYUDHWBXSA-N Gly-Tyr-Glu Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O UVTSZKIATYSKIR-RYUDHWBXSA-N 0.000 description 1
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 1
- GBYYQVBXFVDJPJ-WLTAIBSBSA-N Gly-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)CN)O GBYYQVBXFVDJPJ-WLTAIBSBSA-N 0.000 description 1
- NGBGZCUWFVVJKC-IRXDYDNUSA-N Gly-Tyr-Tyr Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NGBGZCUWFVVJKC-IRXDYDNUSA-N 0.000 description 1
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 1
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 1
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 1
- MUGLKCQHTUFLGF-WPRPVWTQSA-N Gly-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)CN MUGLKCQHTUFLGF-WPRPVWTQSA-N 0.000 description 1
- IZVICCORZOSGPT-JSGCOSHPSA-N Gly-Val-Tyr Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IZVICCORZOSGPT-JSGCOSHPSA-N 0.000 description 1
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 1
- 108090000826 Glycine dehydrogenase (decarboxylating) Proteins 0.000 description 1
- 102000004327 Glycine dehydrogenase (decarboxylating) Human genes 0.000 description 1
- 102000003886 Glycoproteins Human genes 0.000 description 1
- 108090000288 Glycoproteins Proteins 0.000 description 1
- 108060003393 Granulin Proteins 0.000 description 1
- 108010017213 Granulocyte-Macrophage Colony-Stimulating Factor Proteins 0.000 description 1
- 102100039620 Granulocyte-macrophage colony-stimulating factor Human genes 0.000 description 1
- 206010072579 Granulomatosis with polyangiitis Diseases 0.000 description 1
- 241000606790 Haemophilus Species 0.000 description 1
- 101000976889 Haemophilus phage HP1 (strain HP1c1) Uncharacterized 19.2 kDa protein in cox-rep intergenic region Proteins 0.000 description 1
- 101000852023 Halorubrum pleomorphic virus 1 Envelope protein Proteins 0.000 description 1
- 101000583961 Halorubrum pleomorphic virus 1 Matrix protein Proteins 0.000 description 1
- 208000030836 Hashimoto thyroiditis Diseases 0.000 description 1
- 108090000031 Hedgehog Proteins Proteins 0.000 description 1
- 102000003693 Hedgehog Proteins Human genes 0.000 description 1
- 208000005176 Hepatitis C Diseases 0.000 description 1
- 241000724709 Hepatitis delta virus Species 0.000 description 1
- 241000709721 Hepatovirus A Species 0.000 description 1
- 108010068250 Herpes Simplex Virus Protein Vmw65 Proteins 0.000 description 1
- 208000007514 Herpes zoster Diseases 0.000 description 1
- 102000005548 Hexokinase Human genes 0.000 description 1
- 108700040460 Hexokinases Proteins 0.000 description 1
- AWHJQEYGWRKPHE-LSJOCFKGSA-N His-Ala-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AWHJQEYGWRKPHE-LSJOCFKGSA-N 0.000 description 1
- MJNWEIMBXKKCSF-XVYDVKMFSA-N His-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N MJNWEIMBXKKCSF-XVYDVKMFSA-N 0.000 description 1
- AFPFGFUGETYOSY-HGNGGELXSA-N His-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AFPFGFUGETYOSY-HGNGGELXSA-N 0.000 description 1
- IPIVXQQRZXEUGW-UWJYBYFXSA-N His-Ala-His Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 IPIVXQQRZXEUGW-UWJYBYFXSA-N 0.000 description 1
- FLUVGKKRRMLNPU-CQDKDKBSSA-N His-Ala-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FLUVGKKRRMLNPU-CQDKDKBSSA-N 0.000 description 1
- HDXNWVLQSQFJOX-SRVKXCTJSA-N His-Arg-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HDXNWVLQSQFJOX-SRVKXCTJSA-N 0.000 description 1
- PROLDOGUBQJNPG-RWMBFGLXSA-N His-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O PROLDOGUBQJNPG-RWMBFGLXSA-N 0.000 description 1
- MDBYBTWRMOAJAY-NHCYSSNCSA-N His-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N MDBYBTWRMOAJAY-NHCYSSNCSA-N 0.000 description 1
- LSQHWKPPOFDHHZ-YUMQZZPRSA-N His-Asp-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N LSQHWKPPOFDHHZ-YUMQZZPRSA-N 0.000 description 1
- ZJSMFRTVYSLKQU-DJFWLOJKSA-N His-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ZJSMFRTVYSLKQU-DJFWLOJKSA-N 0.000 description 1
- YOSQCYUFZGPIPC-PBCZWWQYSA-N His-Asp-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YOSQCYUFZGPIPC-PBCZWWQYSA-N 0.000 description 1
- QQJMARNOLHSJCQ-DCAQKATOSA-N His-Cys-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N QQJMARNOLHSJCQ-DCAQKATOSA-N 0.000 description 1
- LDTJBEOANMQRJE-CIUDSAMLSA-N His-Cys-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LDTJBEOANMQRJE-CIUDSAMLSA-N 0.000 description 1
- MWXBCJKQRQFVOO-DCAQKATOSA-N His-Cys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CN=CN1)N MWXBCJKQRQFVOO-DCAQKATOSA-N 0.000 description 1
- LIEIYPBMQJLASB-SRVKXCTJSA-N His-Gln-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 LIEIYPBMQJLASB-SRVKXCTJSA-N 0.000 description 1
- VHHYJBSXXMPQGZ-AVGNSLFASA-N His-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N VHHYJBSXXMPQGZ-AVGNSLFASA-N 0.000 description 1
- STWGDDDFLUFCCA-GVXVVHGQSA-N His-Glu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O STWGDDDFLUFCCA-GVXVVHGQSA-N 0.000 description 1
- RGPWUJOMKFYFSR-QWRGUYRKSA-N His-Gly-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O RGPWUJOMKFYFSR-QWRGUYRKSA-N 0.000 description 1
- CSTNMMIHMYJGFR-IHRRRGAJSA-N His-His-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CN=CN1 CSTNMMIHMYJGFR-IHRRRGAJSA-N 0.000 description 1
- ZRSJXIKQXUGKRB-TUBUOCAGSA-N His-Ile-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZRSJXIKQXUGKRB-TUBUOCAGSA-N 0.000 description 1
- UROVZOUMHNXPLZ-AVGNSLFASA-N His-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 UROVZOUMHNXPLZ-AVGNSLFASA-N 0.000 description 1
- AIPUZFXMXAHZKY-QWRGUYRKSA-N His-Leu-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AIPUZFXMXAHZKY-QWRGUYRKSA-N 0.000 description 1
- RNMNYMDTESKEAJ-KKUMJFAQSA-N His-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 RNMNYMDTESKEAJ-KKUMJFAQSA-N 0.000 description 1
- KHUFDBQXGLEIHC-BZSNNMDCSA-N His-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 KHUFDBQXGLEIHC-BZSNNMDCSA-N 0.000 description 1
- TWROVBNEHJSXDG-IHRRRGAJSA-N His-Leu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O TWROVBNEHJSXDG-IHRRRGAJSA-N 0.000 description 1
- BMZLDCQIWUHVRS-DCAQKATOSA-N His-Met-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC1=CN=CN1 BMZLDCQIWUHVRS-DCAQKATOSA-N 0.000 description 1
- HJUPAYWVVVRYFQ-PYJNHQTQSA-N His-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CN=CN1)N HJUPAYWVVVRYFQ-PYJNHQTQSA-N 0.000 description 1
- RNAYRCNHRYEBTH-IHRRRGAJSA-N His-Met-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O RNAYRCNHRYEBTH-IHRRRGAJSA-N 0.000 description 1
- YAEKRYQASVCDLK-JYJNAYRXSA-N His-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N YAEKRYQASVCDLK-JYJNAYRXSA-N 0.000 description 1
- HYWZHNUGAYVEEW-KKUMJFAQSA-N His-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N HYWZHNUGAYVEEW-KKUMJFAQSA-N 0.000 description 1
- SOYCWSKCUVDLMC-AVGNSLFASA-N His-Pro-Arg Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N2CCC[C@H]2C(=O)N[C@@H](CCCNC(=N)N)C(=O)O SOYCWSKCUVDLMC-AVGNSLFASA-N 0.000 description 1
- BZAQOPHNBFOOJS-DCAQKATOSA-N His-Pro-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O BZAQOPHNBFOOJS-DCAQKATOSA-N 0.000 description 1
- XJFITURPHAKKAI-SRVKXCTJSA-N His-Pro-Gln Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CN=CN1 XJFITURPHAKKAI-SRVKXCTJSA-N 0.000 description 1
- QCBYAHHNOHBXIH-UWVGGRQHSA-N His-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CN=CN1 QCBYAHHNOHBXIH-UWVGGRQHSA-N 0.000 description 1
- PBVQWNDMFFCPIZ-ULQDDVLXSA-N His-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 PBVQWNDMFFCPIZ-ULQDDVLXSA-N 0.000 description 1
- YEKYGQZUBCRNGH-DCAQKATOSA-N His-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CN=CN2)N)C(=O)N[C@@H](CO)C(=O)O YEKYGQZUBCRNGH-DCAQKATOSA-N 0.000 description 1
- FHKZHRMERJUXRJ-DCAQKATOSA-N His-Ser-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 FHKZHRMERJUXRJ-DCAQKATOSA-N 0.000 description 1
- CWSZWFILCNSNEX-CIUDSAMLSA-N His-Ser-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CWSZWFILCNSNEX-CIUDSAMLSA-N 0.000 description 1
- ZHHLTWUOWXHVQJ-YUMQZZPRSA-N His-Ser-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZHHLTWUOWXHVQJ-YUMQZZPRSA-N 0.000 description 1
- XHQYFGPIRUHQIB-PBCZWWQYSA-N His-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CN=CN1 XHQYFGPIRUHQIB-PBCZWWQYSA-N 0.000 description 1
- MDOBWSFNSNPENN-PMVVWTBXSA-N His-Thr-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O MDOBWSFNSNPENN-PMVVWTBXSA-N 0.000 description 1
- CCUSLCQWVMWTIS-IXOXFDKPSA-N His-Thr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O CCUSLCQWVMWTIS-IXOXFDKPSA-N 0.000 description 1
- SWBUZLFWGJETAO-KKUMJFAQSA-N His-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O SWBUZLFWGJETAO-KKUMJFAQSA-N 0.000 description 1
- GGXUJBKENKVYNV-ULQDDVLXSA-N His-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N GGXUJBKENKVYNV-ULQDDVLXSA-N 0.000 description 1
- FBOMZVOKCZMDIG-XQQFMLRXSA-N His-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N FBOMZVOKCZMDIG-XQQFMLRXSA-N 0.000 description 1
- GBMSSORHVHAYLU-QTKMDUPCSA-N His-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N)O GBMSSORHVHAYLU-QTKMDUPCSA-N 0.000 description 1
- 201000002563 Histoplasmosis Diseases 0.000 description 1
- 101150068639 Hnf4a gene Proteins 0.000 description 1
- 101000897400 Homo sapiens CD59 glycoprotein Proteins 0.000 description 1
- 101000941045 Homo sapiens Cortactin-binding protein 2 Proteins 0.000 description 1
- 101000974934 Homo sapiens Cyclic AMP-dependent transcription factor ATF-2 Proteins 0.000 description 1
- 101000905743 Homo sapiens Cyclic AMP-dependent transcription factor ATF-4 Proteins 0.000 description 1
- 101000997829 Homo sapiens Glial cell line-derived neurotrophic factor Proteins 0.000 description 1
- 101000979333 Homo sapiens Neurofilament light polypeptide Proteins 0.000 description 1
- 101000801195 Homo sapiens TLE family member 5 Proteins 0.000 description 1
- 101000837845 Homo sapiens Transcription factor E3 Proteins 0.000 description 1
- 101000837829 Homo sapiens Transcription factor IIIA Proteins 0.000 description 1
- 101000666856 Homo sapiens Vasoactive intestinal polypeptide receptor 1 Proteins 0.000 description 1
- 241001428587 Human adenovirus 16 Species 0.000 description 1
- 241000701109 Human adenovirus 2 Species 0.000 description 1
- 241000701096 Human adenovirus 7 Species 0.000 description 1
- 241001135572 Human adenovirus E4 Species 0.000 description 1
- 241000701024 Human betaherpesvirus 5 Species 0.000 description 1
- 241001207270 Human enterovirus Species 0.000 description 1
- 241000701044 Human gammaherpesvirus 4 Species 0.000 description 1
- 241000713772 Human immunodeficiency virus 1 Species 0.000 description 1
- 241000726041 Human respirovirus 1 Species 0.000 description 1
- 241000712003 Human respirovirus 3 Species 0.000 description 1
- 241001559187 Human rubulavirus 2 Species 0.000 description 1
- 241001559186 Human rubulavirus 4 Species 0.000 description 1
- 102000004157 Hydrolases Human genes 0.000 description 1
- 108090000604 Hydrolases Proteins 0.000 description 1
- 108010056651 Hydroxymethylbilane synthase Proteins 0.000 description 1
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 1
- YKRYHWJRQUSTKG-KBIXCLLPSA-N Ile-Ala-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKRYHWJRQUSTKG-KBIXCLLPSA-N 0.000 description 1
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 1
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 1
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 1
- HERITAGIPLEJMT-GVARAGBVSA-N Ile-Ala-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HERITAGIPLEJMT-GVARAGBVSA-N 0.000 description 1
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 1
- QLRMMMQNCWBNPQ-QXEWZRGKSA-N Ile-Arg-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N QLRMMMQNCWBNPQ-QXEWZRGKSA-N 0.000 description 1
- LVQDUPQUJZWKSU-PYJNHQTQSA-N Ile-Arg-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LVQDUPQUJZWKSU-PYJNHQTQSA-N 0.000 description 1
- UNDGQKWQNSTPPW-CYDGBPFRSA-N Ile-Arg-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCSC)C(=O)O)N UNDGQKWQNSTPPW-CYDGBPFRSA-N 0.000 description 1
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 1
- CWJQMCPYXNVMBS-STECZYCISA-N Ile-Arg-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N CWJQMCPYXNVMBS-STECZYCISA-N 0.000 description 1
- AZEYWPUCOYXFOE-CYDGBPFRSA-N Ile-Arg-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N AZEYWPUCOYXFOE-CYDGBPFRSA-N 0.000 description 1
- SCHZQZPYHBWYEQ-PEFMBERDSA-N Ile-Asn-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SCHZQZPYHBWYEQ-PEFMBERDSA-N 0.000 description 1
- XENGULNPUDGALZ-ZPFDUUQYSA-N Ile-Asn-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N XENGULNPUDGALZ-ZPFDUUQYSA-N 0.000 description 1
- ZZHGKECPZXPXJF-PCBIJLKTSA-N Ile-Asn-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZZHGKECPZXPXJF-PCBIJLKTSA-N 0.000 description 1
- CCHSQWLCOOZREA-GMOBBJLQSA-N Ile-Asp-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N CCHSQWLCOOZREA-GMOBBJLQSA-N 0.000 description 1
- REJKOQYVFDEZHA-SLBDDTMCSA-N Ile-Asp-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N REJKOQYVFDEZHA-SLBDDTMCSA-N 0.000 description 1
- FADXGVVLSPPEQY-GHCJXIJMSA-N Ile-Cys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FADXGVVLSPPEQY-GHCJXIJMSA-N 0.000 description 1
- PPSQSIDMOVPKPI-BJDJZHNGSA-N Ile-Cys-Leu Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O PPSQSIDMOVPKPI-BJDJZHNGSA-N 0.000 description 1
- ZDNORQNHCJUVOV-KBIXCLLPSA-N Ile-Gln-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O ZDNORQNHCJUVOV-KBIXCLLPSA-N 0.000 description 1
- OVPYIUNCVSOVNF-KQXIARHKSA-N Ile-Gln-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N OVPYIUNCVSOVNF-KQXIARHKSA-N 0.000 description 1
- OVPYIUNCVSOVNF-ZPFDUUQYSA-N Ile-Gln-Pro Natural products CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O OVPYIUNCVSOVNF-ZPFDUUQYSA-N 0.000 description 1
- DVRDRICMWUSCBN-UKJIMTQDSA-N Ile-Gln-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DVRDRICMWUSCBN-UKJIMTQDSA-N 0.000 description 1
- JXMSHKFPDIUYGS-SIUGBPQLSA-N Ile-Glu-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N JXMSHKFPDIUYGS-SIUGBPQLSA-N 0.000 description 1
- ODPKZZLRDNXTJZ-WHOFXGATSA-N Ile-Gly-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ODPKZZLRDNXTJZ-WHOFXGATSA-N 0.000 description 1
- LBRCLQMZAHRTLV-ZKWXMUAHSA-N Ile-Gly-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LBRCLQMZAHRTLV-ZKWXMUAHSA-N 0.000 description 1
- DFFTXLCCDFYRKD-MBLNEYKQSA-N Ile-Gly-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N DFFTXLCCDFYRKD-MBLNEYKQSA-N 0.000 description 1
- UAQSZXGJGLHMNV-XEGUGMAKSA-N Ile-Gly-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N UAQSZXGJGLHMNV-XEGUGMAKSA-N 0.000 description 1
- VOBYAKCXGQQFLR-LSJOCFKGSA-N Ile-Gly-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VOBYAKCXGQQFLR-LSJOCFKGSA-N 0.000 description 1
- AREBLHSMLMRICD-PYJNHQTQSA-N Ile-His-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AREBLHSMLMRICD-PYJNHQTQSA-N 0.000 description 1
- UASTVUQJMLZWGG-PEXQALLHSA-N Ile-His-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N UASTVUQJMLZWGG-PEXQALLHSA-N 0.000 description 1
- KYLIZSDYWQQTFM-PEDHHIEDSA-N Ile-Ile-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N KYLIZSDYWQQTFM-PEDHHIEDSA-N 0.000 description 1
- PWDSHAAAFXISLE-SXTJYALSSA-N Ile-Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O PWDSHAAAFXISLE-SXTJYALSSA-N 0.000 description 1
- RIVKTKFVWXRNSJ-GRLWGSQLSA-N Ile-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RIVKTKFVWXRNSJ-GRLWGSQLSA-N 0.000 description 1
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 description 1
- GLLAUPMJCGKPFY-BLMTYFJBSA-N Ile-Ile-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 GLLAUPMJCGKPFY-BLMTYFJBSA-N 0.000 description 1
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 1
- YGDWPQCLFJNMOL-MNXVOIDGSA-N Ile-Leu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YGDWPQCLFJNMOL-MNXVOIDGSA-N 0.000 description 1
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 1
- GAZGFPOZOLEYAJ-YTFOTSKYSA-N Ile-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N GAZGFPOZOLEYAJ-YTFOTSKYSA-N 0.000 description 1
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 1
- DSDPLOODKXISDT-XUXIUFHCSA-N Ile-Leu-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DSDPLOODKXISDT-XUXIUFHCSA-N 0.000 description 1
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 description 1
- NZGTYCMLUGYMCV-XUXIUFHCSA-N Ile-Lys-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N NZGTYCMLUGYMCV-XUXIUFHCSA-N 0.000 description 1
- PNTWNAXGBOZMBO-MNXVOIDGSA-N Ile-Lys-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PNTWNAXGBOZMBO-MNXVOIDGSA-N 0.000 description 1
- XDUVMJCBYUKNFJ-MXAVVETBSA-N Ile-Lys-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N XDUVMJCBYUKNFJ-MXAVVETBSA-N 0.000 description 1
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 1
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 1
- CKRFDMPBSWYOBT-PPCPHDFISA-N Ile-Lys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CKRFDMPBSWYOBT-PPCPHDFISA-N 0.000 description 1
- WSSGUVAKYCQSCT-XUXIUFHCSA-N Ile-Met-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)O)N WSSGUVAKYCQSCT-XUXIUFHCSA-N 0.000 description 1
- BKPPWVSPSIUXHZ-OSUNSFLBSA-N Ile-Met-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N BKPPWVSPSIUXHZ-OSUNSFLBSA-N 0.000 description 1
- UOPBQSJRBONRON-STECZYCISA-N Ile-Met-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UOPBQSJRBONRON-STECZYCISA-N 0.000 description 1
- HQEPKOFULQTSFV-JURCDPSOSA-N Ile-Phe-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)O)N HQEPKOFULQTSFV-JURCDPSOSA-N 0.000 description 1
- WYUHAXJAMDTOAU-IAVJCBSLSA-N Ile-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N WYUHAXJAMDTOAU-IAVJCBSLSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 1
- FHPZJWJWTWZKNA-LLLHUVSDSA-N Ile-Phe-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N FHPZJWJWTWZKNA-LLLHUVSDSA-N 0.000 description 1
- NLZVTPYXYXMCIP-XUXIUFHCSA-N Ile-Pro-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O NLZVTPYXYXMCIP-XUXIUFHCSA-N 0.000 description 1
- CIJLNXXMDUOFPH-HJWJTTGWSA-N Ile-Pro-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 CIJLNXXMDUOFPH-HJWJTTGWSA-N 0.000 description 1
- CAHCWMVNBZJVAW-NAKRPEOUSA-N Ile-Pro-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)O)N CAHCWMVNBZJVAW-NAKRPEOUSA-N 0.000 description 1
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 1
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 1
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 1
- ZDNNDIJTUHQCAM-MXAVVETBSA-N Ile-Ser-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ZDNNDIJTUHQCAM-MXAVVETBSA-N 0.000 description 1
- WLRJHVNFGAOYPS-HJPIBITLSA-N Ile-Ser-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N WLRJHVNFGAOYPS-HJPIBITLSA-N 0.000 description 1
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 1
- PZWBBXHHUSIGKH-OSUNSFLBSA-N Ile-Thr-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PZWBBXHHUSIGKH-OSUNSFLBSA-N 0.000 description 1
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 1
- NAFIFZNBSPWYOO-RWRJDSDZSA-N Ile-Thr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NAFIFZNBSPWYOO-RWRJDSDZSA-N 0.000 description 1
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 1
- HJDZMPFEXINXLO-QPHKQPEJSA-N Ile-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N HJDZMPFEXINXLO-QPHKQPEJSA-N 0.000 description 1
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 1
- MITYXXNZSZLHGG-OBAATPRFSA-N Ile-Trp-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N MITYXXNZSZLHGG-OBAATPRFSA-N 0.000 description 1
- FXJLRZFMKGHYJP-CFMVVWHZSA-N Ile-Tyr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FXJLRZFMKGHYJP-CFMVVWHZSA-N 0.000 description 1
- ZUWSVOYKBCHLRR-MGHWNKPDSA-N Ile-Tyr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUWSVOYKBCHLRR-MGHWNKPDSA-N 0.000 description 1
- YJRSIJZUIUANHO-NAKRPEOUSA-N Ile-Val-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(=O)O)N YJRSIJZUIUANHO-NAKRPEOUSA-N 0.000 description 1
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 1
- NUEHSWNAFIEBCQ-NAKRPEOUSA-N Ile-Val-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N NUEHSWNAFIEBCQ-NAKRPEOUSA-N 0.000 description 1
- AUIYHFRUOOKTGX-UKJIMTQDSA-N Ile-Val-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N AUIYHFRUOOKTGX-UKJIMTQDSA-N 0.000 description 1
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 1
- DLEBSGAVWRPTIX-PEDHHIEDSA-N Ile-Val-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)[C@@H](C)CC DLEBSGAVWRPTIX-PEDHHIEDSA-N 0.000 description 1
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 1
- YHFPHRUWZMEOIX-CYDGBPFRSA-N Ile-Val-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)O)N YHFPHRUWZMEOIX-CYDGBPFRSA-N 0.000 description 1
- 206010061598 Immunodeficiency Diseases 0.000 description 1
- 102000006496 Immunoglobulin Heavy Chains Human genes 0.000 description 1
- 108010019476 Immunoglobulin Heavy Chains Proteins 0.000 description 1
- 241000711450 Infectious bronchitis virus Species 0.000 description 1
- 241000702626 Infectious bursal disease virus Species 0.000 description 1
- 108010004250 Inhibins Proteins 0.000 description 1
- 102000002746 Inhibins Human genes 0.000 description 1
- 108090000723 Insulin-Like Growth Factor I Proteins 0.000 description 1
- 102000004218 Insulin-Like Growth Factor I Human genes 0.000 description 1
- 108090001117 Insulin-Like Growth Factor II Proteins 0.000 description 1
- 102000048143 Insulin-Like Growth Factor II Human genes 0.000 description 1
- 102100034353 Integrase Human genes 0.000 description 1
- 102000016921 Integrin-Binding Sialoprotein Human genes 0.000 description 1
- 108010028750 Integrin-Binding Sialoprotein Proteins 0.000 description 1
- 108010002352 Interleukin-1 Proteins 0.000 description 1
- 102000003810 Interleukin-18 Human genes 0.000 description 1
- 108090000171 Interleukin-18 Proteins 0.000 description 1
- 108010013792 Isovaleryl-CoA Dehydrogenase Proteins 0.000 description 1
- 102100025392 Isovaleryl-CoA dehydrogenase, mitochondrial Human genes 0.000 description 1
- 201000005807 Japanese encephalitis Diseases 0.000 description 1
- 241000710843 Japanese encephalitis virus group Species 0.000 description 1
- 101710172804 K protein Proteins 0.000 description 1
- 101000827627 Klebsiella pneumoniae Putative low molecular weight protein-tyrosine-phosphatase Proteins 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 1
- 201000009908 La Crosse encephalitis Diseases 0.000 description 1
- 206010023927 Lassa fever Diseases 0.000 description 1
- 208000004554 Leishmaniasis Diseases 0.000 description 1
- 241000700563 Leporipoxvirus Species 0.000 description 1
- 206010024229 Leprosy Diseases 0.000 description 1
- 206010024238 Leptospirosis Diseases 0.000 description 1
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 1
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 1
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 1
- PBCHMHROGNUXMK-DLOVCJGASA-N Leu-Ala-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 PBCHMHROGNUXMK-DLOVCJGASA-N 0.000 description 1
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 1
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 1
- UILIPCLTHRPCRB-XUXIUFHCSA-N Leu-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(C)C)N UILIPCLTHRPCRB-XUXIUFHCSA-N 0.000 description 1
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 1
- CUXRXAIAVYLVFD-ULQDDVLXSA-N Leu-Arg-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUXRXAIAVYLVFD-ULQDDVLXSA-N 0.000 description 1
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 1
- OXKYZSRZKBTVEY-ZPFDUUQYSA-N Leu-Asn-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OXKYZSRZKBTVEY-ZPFDUUQYSA-N 0.000 description 1
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 description 1
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 1
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 1
- ZDSNOSQHMJBRQN-SRVKXCTJSA-N Leu-Asp-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZDSNOSQHMJBRQN-SRVKXCTJSA-N 0.000 description 1
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 1
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 1
- GZAUZBUKDXYPEH-CIUDSAMLSA-N Leu-Cys-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N GZAUZBUKDXYPEH-CIUDSAMLSA-N 0.000 description 1
- PPBKJAQJAUHZKX-SRVKXCTJSA-N Leu-Cys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(C)C PPBKJAQJAUHZKX-SRVKXCTJSA-N 0.000 description 1
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 1
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 1
- RSFGIMMPWAXNML-MNXVOIDGSA-N Leu-Gln-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RSFGIMMPWAXNML-MNXVOIDGSA-N 0.000 description 1
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 1
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 1
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 1
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 1
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 1
- FIYMBBHGYNQFOP-IUCAKERBSA-N Leu-Gly-Gln Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N FIYMBBHGYNQFOP-IUCAKERBSA-N 0.000 description 1
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 1
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 1
- VBZOAGIPCULURB-QWRGUYRKSA-N Leu-Gly-His Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N VBZOAGIPCULURB-QWRGUYRKSA-N 0.000 description 1
- KEVYYIMVELOXCT-KBPBESRZSA-N Leu-Gly-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KEVYYIMVELOXCT-KBPBESRZSA-N 0.000 description 1
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 1
- YWYQSLOTVIRCFE-SRVKXCTJSA-N Leu-His-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O YWYQSLOTVIRCFE-SRVKXCTJSA-N 0.000 description 1
- DDEMUMVXNFPDKC-SRVKXCTJSA-N Leu-His-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N DDEMUMVXNFPDKC-SRVKXCTJSA-N 0.000 description 1
- CFZZDVMBRYFFNU-QWRGUYRKSA-N Leu-His-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O CFZZDVMBRYFFNU-QWRGUYRKSA-N 0.000 description 1
- LKXANTUNFMVCNF-IHPCNDPISA-N Leu-His-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O LKXANTUNFMVCNF-IHPCNDPISA-N 0.000 description 1
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 1
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 1
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 1
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 1
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 1
- KYIIALJHAOIAHF-KKUMJFAQSA-N Leu-Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KYIIALJHAOIAHF-KKUMJFAQSA-N 0.000 description 1
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 1
- PPQRKXHCLYCBSP-IHRRRGAJSA-N Leu-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N PPQRKXHCLYCBSP-IHRRRGAJSA-N 0.000 description 1
- UBZGNBKMIJHOHL-BZSNNMDCSA-N Leu-Leu-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 UBZGNBKMIJHOHL-BZSNNMDCSA-N 0.000 description 1
- UCNNZELZXFXXJQ-BZSNNMDCSA-N Leu-Leu-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCNNZELZXFXXJQ-BZSNNMDCSA-N 0.000 description 1
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 1
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 1
- ONPJGOIVICHWBW-BZSNNMDCSA-N Leu-Lys-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 ONPJGOIVICHWBW-BZSNNMDCSA-N 0.000 description 1
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 1
- PKKMDPNFGULLNQ-AVGNSLFASA-N Leu-Met-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O PKKMDPNFGULLNQ-AVGNSLFASA-N 0.000 description 1
- HDHQQEDVWQGBEE-DCAQKATOSA-N Leu-Met-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O HDHQQEDVWQGBEE-DCAQKATOSA-N 0.000 description 1
- LQUIENKUVKPNIC-ULQDDVLXSA-N Leu-Met-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LQUIENKUVKPNIC-ULQDDVLXSA-N 0.000 description 1
- NJMXCOOEFLMZSR-AVGNSLFASA-N Leu-Met-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O NJMXCOOEFLMZSR-AVGNSLFASA-N 0.000 description 1
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 1
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 1
- YESNGRDJQWDYLH-KKUMJFAQSA-N Leu-Phe-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N YESNGRDJQWDYLH-KKUMJFAQSA-N 0.000 description 1
- SYRTUBLKWNDSDK-DKIMLUQUSA-N Leu-Phe-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYRTUBLKWNDSDK-DKIMLUQUSA-N 0.000 description 1
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 1
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 1
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 1
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 1
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 1
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 1
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 1
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 1
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 1
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 1
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 1
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 1
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 1
- FGZVGOAAROXFAB-IXOXFDKPSA-N Leu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N)O FGZVGOAAROXFAB-IXOXFDKPSA-N 0.000 description 1
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 1
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 1
- IDGRADDMTTWOQC-WDSOQIARSA-N Leu-Trp-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IDGRADDMTTWOQC-WDSOQIARSA-N 0.000 description 1
- SUYRAPCRSCCPAK-VFAJRCTISA-N Leu-Trp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SUYRAPCRSCCPAK-VFAJRCTISA-N 0.000 description 1
- SXOFUVGLPHCPRQ-KKUMJFAQSA-N Leu-Tyr-Cys Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(O)=O SXOFUVGLPHCPRQ-KKUMJFAQSA-N 0.000 description 1
- OZTZJMUZVAVJGY-BZSNNMDCSA-N Leu-Tyr-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N OZTZJMUZVAVJGY-BZSNNMDCSA-N 0.000 description 1
- AXVIGSRGTMNSJU-YESZJQIVSA-N Leu-Tyr-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N AXVIGSRGTMNSJU-YESZJQIVSA-N 0.000 description 1
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 1
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 1
- XOEDPXDZJHBQIX-ULQDDVLXSA-N Leu-Val-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOEDPXDZJHBQIX-ULQDDVLXSA-N 0.000 description 1
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 1
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 1
- 102000004058 Leukemia inhibitory factor Human genes 0.000 description 1
- 108090000581 Leukemia inhibitory factor Proteins 0.000 description 1
- 241000186779 Listeria monocytogenes Species 0.000 description 1
- 108010071324 Livagen Proteins 0.000 description 1
- 108090000856 Lyases Proteins 0.000 description 1
- 102000004317 Lyases Human genes 0.000 description 1
- 241000701043 Lymphocryptovirus Species 0.000 description 1
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 1
- MPOHDJKRBLVGCT-CIUDSAMLSA-N Lys-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N MPOHDJKRBLVGCT-CIUDSAMLSA-N 0.000 description 1
- MPGHETGWWWUHPY-CIUDSAMLSA-N Lys-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN MPGHETGWWWUHPY-CIUDSAMLSA-N 0.000 description 1
- CLBGMWIYPYAZPR-AVGNSLFASA-N Lys-Arg-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O CLBGMWIYPYAZPR-AVGNSLFASA-N 0.000 description 1
- GQUDMNDPQTXZRV-DCAQKATOSA-N Lys-Arg-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GQUDMNDPQTXZRV-DCAQKATOSA-N 0.000 description 1
- JGAMUXDWYSXYLM-SRVKXCTJSA-N Lys-Arg-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O JGAMUXDWYSXYLM-SRVKXCTJSA-N 0.000 description 1
- GAOJCVKPIGHTGO-UWVGGRQHSA-N Lys-Arg-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O GAOJCVKPIGHTGO-UWVGGRQHSA-N 0.000 description 1
- BRSGXFITDXFMFF-IHRRRGAJSA-N Lys-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N BRSGXFITDXFMFF-IHRRRGAJSA-N 0.000 description 1
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 1
- NQCJGQHHYZNUDK-DCAQKATOSA-N Lys-Arg-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCN=C(N)N NQCJGQHHYZNUDK-DCAQKATOSA-N 0.000 description 1
- SWWCDAGDQHTKIE-RHYQMDGZSA-N Lys-Arg-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWWCDAGDQHTKIE-RHYQMDGZSA-N 0.000 description 1
- DGAAQRAUOFHBFJ-CIUDSAMLSA-N Lys-Asn-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DGAAQRAUOFHBFJ-CIUDSAMLSA-N 0.000 description 1
- DNEJSAIMVANNPA-DCAQKATOSA-N Lys-Asn-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DNEJSAIMVANNPA-DCAQKATOSA-N 0.000 description 1
- WLCYCADOWRMSAJ-CIUDSAMLSA-N Lys-Asn-Cys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O WLCYCADOWRMSAJ-CIUDSAMLSA-N 0.000 description 1
- QYOXSYXPHUHOJR-GUBZILKMSA-N Lys-Asn-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYOXSYXPHUHOJR-GUBZILKMSA-N 0.000 description 1
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 1
- DEFGUIIUYAUEDU-ZPFDUUQYSA-N Lys-Asn-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DEFGUIIUYAUEDU-ZPFDUUQYSA-N 0.000 description 1
- YVSHZSUKQHNDHD-KKUMJFAQSA-N Lys-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N YVSHZSUKQHNDHD-KKUMJFAQSA-N 0.000 description 1
- FLCMXEFCTLXBTL-DCAQKATOSA-N Lys-Asp-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N FLCMXEFCTLXBTL-DCAQKATOSA-N 0.000 description 1
- QUYCUALODHJQLK-CIUDSAMLSA-N Lys-Asp-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUYCUALODHJQLK-CIUDSAMLSA-N 0.000 description 1
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 1
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 1
- NTBFKPBULZGXQL-KKUMJFAQSA-N Lys-Asp-Tyr Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTBFKPBULZGXQL-KKUMJFAQSA-N 0.000 description 1
- MLLKLNYPZRDIQG-GUBZILKMSA-N Lys-Cys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N MLLKLNYPZRDIQG-GUBZILKMSA-N 0.000 description 1
- SSYOBDBNBQBSQE-SRVKXCTJSA-N Lys-Cys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O SSYOBDBNBQBSQE-SRVKXCTJSA-N 0.000 description 1
- XFBBBRDEQIPGNR-KATARQTJSA-N Lys-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N)O XFBBBRDEQIPGNR-KATARQTJSA-N 0.000 description 1
- DZQYZKPINJLLEN-KKUMJFAQSA-N Lys-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N)O DZQYZKPINJLLEN-KKUMJFAQSA-N 0.000 description 1
- MQMIRLVJXQNTRJ-SDDRHHMPSA-N Lys-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O MQMIRLVJXQNTRJ-SDDRHHMPSA-N 0.000 description 1
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 1
- WGLAORUKDGRINI-WDCWCFNPSA-N Lys-Glu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGLAORUKDGRINI-WDCWCFNPSA-N 0.000 description 1
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 1
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 1
- JZMGVXLDOQOKAH-UWVGGRQHSA-N Lys-Gly-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O JZMGVXLDOQOKAH-UWVGGRQHSA-N 0.000 description 1
- SPCHLZUWJTYZFC-IHRRRGAJSA-N Lys-His-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O SPCHLZUWJTYZFC-IHRRRGAJSA-N 0.000 description 1
- KYNNSEJZFVCDIV-ZPFDUUQYSA-N Lys-Ile-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O KYNNSEJZFVCDIV-ZPFDUUQYSA-N 0.000 description 1
- OJDFAABAHBPVTH-MNXVOIDGSA-N Lys-Ile-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O OJDFAABAHBPVTH-MNXVOIDGSA-N 0.000 description 1
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 1
- QOJDBRUCOXQSSK-AJNGGQMLSA-N Lys-Ile-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O QOJDBRUCOXQSSK-AJNGGQMLSA-N 0.000 description 1
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 1
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 1
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 1
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 1
- RIJCHEVHFWMDKD-SRVKXCTJSA-N Lys-Lys-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RIJCHEVHFWMDKD-SRVKXCTJSA-N 0.000 description 1
- JQSIGLHQNSZZRL-KKUMJFAQSA-N Lys-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N JQSIGLHQNSZZRL-KKUMJFAQSA-N 0.000 description 1
- YXPJCVNIDDKGOE-MELADBBJSA-N Lys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N)C(=O)O YXPJCVNIDDKGOE-MELADBBJSA-N 0.000 description 1
- PLDJDCJLRCYPJB-VOAKCMCISA-N Lys-Lys-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PLDJDCJLRCYPJB-VOAKCMCISA-N 0.000 description 1
- BEGQVWUZFXLNHZ-IHPCNDPISA-N Lys-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN)C(O)=O)=CNC2=C1 BEGQVWUZFXLNHZ-IHPCNDPISA-N 0.000 description 1
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 1
- URGPVYGVWLIRGT-DCAQKATOSA-N Lys-Met-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O URGPVYGVWLIRGT-DCAQKATOSA-N 0.000 description 1
- TWPCWKVOZDUYAA-KKUMJFAQSA-N Lys-Phe-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O TWPCWKVOZDUYAA-KKUMJFAQSA-N 0.000 description 1
- ZJSZPXISKMDJKQ-JYJNAYRXSA-N Lys-Phe-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=CC=C1 ZJSZPXISKMDJKQ-JYJNAYRXSA-N 0.000 description 1
- MSSJJDVQTFTLIF-KBPBESRZSA-N Lys-Phe-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O MSSJJDVQTFTLIF-KBPBESRZSA-N 0.000 description 1
- LMGNWHDWJDIOPK-DKIMLUQUSA-N Lys-Phe-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LMGNWHDWJDIOPK-DKIMLUQUSA-N 0.000 description 1
- LNMKRJJLEFASGA-BZSNNMDCSA-N Lys-Phe-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LNMKRJJLEFASGA-BZSNNMDCSA-N 0.000 description 1
- AZOFEHCPMBRNFD-BZSNNMDCSA-N Lys-Phe-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 AZOFEHCPMBRNFD-BZSNNMDCSA-N 0.000 description 1
- IPTUBUUIFRZMJK-ACRUOGEOSA-N Lys-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 IPTUBUUIFRZMJK-ACRUOGEOSA-N 0.000 description 1
- AEIIJFBQVGYVEV-YESZJQIVSA-N Lys-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCCN)N)C(=O)O AEIIJFBQVGYVEV-YESZJQIVSA-N 0.000 description 1
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 1
- AFLBTVGQCQLOFJ-AVGNSLFASA-N Lys-Pro-Arg Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AFLBTVGQCQLOFJ-AVGNSLFASA-N 0.000 description 1
- CNGOEHJCLVCJHN-SRVKXCTJSA-N Lys-Pro-Glu Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O CNGOEHJCLVCJHN-SRVKXCTJSA-N 0.000 description 1
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 1
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 1
- CRIODIGWCUPXKU-AVGNSLFASA-N Lys-Pro-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O CRIODIGWCUPXKU-AVGNSLFASA-N 0.000 description 1
- MIROMRNASYKZNL-ULQDDVLXSA-N Lys-Pro-Tyr Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MIROMRNASYKZNL-ULQDDVLXSA-N 0.000 description 1
- YSPZCHGIWAQVKQ-AVGNSLFASA-N Lys-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN YSPZCHGIWAQVKQ-AVGNSLFASA-N 0.000 description 1
- DNWBUCHHMRQWCZ-GUBZILKMSA-N Lys-Ser-Gln Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DNWBUCHHMRQWCZ-GUBZILKMSA-N 0.000 description 1
- JMNRXRPBHFGXQX-GUBZILKMSA-N Lys-Ser-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JMNRXRPBHFGXQX-GUBZILKMSA-N 0.000 description 1
- LKDXINHHSWFFJC-SRVKXCTJSA-N Lys-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N LKDXINHHSWFFJC-SRVKXCTJSA-N 0.000 description 1
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 1
- DYJOORGDQIGZAS-DCAQKATOSA-N Lys-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N DYJOORGDQIGZAS-DCAQKATOSA-N 0.000 description 1
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 description 1
- TVHCDSBMFQYPNA-RHYQMDGZSA-N Lys-Thr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TVHCDSBMFQYPNA-RHYQMDGZSA-N 0.000 description 1
- UWHCKWNPWKTMBM-WDCWCFNPSA-N Lys-Thr-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWHCKWNPWKTMBM-WDCWCFNPSA-N 0.000 description 1
- IEVXCWPVBYCJRZ-IXOXFDKPSA-N Lys-Thr-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IEVXCWPVBYCJRZ-IXOXFDKPSA-N 0.000 description 1
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 1
- BDFHWFUAQLIMJO-KXNHARMFSA-N Lys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N)O BDFHWFUAQLIMJO-KXNHARMFSA-N 0.000 description 1
- YFQSSOAGMZGXFT-MEYUZBJRSA-N Lys-Thr-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YFQSSOAGMZGXFT-MEYUZBJRSA-N 0.000 description 1
- GVKINWYYLOLEFQ-XIRDDKMYSA-N Lys-Trp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O GVKINWYYLOLEFQ-XIRDDKMYSA-N 0.000 description 1
- SUZVLFWOCKHWET-CQDKDKBSSA-N Lys-Tyr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O SUZVLFWOCKHWET-CQDKDKBSSA-N 0.000 description 1
- HONVOXINDBETTI-KKUMJFAQSA-N Lys-Tyr-Cys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CS)C(O)=O)CC1=CC=C(O)C=C1 HONVOXINDBETTI-KKUMJFAQSA-N 0.000 description 1
- XATKLFSXFINPSB-JYJNAYRXSA-N Lys-Tyr-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O XATKLFSXFINPSB-JYJNAYRXSA-N 0.000 description 1
- WINFHLHJTRGLCV-BZSNNMDCSA-N Lys-Tyr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 WINFHLHJTRGLCV-BZSNNMDCSA-N 0.000 description 1
- PPNCMJARTHYNEC-MEYUZBJRSA-N Lys-Tyr-Thr Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)CC1=CC=C(O)C=C1 PPNCMJARTHYNEC-MEYUZBJRSA-N 0.000 description 1
- USPJSTBDIGJPFK-PMVMPFDFSA-N Lys-Tyr-Trp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O USPJSTBDIGJPFK-PMVMPFDFSA-N 0.000 description 1
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 1
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 1
- 241000711828 Lyssavirus Species 0.000 description 1
- 108010059343 MM Form Creatine Kinase Proteins 0.000 description 1
- 101710125418 Major capsid protein Proteins 0.000 description 1
- 241001115401 Marburgvirus Species 0.000 description 1
- 241000701244 Mastadenovirus Species 0.000 description 1
- 101710085938 Matrix protein Proteins 0.000 description 1
- 201000005505 Measles Diseases 0.000 description 1
- 101710127721 Membrane protein Proteins 0.000 description 1
- MUYQDMBLDFEVRJ-LSJOCFKGSA-N Met-Ala-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 MUYQDMBLDFEVRJ-LSJOCFKGSA-N 0.000 description 1
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 description 1
- IIPHCNKHEZYSNE-DCAQKATOSA-N Met-Arg-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O IIPHCNKHEZYSNE-DCAQKATOSA-N 0.000 description 1
- MDXAULHWGWETHF-SRVKXCTJSA-N Met-Arg-Val Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CCCNC(N)=N MDXAULHWGWETHF-SRVKXCTJSA-N 0.000 description 1
- ACYHZNZHIZWLQF-BQBZGAKWSA-N Met-Asn-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ACYHZNZHIZWLQF-BQBZGAKWSA-N 0.000 description 1
- UAPZLLPGGOOCRO-IHRRRGAJSA-N Met-Asn-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N UAPZLLPGGOOCRO-IHRRRGAJSA-N 0.000 description 1
- IHITVQKJXQQGLJ-LPEHRKFASA-N Met-Asn-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N IHITVQKJXQQGLJ-LPEHRKFASA-N 0.000 description 1
- HDNOQCZWJGGHSS-VEVYYDQMSA-N Met-Asn-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HDNOQCZWJGGHSS-VEVYYDQMSA-N 0.000 description 1
- OHMKUHXCDSCOMT-QXEWZRGKSA-N Met-Asn-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHMKUHXCDSCOMT-QXEWZRGKSA-N 0.000 description 1
- SQUTUWHAAWJYES-GUBZILKMSA-N Met-Asp-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SQUTUWHAAWJYES-GUBZILKMSA-N 0.000 description 1
- GODBLDDYHFTUAH-CIUDSAMLSA-N Met-Asp-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O GODBLDDYHFTUAH-CIUDSAMLSA-N 0.000 description 1
- TZLYIHDABYBOCJ-FXQIFTODSA-N Met-Asp-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O TZLYIHDABYBOCJ-FXQIFTODSA-N 0.000 description 1
- UJDMTKHGWSBHBX-IHRRRGAJSA-N Met-Cys-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 UJDMTKHGWSBHBX-IHRRRGAJSA-N 0.000 description 1
- VOOINLQYUZOREH-SRVKXCTJSA-N Met-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N VOOINLQYUZOREH-SRVKXCTJSA-N 0.000 description 1
- AETNZPKUUYYYEK-CIUDSAMLSA-N Met-Glu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AETNZPKUUYYYEK-CIUDSAMLSA-N 0.000 description 1
- MTBVQFFQMXHCPC-CIUDSAMLSA-N Met-Glu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MTBVQFFQMXHCPC-CIUDSAMLSA-N 0.000 description 1
- CHQWUYSNAOABIP-ZPFDUUQYSA-N Met-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N CHQWUYSNAOABIP-ZPFDUUQYSA-N 0.000 description 1
- RNAGAJXCSPDPRK-KKUMJFAQSA-N Met-Glu-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 RNAGAJXCSPDPRK-KKUMJFAQSA-N 0.000 description 1
- JPCHYAUKOUGOIB-HJGDQZAQSA-N Met-Glu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPCHYAUKOUGOIB-HJGDQZAQSA-N 0.000 description 1
- FYRUJIJAUPHUNB-IUCAKERBSA-N Met-Gly-Arg Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N FYRUJIJAUPHUNB-IUCAKERBSA-N 0.000 description 1
- JACAKCWAOHKQBV-UWVGGRQHSA-N Met-Gly-Lys Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN JACAKCWAOHKQBV-UWVGGRQHSA-N 0.000 description 1
- LRALLISKBZNSKN-BQBZGAKWSA-N Met-Gly-Ser Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LRALLISKBZNSKN-BQBZGAKWSA-N 0.000 description 1
- SXWQMBGNFXAGAT-FJXKBIBVSA-N Met-Gly-Thr Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SXWQMBGNFXAGAT-FJXKBIBVSA-N 0.000 description 1
- BMHIFARYXOJDLD-WPRPVWTQSA-N Met-Gly-Val Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O BMHIFARYXOJDLD-WPRPVWTQSA-N 0.000 description 1
- FGAMAYQCWQCUNF-DCAQKATOSA-N Met-His-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FGAMAYQCWQCUNF-DCAQKATOSA-N 0.000 description 1
- NHDMNXBBSGVYGP-PYJNHQTQSA-N Met-His-Ile Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)CC1=CN=CN1 NHDMNXBBSGVYGP-PYJNHQTQSA-N 0.000 description 1
- TZHFJXDKXGZHEN-IHRRRGAJSA-N Met-His-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O TZHFJXDKXGZHEN-IHRRRGAJSA-N 0.000 description 1
- BKIFWLQFOOKUCA-DCAQKATOSA-N Met-His-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N BKIFWLQFOOKUCA-DCAQKATOSA-N 0.000 description 1
- SCKPOOMCTFEVTN-QTKMDUPCSA-N Met-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCSC)N)O SCKPOOMCTFEVTN-QTKMDUPCSA-N 0.000 description 1
- XMQZLGBUJMMODC-AVGNSLFASA-N Met-His-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O XMQZLGBUJMMODC-AVGNSLFASA-N 0.000 description 1
- NLHSFJQUHGCWSD-PYJNHQTQSA-N Met-Ile-His Chemical compound N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O NLHSFJQUHGCWSD-PYJNHQTQSA-N 0.000 description 1
- WPTDJKDGICUFCP-XUXIUFHCSA-N Met-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCSC)N WPTDJKDGICUFCP-XUXIUFHCSA-N 0.000 description 1
- RRIHXWPHQSXHAQ-XUXIUFHCSA-N Met-Ile-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O RRIHXWPHQSXHAQ-XUXIUFHCSA-N 0.000 description 1
- AFFKUNVPPLQUGA-DCAQKATOSA-N Met-Leu-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O AFFKUNVPPLQUGA-DCAQKATOSA-N 0.000 description 1
- PZUUMQPMHBJJKE-AVGNSLFASA-N Met-Leu-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCNC(N)=N PZUUMQPMHBJJKE-AVGNSLFASA-N 0.000 description 1
- ZIIMORLEZLVRIP-SRVKXCTJSA-N Met-Leu-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZIIMORLEZLVRIP-SRVKXCTJSA-N 0.000 description 1
- HGAJNEWOUHDUMZ-SRVKXCTJSA-N Met-Leu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O HGAJNEWOUHDUMZ-SRVKXCTJSA-N 0.000 description 1
- OSZTUONKUMCWEP-XUXIUFHCSA-N Met-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC OSZTUONKUMCWEP-XUXIUFHCSA-N 0.000 description 1
- SODXFJOPSCXOHE-IHRRRGAJSA-N Met-Leu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O SODXFJOPSCXOHE-IHRRRGAJSA-N 0.000 description 1
- YLBUMXYVQCHBPR-ULQDDVLXSA-N Met-Leu-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YLBUMXYVQCHBPR-ULQDDVLXSA-N 0.000 description 1
- UNPGTBHYKJOCCZ-DCAQKATOSA-N Met-Lys-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O UNPGTBHYKJOCCZ-DCAQKATOSA-N 0.000 description 1
- WPTHAGXMYDRPFD-SRVKXCTJSA-N Met-Lys-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O WPTHAGXMYDRPFD-SRVKXCTJSA-N 0.000 description 1
- LLKWSEXLNFBKIF-CYDGBPFRSA-N Met-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCSC LLKWSEXLNFBKIF-CYDGBPFRSA-N 0.000 description 1
- WUYLWZRHRLLEGB-AVGNSLFASA-N Met-Met-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O WUYLWZRHRLLEGB-AVGNSLFASA-N 0.000 description 1
- NLDXSXDCNZIQCN-ULQDDVLXSA-N Met-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCSC)CC1=CC=CC=C1 NLDXSXDCNZIQCN-ULQDDVLXSA-N 0.000 description 1
- LQTGGXSOMDSWTQ-UNQGMJICSA-N Met-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCSC)N)O LQTGGXSOMDSWTQ-UNQGMJICSA-N 0.000 description 1
- PHKBGZKVOJCIMZ-SRVKXCTJSA-N Met-Pro-Arg Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PHKBGZKVOJCIMZ-SRVKXCTJSA-N 0.000 description 1
- VSJAPSMRFYUOKS-IUCAKERBSA-N Met-Pro-Gly Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O VSJAPSMRFYUOKS-IUCAKERBSA-N 0.000 description 1
- FNYBIOGBMWFQRJ-SRVKXCTJSA-N Met-Pro-Met Chemical compound CSCC[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)O)N FNYBIOGBMWFQRJ-SRVKXCTJSA-N 0.000 description 1
- SBFPAAPFKZPDCZ-JYJNAYRXSA-N Met-Pro-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O SBFPAAPFKZPDCZ-JYJNAYRXSA-N 0.000 description 1
- BJPQKNHZHUCQNQ-SRVKXCTJSA-N Met-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCSC)N BJPQKNHZHUCQNQ-SRVKXCTJSA-N 0.000 description 1
- PCTFVQATEGYHJU-FXQIFTODSA-N Met-Ser-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O PCTFVQATEGYHJU-FXQIFTODSA-N 0.000 description 1
- RMLLCGYYVZKKRT-CIUDSAMLSA-N Met-Ser-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O RMLLCGYYVZKKRT-CIUDSAMLSA-N 0.000 description 1
- LXCSZPUQKMTXNW-BQBZGAKWSA-N Met-Ser-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O LXCSZPUQKMTXNW-BQBZGAKWSA-N 0.000 description 1
- LHXFNWBNRBWMNV-DCAQKATOSA-N Met-Ser-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LHXFNWBNRBWMNV-DCAQKATOSA-N 0.000 description 1
- SMVTWPOATVIXTN-NAKRPEOUSA-N Met-Ser-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SMVTWPOATVIXTN-NAKRPEOUSA-N 0.000 description 1
- MNGBICITWAPGAS-BPUTZDHNSA-N Met-Ser-Trp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O MNGBICITWAPGAS-BPUTZDHNSA-N 0.000 description 1
- KSIPKXNIQOWMIC-RCWTZXSCSA-N Met-Thr-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KSIPKXNIQOWMIC-RCWTZXSCSA-N 0.000 description 1
- KYXDADPHSNFWQX-VEVYYDQMSA-N Met-Thr-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O KYXDADPHSNFWQX-VEVYYDQMSA-N 0.000 description 1
- WSPQHZOMTFFWGH-XGEHTFHBSA-N Met-Thr-Cys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(O)=O WSPQHZOMTFFWGH-XGEHTFHBSA-N 0.000 description 1
- WXJLBSXNUHIGSS-OSUNSFLBSA-N Met-Thr-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WXJLBSXNUHIGSS-OSUNSFLBSA-N 0.000 description 1
- QQPMHUCGDRJFQK-RHYQMDGZSA-N Met-Thr-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QQPMHUCGDRJFQK-RHYQMDGZSA-N 0.000 description 1
- IHRFZLQEQVHXFA-RHYQMDGZSA-N Met-Thr-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCCN IHRFZLQEQVHXFA-RHYQMDGZSA-N 0.000 description 1
- QYIGOFGUOVTAHK-ZJDVBMNYSA-N Met-Thr-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QYIGOFGUOVTAHK-ZJDVBMNYSA-N 0.000 description 1
- UZBQXELAFPCGRV-SZMVWBNQSA-N Met-Trp-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UZBQXELAFPCGRV-SZMVWBNQSA-N 0.000 description 1
- DOQXHOUYYSPISL-SZMVWBNQSA-N Met-Trp-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCSC)C(=O)O)N DOQXHOUYYSPISL-SZMVWBNQSA-N 0.000 description 1
- SQPZCTBSLIIMBL-BPUTZDHNSA-N Met-Trp-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CO)C(=O)O)N SQPZCTBSLIIMBL-BPUTZDHNSA-N 0.000 description 1
- FZDOBWIKRQORAC-ULQDDVLXSA-N Met-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCSC)N FZDOBWIKRQORAC-ULQDDVLXSA-N 0.000 description 1
- VYXIKLFLGRTANT-HRCADAONSA-N Met-Tyr-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N VYXIKLFLGRTANT-HRCADAONSA-N 0.000 description 1
- PNHRPOWKRRJATF-IHRRRGAJSA-N Met-Tyr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 PNHRPOWKRRJATF-IHRRRGAJSA-N 0.000 description 1
- LPNWWHBFXPNHJG-AVGNSLFASA-N Met-Val-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN LPNWWHBFXPNHJG-AVGNSLFASA-N 0.000 description 1
- JACMWNXOOUYXCD-JYJNAYRXSA-N Met-Val-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JACMWNXOOUYXCD-JYJNAYRXSA-N 0.000 description 1
- 108010085747 Methylmalonyl-CoA Decarboxylase Proteins 0.000 description 1
- 102000019010 Methylmalonyl-CoA Mutase Human genes 0.000 description 1
- 108010051862 Methylmalonyl-CoA mutase Proteins 0.000 description 1
- 241001460074 Microsporum distortum Species 0.000 description 1
- 101001130841 Middle East respiratory syndrome-related coronavirus (isolate United Kingdom/H123990006/2012) Non-structural protein ORF5 Proteins 0.000 description 1
- 101710169105 Minor spike protein Proteins 0.000 description 1
- 102000014962 Monocyte Chemoattractant Proteins Human genes 0.000 description 1
- 108010064136 Monocyte Chemoattractant Proteins Proteins 0.000 description 1
- 241000588621 Moraxella Species 0.000 description 1
- 241000712045 Morbillivirus Species 0.000 description 1
- 241000713333 Mouse mammary tumor virus Species 0.000 description 1
- 241000711386 Mumps virus Species 0.000 description 1
- 241000701034 Muromegalovirus Species 0.000 description 1
- 241000699666 Mus <mouse, genus> Species 0.000 description 1
- 101100335081 Mus musculus Flt3 gene Proteins 0.000 description 1
- 241000041810 Mycetoma Species 0.000 description 1
- 241000204031 Mycoplasma Species 0.000 description 1
- 206010028470 Mycoplasma infections Diseases 0.000 description 1
- 241000202934 Mycoplasma pneumoniae Species 0.000 description 1
- 102100032970 Myogenin Human genes 0.000 description 1
- 108010056785 Myogenin Proteins 0.000 description 1
- 102100026057 Myosin regulatory light chain 2, atrial isoform Human genes 0.000 description 1
- 101710098224 Myosin regulatory light chain 2, atrial isoform Proteins 0.000 description 1
- 102100030626 Myosin-binding protein H Human genes 0.000 description 1
- 101710139548 Myosin-binding protein H Proteins 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 1
- BAWFJGJZGIEFAR-NNYOXOHSSA-O NAD(+) Chemical compound NC(=O)C1=CC=C[N+]([C@H]2[C@@H]([C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OC[C@@H]3[C@H]([C@@H](O)[C@@H](O3)N3C4=NC=NC(N)=C4N=C3)O)O2)O)=C1 BAWFJGJZGIEFAR-NNYOXOHSSA-O 0.000 description 1
- 108091007491 NSP3 Papain-like protease domains Proteins 0.000 description 1
- 208000006007 Nairobi Sheep Disease Diseases 0.000 description 1
- 108091061960 Naked DNA Proteins 0.000 description 1
- 241000588653 Neisseria Species 0.000 description 1
- 241000244206 Nematoda Species 0.000 description 1
- 102000015336 Nerve Growth Factor Human genes 0.000 description 1
- 102000007072 Nerve Growth Factors Human genes 0.000 description 1
- 108010074223 Netrin-1 Proteins 0.000 description 1
- 102000009065 Netrin-1 Human genes 0.000 description 1
- 102000014413 Neuregulin Human genes 0.000 description 1
- 108050003475 Neuregulin Proteins 0.000 description 1
- 102100029268 Neurotrophin-3 Human genes 0.000 description 1
- 102100033857 Neurotrophin-4 Human genes 0.000 description 1
- 102100021584 Neurturin Human genes 0.000 description 1
- 108010015406 Neurturin Proteins 0.000 description 1
- 206010029443 Nocardia Infections Diseases 0.000 description 1
- 206010029444 Nocardiosis Diseases 0.000 description 1
- 102000007399 Nuclear hormone receptor Human genes 0.000 description 1
- 108020005497 Nuclear hormone receptor Proteins 0.000 description 1
- 108090001074 Nucleocapsid Proteins Proteins 0.000 description 1
- 101710141454 Nucleoprotein Proteins 0.000 description 1
- 101710113540 ORF2 protein Proteins 0.000 description 1
- 241000702259 Orbivirus Species 0.000 description 1
- 102000007981 Ornithine carbamoyltransferase Human genes 0.000 description 1
- 101710198224 Ornithine carbamoyltransferase, mitochondrial Proteins 0.000 description 1
- 241000150218 Orthonairovirus Species 0.000 description 1
- 241000700629 Orthopoxvirus Species 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 102000004067 Osteocalcin Human genes 0.000 description 1
- 108090000573 Osteocalcin Proteins 0.000 description 1
- 241001631646 Papillomaviridae Species 0.000 description 1
- 241001504519 Papio ursinus Species 0.000 description 1
- 241000700639 Parapoxvirus Species 0.000 description 1
- 241000606860 Pasteurella Species 0.000 description 1
- 241001494479 Pecora Species 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 102000003992 Peroxidases Human genes 0.000 description 1
- JVTMTFMMMHAPCR-UBHSHLNASA-N Phe-Ala-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JVTMTFMMMHAPCR-UBHSHLNASA-N 0.000 description 1
- LSXGADJXBDFXQU-DLOVCJGASA-N Phe-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 LSXGADJXBDFXQU-DLOVCJGASA-N 0.000 description 1
- AJOKKVTWEMXZHC-DRZSPHRISA-N Phe-Ala-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 AJOKKVTWEMXZHC-DRZSPHRISA-N 0.000 description 1
- JNRFYJZCMHHGMH-UBHSHLNASA-N Phe-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JNRFYJZCMHHGMH-UBHSHLNASA-N 0.000 description 1
- LBSARGIQACMGDF-WBAXXEDZSA-N Phe-Ala-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 LBSARGIQACMGDF-WBAXXEDZSA-N 0.000 description 1
- QMMRHASQEVCJGR-UBHSHLNASA-N Phe-Ala-Pro Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 QMMRHASQEVCJGR-UBHSHLNASA-N 0.000 description 1
- BKWJQWJPZMUWEG-LFSVMHDDSA-N Phe-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BKWJQWJPZMUWEG-LFSVMHDDSA-N 0.000 description 1
- NOFBJKKOPKJDCO-KKXDTOCCSA-N Phe-Ala-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NOFBJKKOPKJDCO-KKXDTOCCSA-N 0.000 description 1
- SEPNOAFMZLLCEW-UBHSHLNASA-N Phe-Ala-Val Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O SEPNOAFMZLLCEW-UBHSHLNASA-N 0.000 description 1
- VHWOBXIWBDWZHK-IHRRRGAJSA-N Phe-Arg-Asp Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 VHWOBXIWBDWZHK-IHRRRGAJSA-N 0.000 description 1
- YMORXCKTSSGYIG-IHRRRGAJSA-N Phe-Arg-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N YMORXCKTSSGYIG-IHRRRGAJSA-N 0.000 description 1
- AGYXCMYVTBYGCT-ULQDDVLXSA-N Phe-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O AGYXCMYVTBYGCT-ULQDDVLXSA-N 0.000 description 1
- HTTYNOXBBOWZTB-SRVKXCTJSA-N Phe-Asn-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N HTTYNOXBBOWZTB-SRVKXCTJSA-N 0.000 description 1
- HCTXJGRYAACKOB-SRVKXCTJSA-N Phe-Asn-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HCTXJGRYAACKOB-SRVKXCTJSA-N 0.000 description 1
- MRNRMSDVVSKPGM-AVGNSLFASA-N Phe-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MRNRMSDVVSKPGM-AVGNSLFASA-N 0.000 description 1
- KIAWKQJTSGRCSA-AVGNSLFASA-N Phe-Asn-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KIAWKQJTSGRCSA-AVGNSLFASA-N 0.000 description 1
- MECSIDWUTYRHRJ-KKUMJFAQSA-N Phe-Asn-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O MECSIDWUTYRHRJ-KKUMJFAQSA-N 0.000 description 1
- WGXOKDLDIWSOCV-MELADBBJSA-N Phe-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O WGXOKDLDIWSOCV-MELADBBJSA-N 0.000 description 1
- HTKNPQZCMLBOTQ-XVSYOHENSA-N Phe-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N)O HTKNPQZCMLBOTQ-XVSYOHENSA-N 0.000 description 1
- JOXIIFVCSATTDH-IHPCNDPISA-N Phe-Asn-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N JOXIIFVCSATTDH-IHPCNDPISA-N 0.000 description 1
- WMGVYPPIMZPWPN-SRVKXCTJSA-N Phe-Asp-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N WMGVYPPIMZPWPN-SRVKXCTJSA-N 0.000 description 1
- DDYIRGBOZVKRFR-AVGNSLFASA-N Phe-Asp-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DDYIRGBOZVKRFR-AVGNSLFASA-N 0.000 description 1
- IUVYJBMTHARMIP-PCBIJLKTSA-N Phe-Asp-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IUVYJBMTHARMIP-PCBIJLKTSA-N 0.000 description 1
- IQXOZIDWLZYYAW-IHRRRGAJSA-N Phe-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IQXOZIDWLZYYAW-IHRRRGAJSA-N 0.000 description 1
- MQVFHOPCKNTHGT-MELADBBJSA-N Phe-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O MQVFHOPCKNTHGT-MELADBBJSA-N 0.000 description 1
- SWZKMTDPQXLQRD-XVSYOHENSA-N Phe-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWZKMTDPQXLQRD-XVSYOHENSA-N 0.000 description 1
- LXUJDHOKVUYHRC-KKUMJFAQSA-N Phe-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N LXUJDHOKVUYHRC-KKUMJFAQSA-N 0.000 description 1
- DHZOGDVYRQOGAC-BZSNNMDCSA-N Phe-Cys-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N DHZOGDVYRQOGAC-BZSNNMDCSA-N 0.000 description 1
- GDBOREPXIRKSEQ-FHWLQOOXSA-N Phe-Gln-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GDBOREPXIRKSEQ-FHWLQOOXSA-N 0.000 description 1
- RLUMIJXNHJVUCO-JBACZVJFSA-N Phe-Gln-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 RLUMIJXNHJVUCO-JBACZVJFSA-N 0.000 description 1
- HOYQLNNGMHXZDW-KKUMJFAQSA-N Phe-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HOYQLNNGMHXZDW-KKUMJFAQSA-N 0.000 description 1
- FIRWJEJVFFGXSH-RYUDHWBXSA-N Phe-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 FIRWJEJVFFGXSH-RYUDHWBXSA-N 0.000 description 1
- MGECUMGTSHYHEJ-QEWYBTABSA-N Phe-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGECUMGTSHYHEJ-QEWYBTABSA-N 0.000 description 1
- KJJROSNFBRWPHS-JYJNAYRXSA-N Phe-Glu-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KJJROSNFBRWPHS-JYJNAYRXSA-N 0.000 description 1
- JJHVFCUWLSKADD-ONGXEEELSA-N Phe-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O JJHVFCUWLSKADD-ONGXEEELSA-N 0.000 description 1
- WPTYDQPGBMDUBI-QWRGUYRKSA-N Phe-Gly-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O WPTYDQPGBMDUBI-QWRGUYRKSA-N 0.000 description 1
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 1
- HGNGAMWHGGANAU-WHOFXGATSA-N Phe-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HGNGAMWHGGANAU-WHOFXGATSA-N 0.000 description 1
- NHCKESBLOMHIIE-IRXDYDNUSA-N Phe-Gly-Phe Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 NHCKESBLOMHIIE-IRXDYDNUSA-N 0.000 description 1
- BIYWZVCPZIFGPY-QWRGUYRKSA-N Phe-Gly-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O BIYWZVCPZIFGPY-QWRGUYRKSA-N 0.000 description 1
- OVJMCXAPGFDGMG-HKUYNNGSSA-N Phe-Gly-Trp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OVJMCXAPGFDGMG-HKUYNNGSSA-N 0.000 description 1
- PMKIMKUGCSVFSV-CQDKDKBSSA-N Phe-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N PMKIMKUGCSVFSV-CQDKDKBSSA-N 0.000 description 1
- SWCOXQLDICUYOL-ULQDDVLXSA-N Phe-His-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SWCOXQLDICUYOL-ULQDDVLXSA-N 0.000 description 1
- PPHFTNABKQRAJV-JYJNAYRXSA-N Phe-His-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PPHFTNABKQRAJV-JYJNAYRXSA-N 0.000 description 1
- YZJKNDCEPDDIDA-BZSNNMDCSA-N Phe-His-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CN=CN1 YZJKNDCEPDDIDA-BZSNNMDCSA-N 0.000 description 1
- BEEVXUYVEHXWRQ-YESZJQIVSA-N Phe-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O BEEVXUYVEHXWRQ-YESZJQIVSA-N 0.000 description 1
- WKTSCAXSYITIJJ-PCBIJLKTSA-N Phe-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O WKTSCAXSYITIJJ-PCBIJLKTSA-N 0.000 description 1
- MJQFZGOIVBDIMZ-WHOFXGATSA-N Phe-Ile-Gly Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O MJQFZGOIVBDIMZ-WHOFXGATSA-N 0.000 description 1
- RORUIHAWOLADSH-HJWJTTGWSA-N Phe-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 RORUIHAWOLADSH-HJWJTTGWSA-N 0.000 description 1
- KXUZHWXENMYOHC-QEJZJMRPSA-N Phe-Leu-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUZHWXENMYOHC-QEJZJMRPSA-N 0.000 description 1
- TXKWKTWYTIAZSV-KKUMJFAQSA-N Phe-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N TXKWKTWYTIAZSV-KKUMJFAQSA-N 0.000 description 1
- RSPUIENXSJYZQO-JYJNAYRXSA-N Phe-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RSPUIENXSJYZQO-JYJNAYRXSA-N 0.000 description 1
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 1
- OSBADCBXAMSPQD-YESZJQIVSA-N Phe-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N OSBADCBXAMSPQD-YESZJQIVSA-N 0.000 description 1
- CMHTUJQZQXFNTQ-OEAJRASXSA-N Phe-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O CMHTUJQZQXFNTQ-OEAJRASXSA-N 0.000 description 1
- FQUUYTNBMIBOHS-IHRRRGAJSA-N Phe-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N FQUUYTNBMIBOHS-IHRRRGAJSA-N 0.000 description 1
- JKJSIYKSGIDHPM-WBAXXEDZSA-N Phe-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O JKJSIYKSGIDHPM-WBAXXEDZSA-N 0.000 description 1
- OXKJSGGTHFMGDT-UFYCRDLUSA-N Phe-Phe-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)C1=CC=CC=C1 OXKJSGGTHFMGDT-UFYCRDLUSA-N 0.000 description 1
- ROOQMPCUFLDOSB-FHWLQOOXSA-N Phe-Phe-Gln Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CC=CC=C1 ROOQMPCUFLDOSB-FHWLQOOXSA-N 0.000 description 1
- WKLMCMXFMQEKCX-SLFFLAALSA-N Phe-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O WKLMCMXFMQEKCX-SLFFLAALSA-N 0.000 description 1
- MGLBSROLWAWCKN-FCLVOEFKSA-N Phe-Phe-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MGLBSROLWAWCKN-FCLVOEFKSA-N 0.000 description 1
- ZLAKUZDMKVKFAI-JYJNAYRXSA-N Phe-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O ZLAKUZDMKVKFAI-JYJNAYRXSA-N 0.000 description 1
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 1
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 1
- HBXAOEBRGLCLIW-AVGNSLFASA-N Phe-Ser-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HBXAOEBRGLCLIW-AVGNSLFASA-N 0.000 description 1
- MRWOVVNKSXXLRP-IHPCNDPISA-N Phe-Ser-Trp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O MRWOVVNKSXXLRP-IHPCNDPISA-N 0.000 description 1
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 1
- LTAWNJXSRUCFAN-UNQGMJICSA-N Phe-Thr-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LTAWNJXSRUCFAN-UNQGMJICSA-N 0.000 description 1
- RAGOJJCBGXARPO-XVSYOHENSA-N Phe-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RAGOJJCBGXARPO-XVSYOHENSA-N 0.000 description 1
- YDUGVDGFKNXFPL-IXOXFDKPSA-N Phe-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YDUGVDGFKNXFPL-IXOXFDKPSA-N 0.000 description 1
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 description 1
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 1
- VFDRDMOMHBJGKD-UFYCRDLUSA-N Phe-Tyr-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N VFDRDMOMHBJGKD-UFYCRDLUSA-N 0.000 description 1
- GTMSCDVFQLNEOY-BZSNNMDCSA-N Phe-Tyr-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N GTMSCDVFQLNEOY-BZSNNMDCSA-N 0.000 description 1
- APZNYJFGVAGFCF-JYJNAYRXSA-N Phe-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccccc1)C(C)C)C(O)=O APZNYJFGVAGFCF-JYJNAYRXSA-N 0.000 description 1
- 108010069013 Phenylalanine Hydroxylase Proteins 0.000 description 1
- 102100038223 Phenylalanine-4-hydroxylase Human genes 0.000 description 1
- 241000713137 Phlebovirus Species 0.000 description 1
- 108010064071 Phosphorylase Kinase Proteins 0.000 description 1
- 102000014750 Phosphorylase Kinase Human genes 0.000 description 1
- 108010073135 Phosphorylases Proteins 0.000 description 1
- 102000009097 Phosphorylases Human genes 0.000 description 1
- 208000004842 Pinta Diseases 0.000 description 1
- 206010035148 Plague Diseases 0.000 description 1
- 241000233872 Pneumocystis carinii Species 0.000 description 1
- 241000711902 Pneumovirus Species 0.000 description 1
- 241000276498 Pollachius virens Species 0.000 description 1
- 101710159752 Poly(3-hydroxyalkanoate) polymerase subunit PhaE Proteins 0.000 description 1
- 108010039918 Polylysine Proteins 0.000 description 1
- 241001505332 Polyomavirus sp. Species 0.000 description 1
- 241000156302 Porcine hemagglutinating encephalomyelitis virus Species 0.000 description 1
- 241000702619 Porcine parvovirus Species 0.000 description 1
- 241001135989 Porcine reproductive and respiratory syndrome virus Species 0.000 description 1
- 102100034391 Porphobilinogen deaminase Human genes 0.000 description 1
- 101100102840 Potato mop-top virus (isolate Potato/Sweden/Sw) 8K protein gene Proteins 0.000 description 1
- 101710193132 Pre-hexon-linking protein VIII Proteins 0.000 description 1
- 101710143509 Pre-histone-like nucleoprotein Proteins 0.000 description 1
- 108010035004 Prephenate Dehydrogenase Proteins 0.000 description 1
- 241000288906 Primates Species 0.000 description 1
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 1
- DRVIASBABBMZTF-GUBZILKMSA-N Pro-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@@H]1CCCN1 DRVIASBABBMZTF-GUBZILKMSA-N 0.000 description 1
- CQZNGNCAIXMAIQ-UBHSHLNASA-N Pro-Ala-Phe Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O CQZNGNCAIXMAIQ-UBHSHLNASA-N 0.000 description 1
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 1
- NHDVNAKDACFHPX-GUBZILKMSA-N Pro-Arg-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O NHDVNAKDACFHPX-GUBZILKMSA-N 0.000 description 1
- QSKCKTUQPICLSO-AVGNSLFASA-N Pro-Arg-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O QSKCKTUQPICLSO-AVGNSLFASA-N 0.000 description 1
- CYQQWUPHIZVCNY-GUBZILKMSA-N Pro-Arg-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CYQQWUPHIZVCNY-GUBZILKMSA-N 0.000 description 1
- ZSKJPKFTPQCPIH-RCWTZXSCSA-N Pro-Arg-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSKJPKFTPQCPIH-RCWTZXSCSA-N 0.000 description 1
- OYEUSRAZOGIDBY-JYJNAYRXSA-N Pro-Arg-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OYEUSRAZOGIDBY-JYJNAYRXSA-N 0.000 description 1
- ORPZXBQTEHINPB-SRVKXCTJSA-N Pro-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H]1CCCN1)C(O)=O ORPZXBQTEHINPB-SRVKXCTJSA-N 0.000 description 1
- VPVHXWGPALPDGP-GUBZILKMSA-N Pro-Asn-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPVHXWGPALPDGP-GUBZILKMSA-N 0.000 description 1
- OBVCYFIHIIYIQF-CIUDSAMLSA-N Pro-Asn-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OBVCYFIHIIYIQF-CIUDSAMLSA-N 0.000 description 1
- VOHFZDSRPZLXLH-IHRRRGAJSA-N Pro-Asn-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VOHFZDSRPZLXLH-IHRRRGAJSA-N 0.000 description 1
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 1
- XKHCJJPNXFBADI-DCAQKATOSA-N Pro-Asp-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O XKHCJJPNXFBADI-DCAQKATOSA-N 0.000 description 1
- YFNOUBWUIIJQHF-LPEHRKFASA-N Pro-Asp-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O YFNOUBWUIIJQHF-LPEHRKFASA-N 0.000 description 1
- AIZVVCMAFRREQS-GUBZILKMSA-N Pro-Cys-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AIZVVCMAFRREQS-GUBZILKMSA-N 0.000 description 1
- ZBAGOWGNNAXMOY-IHRRRGAJSA-N Pro-Cys-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZBAGOWGNNAXMOY-IHRRRGAJSA-N 0.000 description 1
- OZAPWFHRPINHND-GUBZILKMSA-N Pro-Cys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O OZAPWFHRPINHND-GUBZILKMSA-N 0.000 description 1
- KTFZQPLSPLWLKN-KKUMJFAQSA-N Pro-Gln-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KTFZQPLSPLWLKN-KKUMJFAQSA-N 0.000 description 1
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 1
- PULPZRAHVFBVTO-DCAQKATOSA-N Pro-Glu-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PULPZRAHVFBVTO-DCAQKATOSA-N 0.000 description 1
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 1
- VPFGPKIWSDVTOY-SRVKXCTJSA-N Pro-Glu-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O VPFGPKIWSDVTOY-SRVKXCTJSA-N 0.000 description 1
- VOZIBWWZSBIXQN-SRVKXCTJSA-N Pro-Glu-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O VOZIBWWZSBIXQN-SRVKXCTJSA-N 0.000 description 1
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 1
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 1
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 1
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 1
- LCUOTSLIVGSGAU-AVGNSLFASA-N Pro-His-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LCUOTSLIVGSGAU-AVGNSLFASA-N 0.000 description 1
- AJCRQOHDLCBHFA-SRVKXCTJSA-N Pro-His-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AJCRQOHDLCBHFA-SRVKXCTJSA-N 0.000 description 1
- JUJGNDZIKKQMDJ-IHRRRGAJSA-N Pro-His-His Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O JUJGNDZIKKQMDJ-IHRRRGAJSA-N 0.000 description 1
- AQSMZTIEJMZQEC-DCAQKATOSA-N Pro-His-Ser Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CO)C(=O)O AQSMZTIEJMZQEC-DCAQKATOSA-N 0.000 description 1
- SOACYAXADBWDDT-CYDGBPFRSA-N Pro-Ile-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SOACYAXADBWDDT-CYDGBPFRSA-N 0.000 description 1
- BWCZJGJKOFUUCN-ZPFDUUQYSA-N Pro-Ile-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O BWCZJGJKOFUUCN-ZPFDUUQYSA-N 0.000 description 1
- UREQLMJCKFLLHM-NAKRPEOUSA-N Pro-Ile-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UREQLMJCKFLLHM-NAKRPEOUSA-N 0.000 description 1
- RYJRPPUATSKNAY-STECZYCISA-N Pro-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@@H]2CCCN2 RYJRPPUATSKNAY-STECZYCISA-N 0.000 description 1
- BRJGUPWVFXKBQI-XUXIUFHCSA-N Pro-Leu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRJGUPWVFXKBQI-XUXIUFHCSA-N 0.000 description 1
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 1
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 1
- JUJCUYWRJMFJJF-AVGNSLFASA-N Pro-Lys-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 JUJCUYWRJMFJJF-AVGNSLFASA-N 0.000 description 1
- INDVYIOKMXFQFM-SRVKXCTJSA-N Pro-Lys-Gln Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O INDVYIOKMXFQFM-SRVKXCTJSA-N 0.000 description 1
- XQPHBAKJJJZOBX-SRVKXCTJSA-N Pro-Lys-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O XQPHBAKJJJZOBX-SRVKXCTJSA-N 0.000 description 1
- ABSSTGUCBCDKMU-UWVGGRQHSA-N Pro-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 ABSSTGUCBCDKMU-UWVGGRQHSA-N 0.000 description 1
- WFIVLLFYUZZWOD-RHYQMDGZSA-N Pro-Lys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WFIVLLFYUZZWOD-RHYQMDGZSA-N 0.000 description 1
- WOIFYRZPIORBRY-AVGNSLFASA-N Pro-Lys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WOIFYRZPIORBRY-AVGNSLFASA-N 0.000 description 1
- HBBBLSVBQGZKOZ-GUBZILKMSA-N Pro-Met-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O HBBBLSVBQGZKOZ-GUBZILKMSA-N 0.000 description 1
- NTXFLJULRHQMDC-GUBZILKMSA-N Pro-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@@H]1CCCN1 NTXFLJULRHQMDC-GUBZILKMSA-N 0.000 description 1
- WLJYLAQSUSIQNH-GUBZILKMSA-N Pro-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@@H]1CCCN1 WLJYLAQSUSIQNH-GUBZILKMSA-N 0.000 description 1
- ZUZINZIJHJFJRN-UBHSHLNASA-N Pro-Phe-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 ZUZINZIJHJFJRN-UBHSHLNASA-N 0.000 description 1
- MLKVIVZCFYRTIR-KKUMJFAQSA-N Pro-Phe-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLKVIVZCFYRTIR-KKUMJFAQSA-N 0.000 description 1
- GNADVDLLGVSXLS-ULQDDVLXSA-N Pro-Phe-His Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O GNADVDLLGVSXLS-ULQDDVLXSA-N 0.000 description 1
- WHNJMTHJGCEKGA-ULQDDVLXSA-N Pro-Phe-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WHNJMTHJGCEKGA-ULQDDVLXSA-N 0.000 description 1
- JLMZKEQFMVORMA-SRVKXCTJSA-N Pro-Pro-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 JLMZKEQFMVORMA-SRVKXCTJSA-N 0.000 description 1
- RFWXYTJSVDUBBZ-DCAQKATOSA-N Pro-Pro-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RFWXYTJSVDUBBZ-DCAQKATOSA-N 0.000 description 1
- PCWLNNZTBJTZRN-AVGNSLFASA-N Pro-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 PCWLNNZTBJTZRN-AVGNSLFASA-N 0.000 description 1
- NAIPAPCKKRCMBL-JYJNAYRXSA-N Pro-Pro-Phe Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=CC=C1 NAIPAPCKKRCMBL-JYJNAYRXSA-N 0.000 description 1
- OWQXAJQZLWHPBH-FXQIFTODSA-N Pro-Ser-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O OWQXAJQZLWHPBH-FXQIFTODSA-N 0.000 description 1
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 1
- SXJOPONICMGFCR-DCAQKATOSA-N Pro-Ser-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O SXJOPONICMGFCR-DCAQKATOSA-N 0.000 description 1
- BJCXXMGGPHRSHV-GUBZILKMSA-N Pro-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BJCXXMGGPHRSHV-GUBZILKMSA-N 0.000 description 1
- QKDIHFHGHBYTKB-IHRRRGAJSA-N Pro-Ser-Phe Chemical compound N([C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 QKDIHFHGHBYTKB-IHRRRGAJSA-N 0.000 description 1
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 1
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 1
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 1
- IURWWZYKYPEANQ-HJGDQZAQSA-N Pro-Thr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IURWWZYKYPEANQ-HJGDQZAQSA-N 0.000 description 1
- GBUNEGKQPSAMNK-QTKMDUPCSA-N Pro-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2)O GBUNEGKQPSAMNK-QTKMDUPCSA-N 0.000 description 1
- CHYAYDLYYIJCKY-OSUNSFLBSA-N Pro-Thr-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CHYAYDLYYIJCKY-OSUNSFLBSA-N 0.000 description 1
- BNUKRHFCHHLIGR-JYJNAYRXSA-N Pro-Trp-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CC(=O)O)C(=O)O BNUKRHFCHHLIGR-JYJNAYRXSA-N 0.000 description 1
- SNSYSBUTTJBPDG-OKZBNKHCSA-N Pro-Trp-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N4CCC[C@@H]4C(=O)O SNSYSBUTTJBPDG-OKZBNKHCSA-N 0.000 description 1
- ZYJMLBCDFPIGNL-JYJNAYRXSA-N Pro-Tyr-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H]1CCCN1)C(O)=O ZYJMLBCDFPIGNL-JYJNAYRXSA-N 0.000 description 1
- FZXSYIPVAFVYBH-KKUMJFAQSA-N Pro-Tyr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O FZXSYIPVAFVYBH-KKUMJFAQSA-N 0.000 description 1
- LZHHZYDPMZEMRX-STQMWFEESA-N Pro-Tyr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O LZHHZYDPMZEMRX-STQMWFEESA-N 0.000 description 1
- VEUACYMXJKXALX-IHRRRGAJSA-N Pro-Tyr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VEUACYMXJKXALX-IHRRRGAJSA-N 0.000 description 1
- FIDNSJUXESUDOV-JYJNAYRXSA-N Pro-Tyr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O FIDNSJUXESUDOV-JYJNAYRXSA-N 0.000 description 1
- STGVYUTZKGPRCI-GUBZILKMSA-N Pro-Val-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 STGVYUTZKGPRCI-GUBZILKMSA-N 0.000 description 1
- OQSGBXGNAFQGGS-CYDGBPFRSA-N Pro-Val-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OQSGBXGNAFQGGS-CYDGBPFRSA-N 0.000 description 1
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 1
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- 101710130262 Probable Vpr-like protein Proteins 0.000 description 1
- 101710093543 Probable non-specific lipid-transfer protein Proteins 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 101710192141 Protein Nef Proteins 0.000 description 1
- 101710150344 Protein Rev Proteins 0.000 description 1
- 229940096437 Protein S Drugs 0.000 description 1
- 241000125945 Protoparvovirus Species 0.000 description 1
- 101000584831 Pseudoalteromonas phage PM2 Protein P6 Proteins 0.000 description 1
- 241000589516 Pseudomonas Species 0.000 description 1
- 206010037151 Psittacosis Diseases 0.000 description 1
- 101710090523 Putative movement protein Proteins 0.000 description 1
- 206010037660 Pyrexia Diseases 0.000 description 1
- 108010054530 RGDN peptide Proteins 0.000 description 1
- 101710118046 RNA-directed RNA polymerase Proteins 0.000 description 1
- 108091030071 RNAI Proteins 0.000 description 1
- 206010037742 Rabies Diseases 0.000 description 1
- 241000700159 Rattus Species 0.000 description 1
- 101000702488 Rattus norvegicus High affinity cationic amino acid transporter 1 Proteins 0.000 description 1
- 241000734695 Recchia Species 0.000 description 1
- 108091081062 Repeated sequence (DNA) Proteins 0.000 description 1
- 241000725643 Respiratory syncytial virus Species 0.000 description 1
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 1
- 241000701037 Rhadinovirus Species 0.000 description 1
- 206010051497 Rhinotracheitis Diseases 0.000 description 1
- 101000974028 Rhizobium leguminosarum bv. viciae (strain 3841) Putative cystathionine beta-lyase Proteins 0.000 description 1
- 101000756519 Rhodobacter capsulatus (strain ATCC BAA-309 / NBRC 16581 / SB1003) Uncharacterized protein RCAP_rcc00048 Proteins 0.000 description 1
- 101000948219 Rhodococcus erythropolis Uncharacterized 11.5 kDa protein in thcD 3'region Proteins 0.000 description 1
- 108010039491 Ricin Proteins 0.000 description 1
- 235000004443 Ricinus communis Nutrition 0.000 description 1
- 240000000528 Ricinus communis Species 0.000 description 1
- 241000606723 Rickettsia akari Species 0.000 description 1
- 241000606651 Rickettsiales Species 0.000 description 1
- 201000004282 Rickettsialpox Diseases 0.000 description 1
- 208000000705 Rift Valley Fever Diseases 0.000 description 1
- 208000006257 Rinderpest Diseases 0.000 description 1
- 206010039207 Rocky Mountain Spotted Fever Diseases 0.000 description 1
- 241000710942 Ross River virus Species 0.000 description 1
- 241000702670 Rotavirus Species 0.000 description 1
- 241000710799 Rubella virus Species 0.000 description 1
- 241000710801 Rubivirus Species 0.000 description 1
- 241001533467 Rubulavirus Species 0.000 description 1
- 108091006629 SLC13A2 Proteins 0.000 description 1
- 241000607142 Salmonella Species 0.000 description 1
- 108050003978 Semaphorin Proteins 0.000 description 1
- 102000014105 Semaphorin Human genes 0.000 description 1
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 1
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 1
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 1
- MMGJPDWSIOAGTH-ACZMJKKPSA-N Ser-Ala-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MMGJPDWSIOAGTH-ACZMJKKPSA-N 0.000 description 1
- DWUIECHTAMYEFL-XVYDVKMFSA-N Ser-Ala-His Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DWUIECHTAMYEFL-XVYDVKMFSA-N 0.000 description 1
- PZZJMBYSYAKYPK-UWJYBYFXSA-N Ser-Ala-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PZZJMBYSYAKYPK-UWJYBYFXSA-N 0.000 description 1
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 1
- JJKSSJVYOVRJMZ-FXQIFTODSA-N Ser-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)CN=C(N)N JJKSSJVYOVRJMZ-FXQIFTODSA-N 0.000 description 1
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 1
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 1
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 1
- OOKCGAYXSNJBGQ-ZLUOBGJFSA-N Ser-Asn-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OOKCGAYXSNJBGQ-ZLUOBGJFSA-N 0.000 description 1
- UCXDHBORXLVBNC-ZLUOBGJFSA-N Ser-Asn-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O UCXDHBORXLVBNC-ZLUOBGJFSA-N 0.000 description 1
- ZXLUWXWISXIFIX-ACZMJKKPSA-N Ser-Asn-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZXLUWXWISXIFIX-ACZMJKKPSA-N 0.000 description 1
- YMEXHZTVKDAKIY-GHCJXIJMSA-N Ser-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO)C(O)=O YMEXHZTVKDAKIY-GHCJXIJMSA-N 0.000 description 1
- KAAPNMOKUUPKOE-SRVKXCTJSA-N Ser-Asn-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KAAPNMOKUUPKOE-SRVKXCTJSA-N 0.000 description 1
- ICHZYBVODUVUKN-SRVKXCTJSA-N Ser-Asn-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ICHZYBVODUVUKN-SRVKXCTJSA-N 0.000 description 1
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 1
- MESDJCNHLZBMEP-ZLUOBGJFSA-N Ser-Asp-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MESDJCNHLZBMEP-ZLUOBGJFSA-N 0.000 description 1
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 1
- DBIDZNUXSLXVRG-FXQIFTODSA-N Ser-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N DBIDZNUXSLXVRG-FXQIFTODSA-N 0.000 description 1
- WTPKKLMBNBCCNL-ACZMJKKPSA-N Ser-Cys-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N WTPKKLMBNBCCNL-ACZMJKKPSA-N 0.000 description 1
- RNMRYWZYFHHOEV-CIUDSAMLSA-N Ser-Gln-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RNMRYWZYFHHOEV-CIUDSAMLSA-N 0.000 description 1
- XWCYBVBLJRWOFR-WDSKDSINSA-N Ser-Gln-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O XWCYBVBLJRWOFR-WDSKDSINSA-N 0.000 description 1
- DGHFNYXVIXNNMC-GUBZILKMSA-N Ser-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N DGHFNYXVIXNNMC-GUBZILKMSA-N 0.000 description 1
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 1
- KJMOINFQVCCSDX-XKBZYTNZSA-N Ser-Gln-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KJMOINFQVCCSDX-XKBZYTNZSA-N 0.000 description 1
- DGPGKMKUNGKHPK-QEJZJMRPSA-N Ser-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N DGPGKMKUNGKHPK-QEJZJMRPSA-N 0.000 description 1
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 1
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 1
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 1
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 1
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 1
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 1
- KDGARKCAKHBEDB-NKWVEPMBSA-N Ser-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CO)N)C(=O)O KDGARKCAKHBEDB-NKWVEPMBSA-N 0.000 description 1
- OQPNSDWGAMFJNU-QWRGUYRKSA-N Ser-Gly-Tyr Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OQPNSDWGAMFJNU-QWRGUYRKSA-N 0.000 description 1
- FYUIFUJFNCLUIX-XVYDVKMFSA-N Ser-His-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O FYUIFUJFNCLUIX-XVYDVKMFSA-N 0.000 description 1
- CICQXRWZNVXFCU-SRVKXCTJSA-N Ser-His-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O CICQXRWZNVXFCU-SRVKXCTJSA-N 0.000 description 1
- HZNFKPJCGZXKIC-DCAQKATOSA-N Ser-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N HZNFKPJCGZXKIC-DCAQKATOSA-N 0.000 description 1
- MLSQXWSRHURDMF-GARJFASQSA-N Ser-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CO)N)C(=O)O MLSQXWSRHURDMF-GARJFASQSA-N 0.000 description 1
- YIUWWXVTYLANCJ-NAKRPEOUSA-N Ser-Ile-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YIUWWXVTYLANCJ-NAKRPEOUSA-N 0.000 description 1
- BKZYBLLIBOBOOW-GHCJXIJMSA-N Ser-Ile-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O BKZYBLLIBOBOOW-GHCJXIJMSA-N 0.000 description 1
- DLPXTCTVNDTYGJ-JBDRJPRFSA-N Ser-Ile-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(O)=O DLPXTCTVNDTYGJ-JBDRJPRFSA-N 0.000 description 1
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 1
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 1
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 1
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 1
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 1
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 1
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 1
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 1
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 1
- BYCVMHKULKRVPV-GUBZILKMSA-N Ser-Lys-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYCVMHKULKRVPV-GUBZILKMSA-N 0.000 description 1
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 1
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 1
- JUTGONBTALQWMK-NAKRPEOUSA-N Ser-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CO)N JUTGONBTALQWMK-NAKRPEOUSA-N 0.000 description 1
- XNXRTQZTFVMJIJ-DCAQKATOSA-N Ser-Met-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNXRTQZTFVMJIJ-DCAQKATOSA-N 0.000 description 1
- IFLVBVIYADZIQO-DCAQKATOSA-N Ser-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N IFLVBVIYADZIQO-DCAQKATOSA-N 0.000 description 1
- VIIJCAQMJBHSJH-FXQIFTODSA-N Ser-Met-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O VIIJCAQMJBHSJH-FXQIFTODSA-N 0.000 description 1
- BUYHXYIUQUBEQP-AVGNSLFASA-N Ser-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N BUYHXYIUQUBEQP-AVGNSLFASA-N 0.000 description 1
- RWDVVSKYZBNDCO-MELADBBJSA-N Ser-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CO)N)C(=O)O RWDVVSKYZBNDCO-MELADBBJSA-N 0.000 description 1
- FBLNYDYPCLFTSP-IXOXFDKPSA-N Ser-Phe-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FBLNYDYPCLFTSP-IXOXFDKPSA-N 0.000 description 1
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 1
- JLKWJWPDXPKKHI-FXQIFTODSA-N Ser-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC(=O)N)C(=O)O JLKWJWPDXPKKHI-FXQIFTODSA-N 0.000 description 1
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 1
- FLONGDPORFIVQW-XGEHTFHBSA-N Ser-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FLONGDPORFIVQW-XGEHTFHBSA-N 0.000 description 1
- XGQKSRGHEZNWIS-IHRRRGAJSA-N Ser-Pro-Tyr Chemical compound N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O XGQKSRGHEZNWIS-IHRRRGAJSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 1
- AABIBDJHSKIMJK-FXQIFTODSA-N Ser-Ser-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O AABIBDJHSKIMJK-FXQIFTODSA-N 0.000 description 1
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 1
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 1
- UYLKOSODXYSWMQ-XGEHTFHBSA-N Ser-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CO)N)O UYLKOSODXYSWMQ-XGEHTFHBSA-N 0.000 description 1
- WMZVVNLPHFSUPA-BPUTZDHNSA-N Ser-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 WMZVVNLPHFSUPA-BPUTZDHNSA-N 0.000 description 1
- PIQRHJQWEPWFJG-UWJYBYFXSA-N Ser-Tyr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PIQRHJQWEPWFJG-UWJYBYFXSA-N 0.000 description 1
- GSCVDSBEYVGMJQ-SRVKXCTJSA-N Ser-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)O GSCVDSBEYVGMJQ-SRVKXCTJSA-N 0.000 description 1
- QYBRQMLZDDJBSW-AVGNSLFASA-N Ser-Tyr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYBRQMLZDDJBSW-AVGNSLFASA-N 0.000 description 1
- YXGCIEUDOHILKR-IHRRRGAJSA-N Ser-Tyr-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CO)N YXGCIEUDOHILKR-IHRRRGAJSA-N 0.000 description 1
- OQSQCUWQOIHECT-YJRXYDGGSA-N Ser-Tyr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OQSQCUWQOIHECT-YJRXYDGGSA-N 0.000 description 1
- HAYADTTXNZFUDM-IHRRRGAJSA-N Ser-Tyr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HAYADTTXNZFUDM-IHRRRGAJSA-N 0.000 description 1
- LGIMRDKGABDMBN-DCAQKATOSA-N Ser-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N LGIMRDKGABDMBN-DCAQKATOSA-N 0.000 description 1
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 1
- 241000607768 Shigella Species 0.000 description 1
- 241000700584 Simplexvirus Species 0.000 description 1
- 241000710960 Sindbis virus Species 0.000 description 1
- 208000021386 Sjogren Syndrome Diseases 0.000 description 1
- 101710198474 Spike protein Proteins 0.000 description 1
- 241000605008 Spirillum Species 0.000 description 1
- 206010041736 Sporotrichosis Diseases 0.000 description 1
- 206010041896 St. Louis Encephalitis Diseases 0.000 description 1
- 241000710888 St. Louis encephalitis virus Species 0.000 description 1
- 241000295644 Staphylococcaceae Species 0.000 description 1
- 241000191940 Staphylococcus Species 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 241001478880 Streptobacillus moniliformis Species 0.000 description 1
- 101000936711 Streptococcus gordonii Accessory secretory protein Asp4 Proteins 0.000 description 1
- 101000929863 Streptomyces cinnamonensis Monensin polyketide synthase putative ketoacyl reductase Proteins 0.000 description 1
- 101000788468 Streptomyces coelicolor Uncharacterized protein in mprR 3'region Proteins 0.000 description 1
- 101000845085 Streptomyces violaceoruber Granaticin polyketide synthase putative ketoacyl reductase 1 Proteins 0.000 description 1
- 101710172711 Structural protein Proteins 0.000 description 1
- 241000700568 Suipoxvirus Species 0.000 description 1
- 101001062859 Sus scrofa Fatty acid-binding protein, adipocyte Proteins 0.000 description 1
- 230000005867 T cell response Effects 0.000 description 1
- 206010042971 T-cell lymphoma Diseases 0.000 description 1
- 108700026226 TATA Box Proteins 0.000 description 1
- 206010043376 Tetanus Diseases 0.000 description 1
- 101000711771 Thiocystis violacea Uncharacterized 76.5 kDa protein in phbC 3'region Proteins 0.000 description 1
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 1
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 1
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 1
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 1
- DGDCHPCRMWEOJR-FQPOAREZSA-N Thr-Ala-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DGDCHPCRMWEOJR-FQPOAREZSA-N 0.000 description 1
- JMQUAZXYFAEOIH-XGEHTFHBSA-N Thr-Arg-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N)O JMQUAZXYFAEOIH-XGEHTFHBSA-N 0.000 description 1
- LHUBVKCLOVALIA-HJGDQZAQSA-N Thr-Arg-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O LHUBVKCLOVALIA-HJGDQZAQSA-N 0.000 description 1
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 1
- GZYNMZQXFRWDFH-YTWAJWBKSA-N Thr-Arg-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O GZYNMZQXFRWDFH-YTWAJWBKSA-N 0.000 description 1
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 1
- CTONFVDJYCAMQM-IUKAMOBKSA-N Thr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H]([C@@H](C)O)N CTONFVDJYCAMQM-IUKAMOBKSA-N 0.000 description 1
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 1
- JTEICXDKGWKRRV-HJGDQZAQSA-N Thr-Asn-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JTEICXDKGWKRRV-HJGDQZAQSA-N 0.000 description 1
- PZVGOVRNGKEFCB-KKHAAJSZSA-N Thr-Asn-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N)O PZVGOVRNGKEFCB-KKHAAJSZSA-N 0.000 description 1
- VXMHQKHDKCATDV-VEVYYDQMSA-N Thr-Asp-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VXMHQKHDKCATDV-VEVYYDQMSA-N 0.000 description 1
- YOSLMIPKOUAHKI-OLHMAJIHSA-N Thr-Asp-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O YOSLMIPKOUAHKI-OLHMAJIHSA-N 0.000 description 1
- DCCGCVLVVSAJFK-NUMRIWBASA-N Thr-Asp-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O DCCGCVLVVSAJFK-NUMRIWBASA-N 0.000 description 1
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 1
- QWMPARMKIDVBLV-VZFHVOOUSA-N Thr-Cys-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O QWMPARMKIDVBLV-VZFHVOOUSA-N 0.000 description 1
- UTCFSBBXPWKLTG-XKBZYTNZSA-N Thr-Cys-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O UTCFSBBXPWKLTG-XKBZYTNZSA-N 0.000 description 1
- LYGKYFKSZTUXGZ-ZDLURKLDSA-N Thr-Cys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)NCC(O)=O LYGKYFKSZTUXGZ-ZDLURKLDSA-N 0.000 description 1
- ZLNWJMRLHLGKFX-SVSWQMSJSA-N Thr-Cys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZLNWJMRLHLGKFX-SVSWQMSJSA-N 0.000 description 1
- OYTNZCBFDXGQGE-XQXXSGGOSA-N Thr-Gln-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O OYTNZCBFDXGQGE-XQXXSGGOSA-N 0.000 description 1
- QILPDQCTQZDHFM-HJGDQZAQSA-N Thr-Gln-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QILPDQCTQZDHFM-HJGDQZAQSA-N 0.000 description 1
- ZQUKYJOKQBRBCS-GLLZPBPUSA-N Thr-Gln-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O ZQUKYJOKQBRBCS-GLLZPBPUSA-N 0.000 description 1
- VUVCRYXYUUPGSB-GLLZPBPUSA-N Thr-Gln-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O VUVCRYXYUUPGSB-GLLZPBPUSA-N 0.000 description 1
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 1
- DIPIPFHFLPTCLK-LOKLDPHHSA-N Thr-Gln-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O DIPIPFHFLPTCLK-LOKLDPHHSA-N 0.000 description 1
- KGKWKSSSQGGYAU-SUSMZKCASA-N Thr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KGKWKSSSQGGYAU-SUSMZKCASA-N 0.000 description 1
- JMGJDTNUMAZNLX-RWRJDSDZSA-N Thr-Glu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JMGJDTNUMAZNLX-RWRJDSDZSA-N 0.000 description 1
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 1
- OQCXTUQTKQFDCX-HTUGSXCWSA-N Thr-Glu-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O OQCXTUQTKQFDCX-HTUGSXCWSA-N 0.000 description 1
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 1
- WYKJENSCCRJLRC-ZDLURKLDSA-N Thr-Gly-Cys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)O WYKJENSCCRJLRC-ZDLURKLDSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 1
- AYCQVUUPIJHJTA-IXOXFDKPSA-N Thr-His-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O AYCQVUUPIJHJTA-IXOXFDKPSA-N 0.000 description 1
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 1
- IHAPJUHCZXBPHR-WZLNRYEVSA-N Thr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N IHAPJUHCZXBPHR-WZLNRYEVSA-N 0.000 description 1
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 1
- HOVLHEKTGVIKAP-WDCWCFNPSA-N Thr-Leu-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HOVLHEKTGVIKAP-WDCWCFNPSA-N 0.000 description 1
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 1
- IJVNLNRVDUTWDD-MEYUZBJRSA-N Thr-Leu-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IJVNLNRVDUTWDD-MEYUZBJRSA-N 0.000 description 1
- BDGBHYCAZJPLHX-HJGDQZAQSA-N Thr-Lys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BDGBHYCAZJPLHX-HJGDQZAQSA-N 0.000 description 1
- HPQHHRLWSAMMKG-KATARQTJSA-N Thr-Lys-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N)O HPQHHRLWSAMMKG-KATARQTJSA-N 0.000 description 1
- SCSVNSNWUTYSFO-WDCWCFNPSA-N Thr-Lys-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O SCSVNSNWUTYSFO-WDCWCFNPSA-N 0.000 description 1
- JLNMFGCJODTXDH-WEDXCCLWSA-N Thr-Lys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O JLNMFGCJODTXDH-WEDXCCLWSA-N 0.000 description 1
- XSEPSRUDSPHMPX-KATARQTJSA-N Thr-Lys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O XSEPSRUDSPHMPX-KATARQTJSA-N 0.000 description 1
- PUEWAXRPXOEQOW-HJGDQZAQSA-N Thr-Met-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(O)=O PUEWAXRPXOEQOW-HJGDQZAQSA-N 0.000 description 1
- QHUWWSQZTFLXPQ-FJXKBIBVSA-N Thr-Met-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QHUWWSQZTFLXPQ-FJXKBIBVSA-N 0.000 description 1
- QFEYTTHKPSOFLV-OSUNSFLBSA-N Thr-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H]([C@@H](C)O)N QFEYTTHKPSOFLV-OSUNSFLBSA-N 0.000 description 1
- XNTVWRJTUIOGQO-RHYQMDGZSA-N Thr-Met-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNTVWRJTUIOGQO-RHYQMDGZSA-N 0.000 description 1
- GYUUYCIXELGTJS-MEYUZBJRSA-N Thr-Phe-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O GYUUYCIXELGTJS-MEYUZBJRSA-N 0.000 description 1
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 1
- GFRIEEKFXOVPIR-RHYQMDGZSA-N Thr-Pro-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O GFRIEEKFXOVPIR-RHYQMDGZSA-N 0.000 description 1
- OLFOOYQTTQSSRK-UNQGMJICSA-N Thr-Pro-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLFOOYQTTQSSRK-UNQGMJICSA-N 0.000 description 1
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 1
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 1
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 1
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 1
- AAZOYLQUEQRUMZ-GSSVUCPTSA-N Thr-Thr-Asn Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O AAZOYLQUEQRUMZ-GSSVUCPTSA-N 0.000 description 1
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 1
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 1
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 1
- PJCYRZVSACOYSN-ZJDVBMNYSA-N Thr-Thr-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O PJCYRZVSACOYSN-ZJDVBMNYSA-N 0.000 description 1
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 1
- LXXCHJKHJYRMIY-FQPOAREZSA-N Thr-Tyr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O LXXCHJKHJYRMIY-FQPOAREZSA-N 0.000 description 1
- BEZTUFWTPVOROW-KJEVXHAQSA-N Thr-Tyr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O BEZTUFWTPVOROW-KJEVXHAQSA-N 0.000 description 1
- REJRKTOJTCPDPO-IRIUXVKKSA-N Thr-Tyr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O REJRKTOJTCPDPO-IRIUXVKKSA-N 0.000 description 1
- BKIOKSLLAAZYTC-KKHAAJSZSA-N Thr-Val-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O BKIOKSLLAAZYTC-KKHAAJSZSA-N 0.000 description 1
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 1
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 1
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 1
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 1
- 108010022394 Threonine synthase Proteins 0.000 description 1
- 102000006601 Thymidine Kinase Human genes 0.000 description 1
- 108020004440 Thymidine kinase Proteins 0.000 description 1
- 208000002474 Tinea Diseases 0.000 description 1
- 241000223997 Toxoplasma gondii Species 0.000 description 1
- 101001023030 Toxoplasma gondii Myosin-D Proteins 0.000 description 1
- 201000005485 Toxoplasmosis Diseases 0.000 description 1
- 108010018242 Transcription Factor AP-1 Proteins 0.000 description 1
- 102000040945 Transcription factor Human genes 0.000 description 1
- 108091023040 Transcription factor Proteins 0.000 description 1
- 102100028507 Transcription factor E3 Human genes 0.000 description 1
- 108020004566 Transfer RNA Proteins 0.000 description 1
- 101800001690 Transmembrane protein gp41 Proteins 0.000 description 1
- 241000242541 Trematoda Species 0.000 description 1
- 241000869417 Trematodes Species 0.000 description 1
- 206010044608 Trichiniasis Diseases 0.000 description 1
- NIWAGRRZHCMPOY-GMVOTWDCSA-N Trp-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N NIWAGRRZHCMPOY-GMVOTWDCSA-N 0.000 description 1
- SCQBNMKLZVCXNX-ZFWWWQNUSA-N Trp-Arg-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N SCQBNMKLZVCXNX-ZFWWWQNUSA-N 0.000 description 1
- AOAMKFFPFOPMLX-BVSLBCMMSA-N Trp-Arg-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=CC=C1 AOAMKFFPFOPMLX-BVSLBCMMSA-N 0.000 description 1
- UTQBQJNSNXJNIH-IHPCNDPISA-N Trp-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N UTQBQJNSNXJNIH-IHPCNDPISA-N 0.000 description 1
- GTNCSPKYWCJZAC-XIRDDKMYSA-N Trp-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N GTNCSPKYWCJZAC-XIRDDKMYSA-N 0.000 description 1
- HJTYJQVRIQXMHM-XIRDDKMYSA-N Trp-Asp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N HJTYJQVRIQXMHM-XIRDDKMYSA-N 0.000 description 1
- LTLBNCDNXQCOLB-UBHSHLNASA-N Trp-Asp-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 LTLBNCDNXQCOLB-UBHSHLNASA-N 0.000 description 1
- DXHHCIYKHRKBOC-BHYGNILZSA-N Trp-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O DXHHCIYKHRKBOC-BHYGNILZSA-N 0.000 description 1
- AWYXDHQQFPZJNE-QEJZJMRPSA-N Trp-Gln-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N AWYXDHQQFPZJNE-QEJZJMRPSA-N 0.000 description 1
- OENGVSDBQHHGBU-QEJZJMRPSA-N Trp-Glu-Asn Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OENGVSDBQHHGBU-QEJZJMRPSA-N 0.000 description 1
- HRKOLWXWQSDMSK-XIRDDKMYSA-N Trp-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N HRKOLWXWQSDMSK-XIRDDKMYSA-N 0.000 description 1
- DVWAIHZOPSYMSJ-ZVZYQTTQSA-N Trp-Glu-Val Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O)=CNC2=C1 DVWAIHZOPSYMSJ-ZVZYQTTQSA-N 0.000 description 1
- FEZASNVQLJQBHW-CABZTGNLSA-N Trp-Gly-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O)=CNC2=C1 FEZASNVQLJQBHW-CABZTGNLSA-N 0.000 description 1
- RPVDDQYNBOVWLR-HOCLYGCPSA-N Trp-Gly-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O RPVDDQYNBOVWLR-HOCLYGCPSA-N 0.000 description 1
- LFMMXTLRXKBPMC-FDARSICLSA-N Trp-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N LFMMXTLRXKBPMC-FDARSICLSA-N 0.000 description 1
- CXPJPTFWKXNDKV-NUTKFTJISA-N Trp-Leu-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CXPJPTFWKXNDKV-NUTKFTJISA-N 0.000 description 1
- IQXWAJUIAQLZNX-IHPCNDPISA-N Trp-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N IQXWAJUIAQLZNX-IHPCNDPISA-N 0.000 description 1
- CCZXBOFIBYQLEV-IHPCNDPISA-N Trp-Leu-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(O)=O CCZXBOFIBYQLEV-IHPCNDPISA-N 0.000 description 1
- RWAYYYOZMHMEGD-XIRDDKMYSA-N Trp-Leu-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 RWAYYYOZMHMEGD-XIRDDKMYSA-N 0.000 description 1
- OSYOKZZRVGUDMO-HSCHXYMDSA-N Trp-Lys-Ile Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OSYOKZZRVGUDMO-HSCHXYMDSA-N 0.000 description 1
- RERRMBXDSFMBQE-ZFWWWQNUSA-N Trp-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RERRMBXDSFMBQE-ZFWWWQNUSA-N 0.000 description 1
- BOESUSAIMQGVJD-RYQLBKOJSA-N Trp-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N BOESUSAIMQGVJD-RYQLBKOJSA-N 0.000 description 1
- WMIUTJPFHMMUGY-ZFWWWQNUSA-N Trp-Pro-Gly Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)NCC(=O)O WMIUTJPFHMMUGY-ZFWWWQNUSA-N 0.000 description 1
- QUIXRGCMQOXUSV-SZMVWBNQSA-N Trp-Pro-Pro Chemical compound O=C([C@@H]1CCCN1C(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)N1CCC[C@H]1C(O)=O QUIXRGCMQOXUSV-SZMVWBNQSA-N 0.000 description 1
- RNDWCRUOGGQDKN-UBHSHLNASA-N Trp-Ser-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RNDWCRUOGGQDKN-UBHSHLNASA-N 0.000 description 1
- UJGDFQRPYGJBEH-AAEUAGOBSA-N Trp-Ser-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N UJGDFQRPYGJBEH-AAEUAGOBSA-N 0.000 description 1
- BOBZBMOTRORUPT-XIRDDKMYSA-N Trp-Ser-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 BOBZBMOTRORUPT-XIRDDKMYSA-N 0.000 description 1
- HWCBFXAWVTXXHZ-NYVOZVTQSA-N Trp-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)N HWCBFXAWVTXXHZ-NYVOZVTQSA-N 0.000 description 1
- ABRICLFKFRFDKS-IHPCNDPISA-N Trp-Ser-Tyr Chemical compound C([C@H](NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=C(O)C=C1 ABRICLFKFRFDKS-IHPCNDPISA-N 0.000 description 1
- HTGJDTPQYFMKNC-VFAJRCTISA-N Trp-Thr-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)[C@@H](C)O)=CNC2=C1 HTGJDTPQYFMKNC-VFAJRCTISA-N 0.000 description 1
- UPUNWAXSLPBMRK-XTWBLICNSA-N Trp-Thr-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UPUNWAXSLPBMRK-XTWBLICNSA-N 0.000 description 1
- WTRQBSSQBKRNKV-MNSWYVGCSA-N Trp-Thr-Tyr Chemical compound C([C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)[C@H](O)C)C(O)=O)C1=CC=C(O)C=C1 WTRQBSSQBKRNKV-MNSWYVGCSA-N 0.000 description 1
- STKZKWFOKOCSLW-UMPQAUOISA-N Trp-Thr-Val Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)[C@@H](C)O)=CNC2=C1 STKZKWFOKOCSLW-UMPQAUOISA-N 0.000 description 1
- CRCHQCUINSOGFD-JBACZVJFSA-N Trp-Tyr-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N CRCHQCUINSOGFD-JBACZVJFSA-N 0.000 description 1
- 108060008682 Tumor Necrosis Factor Proteins 0.000 description 1
- 102100031988 Tumor necrosis factor ligand superfamily member 6 Human genes 0.000 description 1
- 108050002568 Tumor necrosis factor ligand superfamily member 6 Proteins 0.000 description 1
- ZWZOCUWOXSDYFZ-CQDKDKBSSA-N Tyr-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZWZOCUWOXSDYFZ-CQDKDKBSSA-N 0.000 description 1
- OOEUVMFKKZYSRX-LEWSCRJBSA-N Tyr-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OOEUVMFKKZYSRX-LEWSCRJBSA-N 0.000 description 1
- NOXKHHXSHQFSGJ-FQPOAREZSA-N Tyr-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NOXKHHXSHQFSGJ-FQPOAREZSA-N 0.000 description 1
- AKFLVKKWVZMFOT-IHRRRGAJSA-N Tyr-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AKFLVKKWVZMFOT-IHRRRGAJSA-N 0.000 description 1
- HTHCZRWCFXMENJ-KKUMJFAQSA-N Tyr-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HTHCZRWCFXMENJ-KKUMJFAQSA-N 0.000 description 1
- IIJWXEUNETVJPV-IHRRRGAJSA-N Tyr-Arg-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N)O IIJWXEUNETVJPV-IHRRRGAJSA-N 0.000 description 1
- AYPAIRCDLARHLM-KKUMJFAQSA-N Tyr-Asn-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O AYPAIRCDLARHLM-KKUMJFAQSA-N 0.000 description 1
- SCCKSNREWHMKOJ-SRVKXCTJSA-N Tyr-Asn-Ser Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O SCCKSNREWHMKOJ-SRVKXCTJSA-N 0.000 description 1
- NSTPFWRAIDTNGH-BZSNNMDCSA-N Tyr-Asn-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NSTPFWRAIDTNGH-BZSNNMDCSA-N 0.000 description 1
- JRXKIVGWMMIIOF-YDHLFZDLSA-N Tyr-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JRXKIVGWMMIIOF-YDHLFZDLSA-N 0.000 description 1
- BEIGSKUPTIFYRZ-SRVKXCTJSA-N Tyr-Asp-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O BEIGSKUPTIFYRZ-SRVKXCTJSA-N 0.000 description 1
- YGKVNUAKYPGORG-AVGNSLFASA-N Tyr-Asp-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YGKVNUAKYPGORG-AVGNSLFASA-N 0.000 description 1
- YRBHLWWGSSQICE-IHRRRGAJSA-N Tyr-Asp-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O YRBHLWWGSSQICE-IHRRRGAJSA-N 0.000 description 1
- VFJIWSJKZJTQII-SRVKXCTJSA-N Tyr-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VFJIWSJKZJTQII-SRVKXCTJSA-N 0.000 description 1
- MNMYOSZWCKYEDI-JRQIVUDYSA-N Tyr-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MNMYOSZWCKYEDI-JRQIVUDYSA-N 0.000 description 1
- UABYBEBXFFNCIR-YDHLFZDLSA-N Tyr-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UABYBEBXFFNCIR-YDHLFZDLSA-N 0.000 description 1
- BODHJXJNRVRKFA-BZSNNMDCSA-N Tyr-Cys-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BODHJXJNRVRKFA-BZSNNMDCSA-N 0.000 description 1
- RIJPHPUJRLEOAK-JYJNAYRXSA-N Tyr-Gln-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O RIJPHPUJRLEOAK-JYJNAYRXSA-N 0.000 description 1
- KEHKBBUYZWAMHL-DZKIICNBSA-N Tyr-Gln-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O KEHKBBUYZWAMHL-DZKIICNBSA-N 0.000 description 1
- NQJDICVXXIMMMB-XDTLVQLUSA-N Tyr-Glu-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O NQJDICVXXIMMMB-XDTLVQLUSA-N 0.000 description 1
- LOOCQRRBKZTPKO-AVGNSLFASA-N Tyr-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LOOCQRRBKZTPKO-AVGNSLFASA-N 0.000 description 1
- HKYTWJOWZTWBQB-AVGNSLFASA-N Tyr-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HKYTWJOWZTWBQB-AVGNSLFASA-N 0.000 description 1
- NZFCWALTLNFHHC-JYJNAYRXSA-N Tyr-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NZFCWALTLNFHHC-JYJNAYRXSA-N 0.000 description 1
- ZRPLVTZTKPPSBT-AVGNSLFASA-N Tyr-Glu-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZRPLVTZTKPPSBT-AVGNSLFASA-N 0.000 description 1
- JKUZFODWJGEQAP-KBPBESRZSA-N Tyr-Gly-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O JKUZFODWJGEQAP-KBPBESRZSA-N 0.000 description 1
- QAYSODICXVZUIA-WLTAIBSBSA-N Tyr-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QAYSODICXVZUIA-WLTAIBSBSA-N 0.000 description 1
- NXRGXTBPMOGFID-CFMVVWHZSA-N Tyr-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O NXRGXTBPMOGFID-CFMVVWHZSA-N 0.000 description 1
- GGXUDPQWAWRINY-XEGUGMAKSA-N Tyr-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GGXUDPQWAWRINY-XEGUGMAKSA-N 0.000 description 1
- GULIUBBXCYPDJU-CQDKDKBSSA-N Tyr-Leu-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 GULIUBBXCYPDJU-CQDKDKBSSA-N 0.000 description 1
- QSFJHIRIHOJRKS-ULQDDVLXSA-N Tyr-Leu-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QSFJHIRIHOJRKS-ULQDDVLXSA-N 0.000 description 1
- BSCBBPKDVOZICB-KKUMJFAQSA-N Tyr-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BSCBBPKDVOZICB-KKUMJFAQSA-N 0.000 description 1
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 1
- KSCVLGXNQXKUAR-JYJNAYRXSA-N Tyr-Leu-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KSCVLGXNQXKUAR-JYJNAYRXSA-N 0.000 description 1
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 1
- YKCXQOBTISTQJD-BZSNNMDCSA-N Tyr-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YKCXQOBTISTQJD-BZSNNMDCSA-N 0.000 description 1
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 1
- OLYXUGBVBGSZDN-ACRUOGEOSA-N Tyr-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 OLYXUGBVBGSZDN-ACRUOGEOSA-N 0.000 description 1
- CDKZJGMPZHPAJC-ULQDDVLXSA-N Tyr-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDKZJGMPZHPAJC-ULQDDVLXSA-N 0.000 description 1
- GYBVHTWOQJMYAM-HRCADAONSA-N Tyr-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N GYBVHTWOQJMYAM-HRCADAONSA-N 0.000 description 1
- FGVFBDZSGQTYQX-UFYCRDLUSA-N Tyr-Phe-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O FGVFBDZSGQTYQX-UFYCRDLUSA-N 0.000 description 1
- CDBXVDXSLPLFMD-BPNCWPANSA-N Tyr-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDBXVDXSLPLFMD-BPNCWPANSA-N 0.000 description 1
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 1
- SZEIFUXUTBBQFQ-STQMWFEESA-N Tyr-Pro-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SZEIFUXUTBBQFQ-STQMWFEESA-N 0.000 description 1
- SOAUMCDLIUGXJJ-SRVKXCTJSA-N Tyr-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O SOAUMCDLIUGXJJ-SRVKXCTJSA-N 0.000 description 1
- ZPFLBLFITJCBTP-QWRGUYRKSA-N Tyr-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O ZPFLBLFITJCBTP-QWRGUYRKSA-N 0.000 description 1
- NHOVZGFNTGMYMI-KKUMJFAQSA-N Tyr-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NHOVZGFNTGMYMI-KKUMJFAQSA-N 0.000 description 1
- SYFHQHYTNCQCCN-MELADBBJSA-N Tyr-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O SYFHQHYTNCQCCN-MELADBBJSA-N 0.000 description 1
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 1
- AKKYBQGHUAWPJR-MNSWYVGCSA-N Tyr-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)O AKKYBQGHUAWPJR-MNSWYVGCSA-N 0.000 description 1
- YMZYSCDRTXEOKD-IHPCNDPISA-N Tyr-Trp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N YMZYSCDRTXEOKD-IHPCNDPISA-N 0.000 description 1
- MWUYSCVVPVITMW-IGNZVWTISA-N Tyr-Tyr-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 MWUYSCVVPVITMW-IGNZVWTISA-N 0.000 description 1
- GZWPQZDVTBZVEP-BZSNNMDCSA-N Tyr-Tyr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O GZWPQZDVTBZVEP-BZSNNMDCSA-N 0.000 description 1
- DJIJBQYBDKGDIS-JYJNAYRXSA-N Tyr-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O DJIJBQYBDKGDIS-JYJNAYRXSA-N 0.000 description 1
- 108091000117 Tyrosine 3-Monooxygenase Proteins 0.000 description 1
- 102000048218 Tyrosine 3-monooxygenases Human genes 0.000 description 1
- 201000006704 Ulcerative Colitis Diseases 0.000 description 1
- 101710135104 Uncharacterized protein p6 Proteins 0.000 description 1
- 101150004676 VGF gene Proteins 0.000 description 1
- 206010046865 Vaccinia virus infection Diseases 0.000 description 1
- FZSPNKUFROZBSG-ZKWXMUAHSA-N Val-Ala-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O FZSPNKUFROZBSG-ZKWXMUAHSA-N 0.000 description 1
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 1
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 1
- JFAWZADYPRMRCO-UBHSHLNASA-N Val-Ala-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JFAWZADYPRMRCO-UBHSHLNASA-N 0.000 description 1
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 1
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 1
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 1
- NWDOPHYLSORNEX-QXEWZRGKSA-N Val-Asn-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N NWDOPHYLSORNEX-QXEWZRGKSA-N 0.000 description 1
- IDKGBVZGNTYYCC-QXEWZRGKSA-N Val-Asn-Pro Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(O)=O IDKGBVZGNTYYCC-QXEWZRGKSA-N 0.000 description 1
- PVPAOIGJYHVWBT-KKHAAJSZSA-N Val-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N)O PVPAOIGJYHVWBT-KKHAAJSZSA-N 0.000 description 1
- IQQYYFPCWKWUHW-YDHLFZDLSA-N Val-Asn-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N IQQYYFPCWKWUHW-YDHLFZDLSA-N 0.000 description 1
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 1
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 1
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 1
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 1
- HIZMLPKDJAXDRG-FXQIFTODSA-N Val-Cys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N HIZMLPKDJAXDRG-FXQIFTODSA-N 0.000 description 1
- XJFXZQKJQGYFMM-GUBZILKMSA-N Val-Cys-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)O)N XJFXZQKJQGYFMM-GUBZILKMSA-N 0.000 description 1
- CFSSLXZJEMERJY-NRPADANISA-N Val-Gln-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CFSSLXZJEMERJY-NRPADANISA-N 0.000 description 1
- XTAUQCGQFJQGEJ-NHCYSSNCSA-N Val-Gln-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XTAUQCGQFJQGEJ-NHCYSSNCSA-N 0.000 description 1
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 1
- PGBJAZDAEWPDAA-NHCYSSNCSA-N Val-Gln-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N PGBJAZDAEWPDAA-NHCYSSNCSA-N 0.000 description 1
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 1
- CVIXTAITYJQMPE-LAEOZQHASA-N Val-Glu-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CVIXTAITYJQMPE-LAEOZQHASA-N 0.000 description 1
- VLDMQVZZWDOKQF-AUTRQRHGSA-N Val-Glu-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VLDMQVZZWDOKQF-AUTRQRHGSA-N 0.000 description 1
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 1
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 1
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 1
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 1
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 1
- NXRAUQGGHPCJIB-RCOVLWMOSA-N Val-Gly-Asn Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O NXRAUQGGHPCJIB-RCOVLWMOSA-N 0.000 description 1
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 1
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 1
- MDYSKHBSPXUOPV-JSGCOSHPSA-N Val-Gly-Phe Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MDYSKHBSPXUOPV-JSGCOSHPSA-N 0.000 description 1
- WJVLTYSHNXRCLT-NHCYSSNCSA-N Val-His-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WJVLTYSHNXRCLT-NHCYSSNCSA-N 0.000 description 1
- SDSCOOZQQGUQFC-GVXVVHGQSA-N Val-His-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N SDSCOOZQQGUQFC-GVXVVHGQSA-N 0.000 description 1
- PTFPUAXGIKTVNN-ONGXEEELSA-N Val-His-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N PTFPUAXGIKTVNN-ONGXEEELSA-N 0.000 description 1
- KVRLNEILGGVBJX-IHRRRGAJSA-N Val-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CN=CN1 KVRLNEILGGVBJX-IHRRRGAJSA-N 0.000 description 1
- XBRMBDFYOFARST-AVGNSLFASA-N Val-His-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N XBRMBDFYOFARST-AVGNSLFASA-N 0.000 description 1
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 1
- KDKLLPMFFGYQJD-CYDGBPFRSA-N Val-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N KDKLLPMFFGYQJD-CYDGBPFRSA-N 0.000 description 1
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 1
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 1
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 1
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 1
- XTDDIVQWDXMRJL-IHRRRGAJSA-N Val-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N XTDDIVQWDXMRJL-IHRRRGAJSA-N 0.000 description 1
- ZZGPVSZDZQRJQY-ULQDDVLXSA-N Val-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](Cc1ccccc1)C(O)=O ZZGPVSZDZQRJQY-ULQDDVLXSA-N 0.000 description 1
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 1
- WDIWOIRFNMLNKO-ULQDDVLXSA-N Val-Leu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WDIWOIRFNMLNKO-ULQDDVLXSA-N 0.000 description 1
- GVJUTBOZZBTBIG-AVGNSLFASA-N Val-Lys-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N GVJUTBOZZBTBIG-AVGNSLFASA-N 0.000 description 1
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 1
- MLADEWAIYAPAAU-IHRRRGAJSA-N Val-Lys-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N MLADEWAIYAPAAU-IHRRRGAJSA-N 0.000 description 1
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 1
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 1
- HPANGHISDXDUQY-ULQDDVLXSA-N Val-Lys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HPANGHISDXDUQY-ULQDDVLXSA-N 0.000 description 1
- CXWJFWAZIVWBOS-XQQFMLRXSA-N Val-Lys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CXWJFWAZIVWBOS-XQQFMLRXSA-N 0.000 description 1
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 1
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 1
- OFQGGTGZTOTLGH-NHCYSSNCSA-N Val-Met-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N OFQGGTGZTOTLGH-NHCYSSNCSA-N 0.000 description 1
- SBJCTAZFSZXWSR-AVGNSLFASA-N Val-Met-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SBJCTAZFSZXWSR-AVGNSLFASA-N 0.000 description 1
- PWCJARIQERIIGF-BZSNNMDCSA-N Val-Met-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N PWCJARIQERIIGF-BZSNNMDCSA-N 0.000 description 1
- NZGOVKLVQNOEKP-YDHLFZDLSA-N Val-Phe-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NZGOVKLVQNOEKP-YDHLFZDLSA-N 0.000 description 1
- UZFNHAXYMICTBU-DZKIICNBSA-N Val-Phe-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UZFNHAXYMICTBU-DZKIICNBSA-N 0.000 description 1
- YLRAFVVWZRSZQC-DZKIICNBSA-N Val-Phe-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YLRAFVVWZRSZQC-DZKIICNBSA-N 0.000 description 1
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 1
- ZEBRMWPTJNHXAJ-JYJNAYRXSA-N Val-Phe-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)O)N ZEBRMWPTJNHXAJ-JYJNAYRXSA-N 0.000 description 1
- BCBFMJYTNKDALA-UFYCRDLUSA-N Val-Phe-Phe Chemical compound N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O BCBFMJYTNKDALA-UFYCRDLUSA-N 0.000 description 1
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 1
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 1
- JMCOXFSCTGKLLB-FKBYEOEOSA-N Val-Phe-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N JMCOXFSCTGKLLB-FKBYEOEOSA-N 0.000 description 1
- ZXYPHBKIZLAQTL-QXEWZRGKSA-N Val-Pro-Asp Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N ZXYPHBKIZLAQTL-QXEWZRGKSA-N 0.000 description 1
- GQMNEJMFMCJJTD-NHCYSSNCSA-N Val-Pro-Gln Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O GQMNEJMFMCJJTD-NHCYSSNCSA-N 0.000 description 1
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 1
- USLVEJAHTBLSIL-CYDGBPFRSA-N Val-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C USLVEJAHTBLSIL-CYDGBPFRSA-N 0.000 description 1
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 1
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 1
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 1
- JQTYTBPCSOAZHI-FXQIFTODSA-N Val-Ser-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N JQTYTBPCSOAZHI-FXQIFTODSA-N 0.000 description 1
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 1
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 1
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 1
- SDHZOOIGIUEPDY-JYJNAYRXSA-N Val-Ser-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 SDHZOOIGIUEPDY-JYJNAYRXSA-N 0.000 description 1
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 1
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 1
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 1
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 1
- DVLWZWNAQUBZBC-ZNSHCXBVSA-N Val-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N)O DVLWZWNAQUBZBC-ZNSHCXBVSA-N 0.000 description 1
- LNWSJGJCLFUNTN-ZOBUZTSGSA-N Val-Trp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LNWSJGJCLFUNTN-ZOBUZTSGSA-N 0.000 description 1
- HOZAIQIEJTWWDG-HJOGWXRNSA-N Val-Trp-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)N HOZAIQIEJTWWDG-HJOGWXRNSA-N 0.000 description 1
- JXCOEPXCBVCTRD-JYJNAYRXSA-N Val-Tyr-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JXCOEPXCBVCTRD-JYJNAYRXSA-N 0.000 description 1
- DOBHJKVVACOQTN-DZKIICNBSA-N Val-Tyr-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 DOBHJKVVACOQTN-DZKIICNBSA-N 0.000 description 1
- GUIYPEKUEMQBIK-JSGCOSHPSA-N Val-Tyr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(O)=O GUIYPEKUEMQBIK-JSGCOSHPSA-N 0.000 description 1
- PGBMPFKFKXYROZ-UFYCRDLUSA-N Val-Tyr-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N PGBMPFKFKXYROZ-UFYCRDLUSA-N 0.000 description 1
- BGTDGENDNWGMDQ-KJEVXHAQSA-N Val-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N)O BGTDGENDNWGMDQ-KJEVXHAQSA-N 0.000 description 1
- LMVWCLDJNSBOEA-FKBYEOEOSA-N Val-Tyr-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N LMVWCLDJNSBOEA-FKBYEOEOSA-N 0.000 description 1
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 1
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 1
- ZHWZDZFWBXWPDW-GUBZILKMSA-N Val-Val-Cys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O ZHWZDZFWBXWPDW-GUBZILKMSA-N 0.000 description 1
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 1
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 1
- 241000701067 Varicellovirus Species 0.000 description 1
- 241000870995 Variola Species 0.000 description 1
- 206010047115 Vasculitis Diseases 0.000 description 1
- 102100038388 Vasoactive intestinal polypeptide receptor 1 Human genes 0.000 description 1
- 241000711975 Vesicular stomatitis virus Species 0.000 description 1
- 241000711970 Vesiculovirus Species 0.000 description 1
- 101000711318 Vibrio alginolyticus Uncharacterized 11.6 kDa protein in scrR 3'region Proteins 0.000 description 1
- 108010015780 Viral Core Proteins Proteins 0.000 description 1
- 101800001476 Viral genome-linked protein Proteins 0.000 description 1
- 208000028227 Viral hemorrhagic fever Diseases 0.000 description 1
- 101710108545 Viral protein 1 Proteins 0.000 description 1
- 208000005466 Western Equine Encephalomyelitis Diseases 0.000 description 1
- 201000005806 Western equine encephalitis Diseases 0.000 description 1
- 241000710951 Western equine encephalitis virus Species 0.000 description 1
- 102100022748 Wilms tumor protein Human genes 0.000 description 1
- 101710127857 Wilms tumor protein Proteins 0.000 description 1
- 241001492404 Woodchuck hepatitis virus Species 0.000 description 1
- 241000120645 Yellow fever virus group Species 0.000 description 1
- 241000607734 Yersinia <bacteria> Species 0.000 description 1
- 206010061418 Zygomycosis Diseases 0.000 description 1
- 241000606834 [Haemophilus] ducreyi Species 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 150000007513 acids Chemical class 0.000 description 1
- 201000007691 actinomycosis Diseases 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 239000004480 active ingredient Substances 0.000 description 1
- 239000000488 activin Substances 0.000 description 1
- 230000001154 acute effect Effects 0.000 description 1
- 208000012873 acute gastroenteritis Diseases 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 230000006838 adverse reaction Effects 0.000 description 1
- 108010087049 alanyl-alanyl-prolyl-valine Proteins 0.000 description 1
- 108010069490 alanyl-glycyl-seryl-glutamic acid Proteins 0.000 description 1
- 108010045023 alanyl-prolyl-tyrosine Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- 102000015395 alpha 1-Antitrypsin Human genes 0.000 description 1
- 108010050122 alpha 1-Antitrypsin Proteins 0.000 description 1
- 229940024142 alpha 1-antitrypsin Drugs 0.000 description 1
- UPEZCKBFRMILAV-UHFFFAOYSA-N alpha-Ecdysone Natural products C1C(O)C(O)CC2(C)C(CCC3(C(C(C(O)CCC(C)(C)O)C)CCC33O)C)C3=CC(=O)C21 UPEZCKBFRMILAV-UHFFFAOYSA-N 0.000 description 1
- 229940037003 alum Drugs 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 230000005809 anti-tumor immunity Effects 0.000 description 1
- 238000011394 anticancer treatment Methods 0.000 description 1
- 230000000890 antigenic effect Effects 0.000 description 1
- 101150067977 ap gene Proteins 0.000 description 1
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 1
- 108010089442 arginyl-leucyl-alanyl-arginine Proteins 0.000 description 1
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 1
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 1
- 108010021908 aspartyl-aspartyl-glutamyl-aspartic acid Proteins 0.000 description 1
- FZCSTZYAHCUGEM-UHFFFAOYSA-N aspergillomarasmine B Natural products OC(=O)CNC(C(O)=O)CNC(C(O)=O)CC(O)=O FZCSTZYAHCUGEM-UHFFFAOYSA-N 0.000 description 1
- 230000005784 autoimmunity Effects 0.000 description 1
- 210000003719 b-lymphocyte Anatomy 0.000 description 1
- 201000008680 babesiosis Diseases 0.000 description 1
- 229940065181 bacillus anthracis Drugs 0.000 description 1
- 244000052616 bacterial pathogen Species 0.000 description 1
- 230000010310 bacterial transformation Effects 0.000 description 1
- 206010004145 bartonellosis Diseases 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- 108010005774 beta-Galactosidase Proteins 0.000 description 1
- 102000006995 beta-Glucosidase Human genes 0.000 description 1
- 108010047754 beta-Glucosidase Proteins 0.000 description 1
- 229920000249 biocompatible polymer Polymers 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 208000003836 bluetongue Diseases 0.000 description 1
- 230000037396 body weight Effects 0.000 description 1
- 229940074375 burkholderia mallei Drugs 0.000 description 1
- 239000006227 byproduct Substances 0.000 description 1
- AIYUHDOJVYHVIT-UHFFFAOYSA-M caesium chloride Chemical compound [Cl-].[Cs+] AIYUHDOJVYHVIT-UHFFFAOYSA-M 0.000 description 1
- 201000011510 cancer Diseases 0.000 description 1
- 239000012830 cancer therapeutic Substances 0.000 description 1
- 201000003984 candidiasis Diseases 0.000 description 1
- 208000014058 canine distemper Diseases 0.000 description 1
- 125000003917 carbamoyl group Chemical group [H]N([H])C(*)=O 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 150000001768 cations Chemical class 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 201000004308 chancroid Diseases 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 235000013330 chicken meat Nutrition 0.000 description 1
- 235000012000 cholesterol Nutrition 0.000 description 1
- 238000011260 co-administration Methods 0.000 description 1
- 238000012761 co-transfection Methods 0.000 description 1
- 201000003486 coccidioidomycosis Diseases 0.000 description 1
- 239000002131 composite material Substances 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 201000003740 cowpox Diseases 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- ILRYLPWNYFXEMH-UHFFFAOYSA-N cystathionine Chemical compound OC(=O)C(N)CCSCC(N)C(O)=O ILRYLPWNYFXEMH-UHFFFAOYSA-N 0.000 description 1
- 230000001461 cytolytic effect Effects 0.000 description 1
- 230000001086 cytosolic effect Effects 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 208000025729 dengue disease Diseases 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 201000001981 dermatomyositis Diseases 0.000 description 1
- 108010033011 des-Arg- enterostatin Proteins 0.000 description 1
- UREBDLICKHMUKA-CXSFZGCWSA-N dexamethasone Chemical compound C1CC2=CC(=O)C=C[C@]2(C)[C@]2(F)[C@@H]1[C@@H]1C[C@@H](C)[C@@](C(=O)CO)(O)[C@@]1(C)C[C@@H]2O UREBDLICKHMUKA-CXSFZGCWSA-N 0.000 description 1
- 229960003957 dexamethasone Drugs 0.000 description 1
- 102000004419 dihydrofolate reductase Human genes 0.000 description 1
- 108010054813 diprotin B Proteins 0.000 description 1
- 231100000676 disease causative agent Toxicity 0.000 description 1
- 239000000386 donor Substances 0.000 description 1
- 231100000673 dose–response relationship Toxicity 0.000 description 1
- UPEZCKBFRMILAV-JMZLNJERSA-N ecdysone Chemical compound C1[C@@H](O)[C@@H](O)C[C@]2(C)[C@@H](CC[C@@]3([C@@H]([C@@H]([C@H](O)CCC(C)(C)O)C)CC[C@]33O)C)C3=CC(=O)[C@@H]21 UPEZCKBFRMILAV-JMZLNJERSA-N 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 206010014599 encephalitis Diseases 0.000 description 1
- 239000012645 endogenous antigen Substances 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 102000012803 ephrin Human genes 0.000 description 1
- 108060002566 ephrin Proteins 0.000 description 1
- 102000015694 estrogen receptors Human genes 0.000 description 1
- 108010038795 estrogen receptors Proteins 0.000 description 1
- 229960004222 factor ix Drugs 0.000 description 1
- 229960000301 factor viii Drugs 0.000 description 1
- 239000012894 fetal calf serum Substances 0.000 description 1
- 108700014844 flt3 ligand Proteins 0.000 description 1
- 102000034287 fluorescent proteins Human genes 0.000 description 1
- 108091006047 fluorescent proteins Proteins 0.000 description 1
- 229940014144 folate Drugs 0.000 description 1
- OVBPIULPVIDEAO-LBPRGKRZSA-N folic acid Chemical compound C=1N=C2NC(N)=NC(=O)C2=NC=1CNC1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 OVBPIULPVIDEAO-LBPRGKRZSA-N 0.000 description 1
- 235000019152 folic acid Nutrition 0.000 description 1
- 239000011724 folic acid Substances 0.000 description 1
- 229940118764 francisella tularensis Drugs 0.000 description 1
- 230000008014 freezing Effects 0.000 description 1
- 238000007710 freezing Methods 0.000 description 1
- 244000053095 fungal pathogen Species 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 238000001476 gene delivery Methods 0.000 description 1
- 230000007274 generation of a signal involved in cell-cell signaling Effects 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 201000006592 giardiasis Diseases 0.000 description 1
- MASNOZXLGMXCHN-ZLPAWPGGSA-N glucagon Chemical compound C([C@@H](C(=O)N[C@H](C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O)C(C)C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CO)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC=1NC=NC=1)[C@@H](C)O)[C@@H](C)O)C1=CC=CC=C1 MASNOZXLGMXCHN-ZLPAWPGGSA-N 0.000 description 1
- 229960004666 glucagon Drugs 0.000 description 1
- 229940116332 glucose oxidase Drugs 0.000 description 1
- 235000019420 glucose oxidase Nutrition 0.000 description 1
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 1
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 1
- 108010079547 glutamylmethionine Proteins 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 1
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 1
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 1
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 1
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 1
- 108010023364 glycyl-histidyl-arginine Proteins 0.000 description 1
- 108010033719 glycyl-histidyl-glycine Proteins 0.000 description 1
- 108010054666 glycyl-leucyl-glycyl-glycine Proteins 0.000 description 1
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 1
- 108010079413 glycyl-prolyl-glutamic acid Proteins 0.000 description 1
- 108010059898 glycyl-tyrosyl-lysine Proteins 0.000 description 1
- 108010084389 glycyltryptophan Proteins 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 244000000013 helminth Species 0.000 description 1
- 239000000185 hemagglutinin Substances 0.000 description 1
- 230000002440 hepatic effect Effects 0.000 description 1
- 208000006454 hepatitis Diseases 0.000 description 1
- 231100000283 hepatitis Toxicity 0.000 description 1
- 108010028295 histidylhistidine Proteins 0.000 description 1
- 102000056245 human TLE5 Human genes 0.000 description 1
- 230000028996 humoral immune response Effects 0.000 description 1
- 239000000852 hydrogen donor Substances 0.000 description 1
- 238000002991 immunohistochemical analysis Methods 0.000 description 1
- 238000003364 immunohistochemistry Methods 0.000 description 1
- 238000010324 immunological assay Methods 0.000 description 1
- 230000001976 improved effect Effects 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 230000001524 infective effect Effects 0.000 description 1
- 230000003960 inflammatory cascade Effects 0.000 description 1
- 206010022000 influenza Diseases 0.000 description 1
- 239000000893 inhibin Substances 0.000 description 1
- 229940079322 interferon Drugs 0.000 description 1
- 229940047124 interferons Drugs 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 238000010255 intramuscular injection Methods 0.000 description 1
- 239000007927 intramuscular injection Substances 0.000 description 1
- 238000001990 intravenous administration Methods 0.000 description 1
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 1
- 150000004715 keto acids Chemical class 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 239000004816 latex Substances 0.000 description 1
- 229920000126 latex Polymers 0.000 description 1
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 1
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 1
- 208000032839 leukemia Diseases 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 210000004698 lymphocyte Anatomy 0.000 description 1
- 208000001581 lymphogranuloma venereum Diseases 0.000 description 1
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 1
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010057952 lysyl-phenylalanyl-lysine Proteins 0.000 description 1
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 1
- 108010075702 lysyl-valyl-aspartyl-leucine Proteins 0.000 description 1
- 201000004792 malaria Diseases 0.000 description 1
- 230000003211 malignant effect Effects 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 201000004015 melioidosis Diseases 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 108010034507 methionyltryptophan Proteins 0.000 description 1
- 239000000693 micelle Substances 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 239000011859 microparticle Substances 0.000 description 1
- 239000003068 molecular probe Substances 0.000 description 1
- 229940035032 monophosphoryl lipid a Drugs 0.000 description 1
- 230000000921 morphogenic effect Effects 0.000 description 1
- 201000007524 mucormycosis Diseases 0.000 description 1
- 201000009671 multidrug-resistant tuberculosis Diseases 0.000 description 1
- 238000002887 multiple sequence alignment Methods 0.000 description 1
- 210000003098 myoblast Anatomy 0.000 description 1
- 108010081726 netrin-2 Proteins 0.000 description 1
- 230000001537 neural effect Effects 0.000 description 1
- 229930027945 nicotinamide-adenine dinucleotide Natural products 0.000 description 1
- BOPGDPNILDQYTO-NNYOXOHSSA-N nicotinamide-adenine dinucleotide Chemical compound C1=CCC(C(=O)N)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OC[C@@H]2[C@H]([C@@H](O)[C@@H](O2)N2C3=NC=NC(N)=C3N=C2)O)O1 BOPGDPNILDQYTO-NNYOXOHSSA-N 0.000 description 1
- 108700007229 noggin Proteins 0.000 description 1
- 102000045246 noggin Human genes 0.000 description 1
- NJPPVKZQTLUDBO-UHFFFAOYSA-N novaluron Chemical compound C1=C(Cl)C(OC(F)(F)C(OC(F)(F)F)F)=CC=C1NC(=O)NC(=O)C1=C(F)C=CC=C1F NJPPVKZQTLUDBO-UHFFFAOYSA-N 0.000 description 1
- 108020004017 nuclear receptors Proteins 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 201000000901 ornithosis Diseases 0.000 description 1
- 230000002018 overexpression Effects 0.000 description 1
- 208000003154 papilloma Diseases 0.000 description 1
- 244000045947 parasite Species 0.000 description 1
- 230000003071 parasitic effect Effects 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 230000009984 peri-natal effect Effects 0.000 description 1
- 108040007629 peroxidase activity proteins Proteins 0.000 description 1
- 150000002978 peroxides Chemical class 0.000 description 1
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 1
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010084572 phenylalanyl-valine Proteins 0.000 description 1
- 230000001766 physiological effect Effects 0.000 description 1
- 208000011079 pinta disease Diseases 0.000 description 1
- 229920000656 polylysine Polymers 0.000 description 1
- 208000005987 polymyositis Diseases 0.000 description 1
- 230000001124 posttranscriptional effect Effects 0.000 description 1
- 230000001323 posttranslational effect Effects 0.000 description 1
- 244000144977 poultry Species 0.000 description 1
- 235000013594 poultry meat Nutrition 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 239000003755 preservative agent Substances 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 1
- 230000000069 prophylactic effect Effects 0.000 description 1
- 238000000164 protein isolation Methods 0.000 description 1
- 208000009305 pseudorabies Diseases 0.000 description 1
- 230000002285 radioactive effect Effects 0.000 description 1
- 208000002574 reactive arthritis Diseases 0.000 description 1
- 238000010188 recombinant method Methods 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000000306 recurrent effect Effects 0.000 description 1
- 230000014493 regulation of gene expression Effects 0.000 description 1
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 1
- 230000000241 respiratory effect Effects 0.000 description 1
- 208000023504 respiratory system disease Diseases 0.000 description 1
- 210000001525 retina Anatomy 0.000 description 1
- 230000001177 retroviral effect Effects 0.000 description 1
- 108020004418 ribosomal RNA Proteins 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 239000000523 sample Substances 0.000 description 1
- 239000012898 sample dilution Substances 0.000 description 1
- 201000000306 sarcoidosis Diseases 0.000 description 1
- 108010078070 scavenger receptors Proteins 0.000 description 1
- 102000014452 scavenger receptors Human genes 0.000 description 1
- 201000004409 schistosomiasis Diseases 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 238000004062 sedimentation Methods 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000013207 serial dilution Methods 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 1
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 1
- 210000002027 skeletal muscle Anatomy 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 239000003381 stabilizer Substances 0.000 description 1
- 210000000130 stem cell Anatomy 0.000 description 1
- 108020003113 steroid hormone receptors Proteins 0.000 description 1
- 102000005969 steroid hormone receptors Human genes 0.000 description 1
- 229960005322 streptomycin Drugs 0.000 description 1
- 238000012916 structural analysis Methods 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 238000013268 sustained release Methods 0.000 description 1
- 239000012730 sustained-release form Substances 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 238000010189 synthetic method Methods 0.000 description 1
- 208000006379 syphilis Diseases 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 238000010257 thawing Methods 0.000 description 1
- 229940021747 therapeutic vaccine Drugs 0.000 description 1
- 108010001055 thymocartin Proteins 0.000 description 1
- 230000005026 transcription initiation Effects 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 230000005945 translocation Effects 0.000 description 1
- 208000003982 trichinellosis Diseases 0.000 description 1
- 201000007588 trichinosis Diseases 0.000 description 1
- 201000002311 trypanosomiasis Diseases 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- 108010015666 tryptophyl-leucyl-glutamic acid Proteins 0.000 description 1
- 108010084932 tryptophyl-proline Proteins 0.000 description 1
- 102000003390 tumor necrosis factor Human genes 0.000 description 1
- 206010061393 typhus Diseases 0.000 description 1
- 108010029599 tyrosyl-glutamyl-tryptophan Proteins 0.000 description 1
- 108010078580 tyrosylleucine Proteins 0.000 description 1
- 241000724775 unclassified viruses Species 0.000 description 1
- 241001148471 unidentified anaerobic bacterium Species 0.000 description 1
- 241001529453 unidentified herpesvirus Species 0.000 description 1
- 241000712461 unidentified influenza virus Species 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- VBEQCZHXXJYVRD-GACYYNSASA-N uroanthelone Chemical compound C([C@@H](C(=O)N[C@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)C(C)C)[C@@H](C)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CC=1NC=NC=1)NC(=O)[C@H](CCSC)NC(=O)[C@H](CS)NC(=O)[C@@H](NC(=O)CNC(=O)CNC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CS)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CS)NC(=O)CNC(=O)[C@H]1N(CCC1)C(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O)C(C)C)[C@@H](C)CC)C1=CC=C(O)C=C1 VBEQCZHXXJYVRD-GACYYNSASA-N 0.000 description 1
- 208000007089 vaccinia Diseases 0.000 description 1
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 1
- 108010072644 valyl-alanyl-prolyl-glycine Proteins 0.000 description 1
- 201000006266 variola major Diseases 0.000 description 1
- 108700026220 vif Genes Proteins 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
- 102000009310 vitamin D receptors Human genes 0.000 description 1
- 108050000156 vitamin D receptors Proteins 0.000 description 1
- 101150040614 vpx gene Proteins 0.000 description 1
- 230000003442 weekly effect Effects 0.000 description 1
- 201000009482 yaws Diseases 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
- 229940118695 yersinia pestis Drugs 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P43/00—Drugs for specific purposes, not provided for in groups A61P1/00-A61P41/00
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/10011—Adenoviridae
- C12N2710/10311—Mastadenovirus, e.g. human or simian adenoviruses
- C12N2710/10322—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
Landscapes
- Chemical & Material Sciences (AREA)
- Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Medicinal Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Virology (AREA)
- Biophysics (AREA)
- Molecular Biology (AREA)
- Biochemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Gastroenterology & Hepatology (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Public Health (AREA)
- Animal Behavior & Ethology (AREA)
- Pharmacology & Pharmacy (AREA)
- Engineering & Computer Science (AREA)
- General Chemical & Material Sciences (AREA)
- Veterinary Medicine (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Peptides Or Proteins (AREA)
- Medicines Containing Material From Animals Or Micro-Organisms (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
Abstract
Chimpanzee serotype C68 proteins, peptides, and polypeptide are provided. Also provided are novel adenoviruses derived from these proteins, as well as compositions containing these proteins and methods of using same for immunization and therapy. Further, a rapid method for screening recombinant transformants using a visually detectable method is described.
Description
METHOD FOR RAPID SCREENING OF BACTERIAL TRANSFORMANTS
AND NOVEL SIMIAN ADENOVIRUS PROTEINS
BACKGROUND OF THE INVENTION
Recombinant adenoviruses have been described for gene therapy and vaccine uses.
Adenoviruses have a characteristic morphology with an icosahedral capsid consisting of three major proteins, hexon (II), penton base (III) and a knobbed fibre (IV), along with a number of other minor proteins, VI, VIII, IX, IIIa and IVa2 [W.C.
Russell, J.
Gen Virol., 81:2573-2604 (Nov 2000)]. The virus genome is a linear, double-stranded DNA
with a terminal protein attached covalently to the 5' termini, which have inverted terminal repeats (ITRs). The virus DNA is intimately associated with the highly basic protein VII and a small peptide termed mu. Another protein, V, is packaged with this DNA-protein complex and provides a structural link to the capsid via protein VI. The virus also contains a virus-encoded protease, which is necessary for processing of some of the structural proteins to produce mature infectious virus.
There continues to be a need for recombinant viral vectors and improved methods for making these vectors.
SUMMARY OF THE INVENTION
In one aspect, the invention provides a method for rapid screening of bacterial transformants. The method involves engineering a recombinant shuttle vector comprising a nucleic acid cassette containing a transgene and a nucleic acid sequence encoding prokaryotic green fluorescent protein (GFP) operably linked to regulatory sequences which permit its expression in a host cell. Thereafter, host cells are transfected with the shuttle vector and screened for expression of GFP. The absence of green color (i.e., white) is indicative of a cell carrying the recombinant virus. Expression of GFP is readily detected by the green color when activated by fluorescent light, and indicates the presence of parent virus (i.e., absence of recombinant).
In another aspect, the invention provides capsid proteins of C68, isolated from other C68 proteins, and characterized by the amino acids provided herein.
In still another embodiment, the invention provides adenoviral vectors and non-viral targeting proteins derived from the C68 capsid proteins, termed herein C68-derived constructs.
Yet other advantages of the present invention will be readily apparent from the following detailed description of the invention.
Brief Description of the Drawings Fig. 1 summarizes the genetic organization of the chimpanzee adenovirus C68 genome. In Fig. IA the genome of the C68 chimpanzee adenovirus is schematically represented by the box at the top. The inverted terminal repeats are shaded black and the early regions are shaded gray. The arrowheads above the box indicate the direction of expression of the early genes. The line below the box represents the division of the genome into 100 map units. The arrows below the line represent the five late gene regions and the proteins encoded in each region. The numbers below the box or arrows indicate the start (promoter or initiation codon) and end (canonical PolyA signal) for each region. * represents the E2A late promoter. Fig. 1B illustrates the Pstl clones; Fig. 1C
illustrates the BamHI
clones. Fig. 1 D illustrates the Hindlll clones. For parts 1 B-1 D, the unshaded regions indicate that a fragment was cloned into a plasmid vector, while the shaded regions indicate that the restriction fragment was not cloned. For each section the fragment name, alphabetical with A
being the largest fragment, and the fragment size are listed above the box and the fragment end points are listed below the box.
Fig. 2 provides a sequence alignment of the C68 hexon protein [aa 131 to 441 of SEQ
ID NO:16] with AN [SEQ ID NO:34], Ad16 [SEQ ID NO:35], Ad3 [SEQ ID NO:36], Adz [SEQ ID NO:37], and Ad2 [SEQ ID NO:38]. The deduced amino acid sequences of highly similar human adenovirus hexons were compared with the C68 chimpanzee adenovirus using CLUSTAL X. Serotypes and subgroups are indicated on the left margin, followed by the residue number. The numbering refers to the amino acid position with respect to the start of translation. Amino acids are shaded with respect to C68 to highlight sequence similarities (gray) and identities (black). The seven hypervariable regions within loop domains DEl and FGI are labeled along the bottom and correspond to the following Ad2 sequences in the alignment: HVR1, 137-188; HVR2, 194-204; HVR3, 222-229; HVR4, 258-271; HVR5, 294; HVR6, 316-327; and HVR7, 433-465 of SEQ ID NO:16. The GenBank accession numbers for the sequences shown are as follow: AAD03657 (Ad4), S37216 (Ad16), (Ad3), AAD03663 (Ad7), and NP040525 (Ad2).
Fig. 3 provides an alignment of the amino acid sequences of the fiber knob domains of chimpanzee C68 (Pan-9) [amino acids 247 to 425 of SEQ ID NO: 27] and the human adenovirus serotypes 2 [SEQ ID NO: 39] and 5[SEQ ID NO:40].
Fig. 4 provides an alignment of the amino acid sequences of the L1 and a portion of the L2 loops of the capsid hexon on the human adenovirus serotype 5 [SEQ ID
NO:41 ] and chimpanzee C68 (Pan-9) [amino acids 125 to 443 of SEQ ID NO: 16] adenovirus sequences.
The intervening conserved region is part of the pedestal domain conserved between adenovirus serotypes.
DETAILED DESCRIPTION OF THE INVENTION
The present invention provides novel adenovirus capsid proteins derived from the unique sequences of chimpanzee adenovirus C68. The capsid proteins of the invention are useful for a variety of purposes, including non-viral targeted delivery to cells and for creating recombinant viral vectors. These proteins and viral vectors are useful for delivery of heterologous molecules to target cells.
The invention further provides a novel method for rapid screening of bacterial transformants obtained during production of the novel adenoviral capsids of the invention, and during production of a variety of other viral or non-viral constructs. In this method, at least the shuttle vector is engineered to contain a marker gene, e.g., green fluorescent protein (GFP), gene under the control of a suitable promoter. The transformed cells are screened for expression of marker. In the case of GFP, white colonies are recombinants while green colonies are residual parental plasmid.
AND NOVEL SIMIAN ADENOVIRUS PROTEINS
BACKGROUND OF THE INVENTION
Recombinant adenoviruses have been described for gene therapy and vaccine uses.
Adenoviruses have a characteristic morphology with an icosahedral capsid consisting of three major proteins, hexon (II), penton base (III) and a knobbed fibre (IV), along with a number of other minor proteins, VI, VIII, IX, IIIa and IVa2 [W.C.
Russell, J.
Gen Virol., 81:2573-2604 (Nov 2000)]. The virus genome is a linear, double-stranded DNA
with a terminal protein attached covalently to the 5' termini, which have inverted terminal repeats (ITRs). The virus DNA is intimately associated with the highly basic protein VII and a small peptide termed mu. Another protein, V, is packaged with this DNA-protein complex and provides a structural link to the capsid via protein VI. The virus also contains a virus-encoded protease, which is necessary for processing of some of the structural proteins to produce mature infectious virus.
There continues to be a need for recombinant viral vectors and improved methods for making these vectors.
SUMMARY OF THE INVENTION
In one aspect, the invention provides a method for rapid screening of bacterial transformants. The method involves engineering a recombinant shuttle vector comprising a nucleic acid cassette containing a transgene and a nucleic acid sequence encoding prokaryotic green fluorescent protein (GFP) operably linked to regulatory sequences which permit its expression in a host cell. Thereafter, host cells are transfected with the shuttle vector and screened for expression of GFP. The absence of green color (i.e., white) is indicative of a cell carrying the recombinant virus. Expression of GFP is readily detected by the green color when activated by fluorescent light, and indicates the presence of parent virus (i.e., absence of recombinant).
In another aspect, the invention provides capsid proteins of C68, isolated from other C68 proteins, and characterized by the amino acids provided herein.
In still another embodiment, the invention provides adenoviral vectors and non-viral targeting proteins derived from the C68 capsid proteins, termed herein C68-derived constructs.
Yet other advantages of the present invention will be readily apparent from the following detailed description of the invention.
Brief Description of the Drawings Fig. 1 summarizes the genetic organization of the chimpanzee adenovirus C68 genome. In Fig. IA the genome of the C68 chimpanzee adenovirus is schematically represented by the box at the top. The inverted terminal repeats are shaded black and the early regions are shaded gray. The arrowheads above the box indicate the direction of expression of the early genes. The line below the box represents the division of the genome into 100 map units. The arrows below the line represent the five late gene regions and the proteins encoded in each region. The numbers below the box or arrows indicate the start (promoter or initiation codon) and end (canonical PolyA signal) for each region. * represents the E2A late promoter. Fig. 1B illustrates the Pstl clones; Fig. 1C
illustrates the BamHI
clones. Fig. 1 D illustrates the Hindlll clones. For parts 1 B-1 D, the unshaded regions indicate that a fragment was cloned into a plasmid vector, while the shaded regions indicate that the restriction fragment was not cloned. For each section the fragment name, alphabetical with A
being the largest fragment, and the fragment size are listed above the box and the fragment end points are listed below the box.
Fig. 2 provides a sequence alignment of the C68 hexon protein [aa 131 to 441 of SEQ
ID NO:16] with AN [SEQ ID NO:34], Ad16 [SEQ ID NO:35], Ad3 [SEQ ID NO:36], Adz [SEQ ID NO:37], and Ad2 [SEQ ID NO:38]. The deduced amino acid sequences of highly similar human adenovirus hexons were compared with the C68 chimpanzee adenovirus using CLUSTAL X. Serotypes and subgroups are indicated on the left margin, followed by the residue number. The numbering refers to the amino acid position with respect to the start of translation. Amino acids are shaded with respect to C68 to highlight sequence similarities (gray) and identities (black). The seven hypervariable regions within loop domains DEl and FGI are labeled along the bottom and correspond to the following Ad2 sequences in the alignment: HVR1, 137-188; HVR2, 194-204; HVR3, 222-229; HVR4, 258-271; HVR5, 294; HVR6, 316-327; and HVR7, 433-465 of SEQ ID NO:16. The GenBank accession numbers for the sequences shown are as follow: AAD03657 (Ad4), S37216 (Ad16), (Ad3), AAD03663 (Ad7), and NP040525 (Ad2).
Fig. 3 provides an alignment of the amino acid sequences of the fiber knob domains of chimpanzee C68 (Pan-9) [amino acids 247 to 425 of SEQ ID NO: 27] and the human adenovirus serotypes 2 [SEQ ID NO: 39] and 5[SEQ ID NO:40].
Fig. 4 provides an alignment of the amino acid sequences of the L1 and a portion of the L2 loops of the capsid hexon on the human adenovirus serotype 5 [SEQ ID
NO:41 ] and chimpanzee C68 (Pan-9) [amino acids 125 to 443 of SEQ ID NO: 16] adenovirus sequences.
The intervening conserved region is part of the pedestal domain conserved between adenovirus serotypes.
DETAILED DESCRIPTION OF THE INVENTION
The present invention provides novel adenovirus capsid proteins derived from the unique sequences of chimpanzee adenovirus C68. The capsid proteins of the invention are useful for a variety of purposes, including non-viral targeted delivery to cells and for creating recombinant viral vectors. These proteins and viral vectors are useful for delivery of heterologous molecules to target cells.
The invention further provides a novel method for rapid screening of bacterial transformants obtained during production of the novel adenoviral capsids of the invention, and during production of a variety of other viral or non-viral constructs. In this method, at least the shuttle vector is engineered to contain a marker gene, e.g., green fluorescent protein (GFP), gene under the control of a suitable promoter. The transformed cells are screened for expression of marker. In the case of GFP, white colonies are recombinants while green colonies are residual parental plasmid.
1. Novel Adenovirus Capsid Proteins In one aspect, the invention provides unique C68 adenoviral capsid proteins, including the C68 hexon region, the C68 penton region, and the C68 fiber region, and fragments thereof. Suitably, these capsid proteins can be substantially pure, i.e., are free of other proteins. Preferably, these proteins are at least 10% homogeneous, more preferably 60%
homogeneous, and most preferably 95% homogeneous.
In addition, the invention provides unique C68-derived capsid proteins. As used herein, a C68-derived capsid protein includes any C68 capsid protein or a fragment thereof including, without limitation, a polypeptide, peptide or a consecutive sequence of at least 8 amino acid residues unique to a C68 capsid protein and which is free of other proteins. A
C68-derived capsid protein also includes a capsid protein that contains a C68 capsid protein or fragment thereof as defined above, including, without limitation, a chimeric capsid protein, a fusion protein, an artificial capsid protein, a synthetic capsid protein, and a recombinant capsid proteins, without limitation to means of generating these proteins.
Suitably, these C68-derived capsid proteins contain one or more C68 regions or fragments thereof (e.g., a hexon) in combination with capsid regions or fragments thereof of different adenoviral serotypes, or of non-adenoviral sources, as described herein. These C68-derived capsid proteins may be used in non-viral targeting of useful molecules to cells, or for production of viral vectors, as described herein.
A "modification of a capsid protein associated with altered tropism" as used herein includes an altered capsid protein, i.e, a penton, hexon or fiber protein region, or fragment thereof, such as the knob domain of the fiber region, or a polynucleotide encoding same, such that specificity is altered.
In one embodiment, the amino acid sequences of the C68 penton protein are provided in SEQ ID NO:12: MMRRAYPEGPPPSYESVMQQAMAAAAMQPPLEAPYVPPRYLAPT
EGRNSIRYSELAPLYDTTRLYLVDNKSADIASLNYQNDHSNFLTTVVQNNDFTPTEAS
TQTINFDERSRWGGQLKTIMHTNMPNVNEFMYSNKFKARVMVSRKTPNGVTVTEDYDG
SQDELKYEWVEFELPEGNFSVTMTIDLMNNAIIDNYLAVGRQNGVLESDIGVKFDTRN
FRLGWDPVTELVMPGVYTNEAFHPDIVLLPGCGVDFTESRLSNLLGIRKRQPFQEGFQ
IMYEDLEGGNIPALLDVDAYEKSKEDAAAEATAAVATASTEVRGDNFASAAAVAAAEA
AETESKIVIQPVEKDSKNRSYNVLPDKINTAYRSWYLAYNYGDPEKGVRSWTLLTTSD
homogeneous, and most preferably 95% homogeneous.
In addition, the invention provides unique C68-derived capsid proteins. As used herein, a C68-derived capsid protein includes any C68 capsid protein or a fragment thereof including, without limitation, a polypeptide, peptide or a consecutive sequence of at least 8 amino acid residues unique to a C68 capsid protein and which is free of other proteins. A
C68-derived capsid protein also includes a capsid protein that contains a C68 capsid protein or fragment thereof as defined above, including, without limitation, a chimeric capsid protein, a fusion protein, an artificial capsid protein, a synthetic capsid protein, and a recombinant capsid proteins, without limitation to means of generating these proteins.
Suitably, these C68-derived capsid proteins contain one or more C68 regions or fragments thereof (e.g., a hexon) in combination with capsid regions or fragments thereof of different adenoviral serotypes, or of non-adenoviral sources, as described herein. These C68-derived capsid proteins may be used in non-viral targeting of useful molecules to cells, or for production of viral vectors, as described herein.
A "modification of a capsid protein associated with altered tropism" as used herein includes an altered capsid protein, i.e, a penton, hexon or fiber protein region, or fragment thereof, such as the knob domain of the fiber region, or a polynucleotide encoding same, such that specificity is altered.
In one embodiment, the amino acid sequences of the C68 penton protein are provided in SEQ ID NO:12: MMRRAYPEGPPPSYESVMQQAMAAAAMQPPLEAPYVPPRYLAPT
EGRNSIRYSELAPLYDTTRLYLVDNKSADIASLNYQNDHSNFLTTVVQNNDFTPTEAS
TQTINFDERSRWGGQLKTIMHTNMPNVNEFMYSNKFKARVMVSRKTPNGVTVTEDYDG
SQDELKYEWVEFELPEGNFSVTMTIDLMNNAIIDNYLAVGRQNGVLESDIGVKFDTRN
FRLGWDPVTELVMPGVYTNEAFHPDIVLLPGCGVDFTESRLSNLLGIRKRQPFQEGFQ
IMYEDLEGGNIPALLDVDAYEKSKEDAAAEATAAVATASTEVRGDNFASAAAVAAAEA
AETESKIVIQPVEKDSKNRSYNVLPDKINTAYRSWYLAYNYGDPEKGVRSWTLLTTSD
VTCGVEQVYWSLPDMMQDPVTFRSTRQVSNYPVVGAELLPVYSKSFFNEQAVYSQQLR
AFTSLTHVFNRFPENQILVRPPAPTITTVSENVPALTDHGTLPLRSSIRGVQRVTVTD
ARRRTCPYVYKALGIVAPRVLSSRTF.
Suitably, this penton protein, or unique fragments thereof, may be utilized for a variety of purposes. Examples of suitable fragments include the C68 penton having N-terminal and/or C-terminal truncations of about 50, 100, 150, or 200 amino acids, based upon the amino acid numbering provided above and in SEQ ID NO:12. Other suitable fragments include shorter internal, C-terminal, or N-terminal fragments. Further, the penton protein may be modified for a variety of purposes known to those of skill in the art.
The sequences of the C68 hexon are provided in SEQ ID NO:16:
MAT PSMLPQWAYMHIAGQDASEYLSPGLVQFARATDTYFSLGNK
FRNPTVAPTHDVTTDRSQRLTLRFVPVDREDNTYSYKVRYTLAVGDNRVLDMASTYFD
IRGVLDRGPSFKPYSGTAYNSLAPKGAPNTCQWTYKADGETATEKTYTYGNAPVQGIN
ITKDGIQLGTDTDDQPIYADKTYQPEPQVGDAEWHDITGTDEKYGGRALKPDTKMKPC
YGSFAKPTNKEGGQANVKTGTGTTKEYDIDMAFFDNRSAAAAGLAPEIVLYTENVDLE
TPDTHIVYKAGTDDSSSSINLGQQAMPNRPNYIGFRDNFIGLMYYNSTGNMGVLAGQA
SQLNAVVDLQDRNTELSYQLLLDSLGDRTRYFSMWNQAVDSYDPDVRIIENHGVEDEL
PNYCFPLDAVGRTDTYQGIKANGTDQTTWTKDDSVNDANEIGKGNPFAMEINIQANLW
RNFLYANVALYLPDSYKYTPANVTLPTNTNTYDYMNGRVVAPSLVDSYINIGARWSLD
PMDNVNPFNHHRNAGLRYRSMLLGNGRYVPFHIQVPQKFFAIKSLLLLPGSYTYEWNF
RKDVNMILQSSLGNDLRTDGASISFTSINLYATFFPMAHNTASTLEAMLRNDTNDQSF
NDYLSAANMLYPIPANATNVPISIPSRNWAAFRGWSFTRLKTKETPSLGSGFDPYFVY
SGSIPYLDGTFYLNHTFKKVSITFDSSVSWPGNDRLLTPNEFEIKRTVDGEGYNVAQC
NMTKDWFLVQMLAHYNIGYQGFYVPEGYKDRMYSFFRNFQPMSRQVVDEVNYKDYQAV
TLAYQHNNSGFVGYLAPTMRQGQPYPAXYPYPLIGKSAVTSVTQKKFLCDRVMWRIPF
SSNFMSMGALTDLGQNMLYANSAHALDMNFEVDPMDESTLLYVVFEVFDVVRVHQPHR
GVIEAVYXRTPFSAGNATT.
Suitably, this hexon protein, or unique fragments thereof, may be utilized for a variety of purposes. Examples of suitable fragments include the C68 hexon having N-terminal and/or C-terminal truncations of about 50, 100, 150, 200, 300, 400, or 500 amino acids, based upon the amino acid numbering provided above and in SEQ ID NO:16. Other suitable fragments include shorter internal, C-terminal, or N-terminal fragments. For example, one suitable fragment the loop region (domain) of the hexon protein, designated DE1 and FG1, or a hypervariable region thereof. Such fragments include the regions spanning amino acid residues about 125 to 443; about 138 to 441, or smaller fragments, such as those spanning about residue 138 to residue 163; about 170 to about 176; about 195 to about 203; about 233 to about 246; about 253 to about 264; about 287 to about 297; and about 404 to about 430 of C68, with reference to SEQ ID NO:16. Other suitable fragments may be readily identified by one of skill in the art. Further, the hexon protein may be modified for a variety of purposes known to those of skill in the art.
In one example, it may be desirable to generate an adenovirus having an altered hexon protein utilizing the C68 hexon protein sequences of the invention. One suitable method for altering hexon proteins is described in US Patent 5,922,315. In this method, at least one loop region of the adenovirus hexon is changed with at least one loop region of another adenovirus serotype. Thus, at least one loop region of such an altered adenovirus hexon protein is a C68 hexon loop region. In one embodiment, a loop region of the C68 hexon protein is replaced by a loop region from another adenovirus serotype. In another embodiment, the loop region of the C68 hexon is used to replace a loop region from another adenovirus serotype. Suitable adenovirus serotypes may be readily selected from among human and non-human serotypes, as described herein. Where non-human adenoviruses are selected, the serotypes are preferably selected from non-human primates. However, the selection of a suitable serotype is not a limitation of the present invention. Still other uses for the C68 hexon protein sequences of the invention will be readily apparent to those of skill in the art.
The sequences of the C68 fiber protein are: SEQ ID NO:27:
MSKKRVRVDDDFDPVYPYDADNAPTVPFINPPFVSSDGFQEKPL
GVLSLRLADPVTTKNGEITLKLGEGVDLDSSGKLISNTATKAAAPLSFSNNTISLNMD
PFYTKDGKLSLQVSPPLNILRTSILNTLALGFGSGLGLRGSALAVQLVSPLTFDTDGN
IKLTLDRGLHVTTGDAIESNISWAKGLKFEDGAIATNIGNGLEFGSSSTETGVDDAY
PIQVKLGSGLSFDSTGAIMAGNKEDDKLTLWTTPDPSPNCQILAENDAKLTLCLTKCG
SQILATVSVLVVGSGNLNPITGTVSSAQVFLRFDANGVLLTEHSTLKKYWGYRQGDSI
DGTPYTNAVGFMPNLKAYPKSQSSTTKNNIVGQVYMNGDVSKPMLLTITLNGTDDSNS
TYSMSFSYTWTNGSYVGATFGANSYTFSYIAQE.
AFTSLTHVFNRFPENQILVRPPAPTITTVSENVPALTDHGTLPLRSSIRGVQRVTVTD
ARRRTCPYVYKALGIVAPRVLSSRTF.
Suitably, this penton protein, or unique fragments thereof, may be utilized for a variety of purposes. Examples of suitable fragments include the C68 penton having N-terminal and/or C-terminal truncations of about 50, 100, 150, or 200 amino acids, based upon the amino acid numbering provided above and in SEQ ID NO:12. Other suitable fragments include shorter internal, C-terminal, or N-terminal fragments. Further, the penton protein may be modified for a variety of purposes known to those of skill in the art.
The sequences of the C68 hexon are provided in SEQ ID NO:16:
MAT PSMLPQWAYMHIAGQDASEYLSPGLVQFARATDTYFSLGNK
FRNPTVAPTHDVTTDRSQRLTLRFVPVDREDNTYSYKVRYTLAVGDNRVLDMASTYFD
IRGVLDRGPSFKPYSGTAYNSLAPKGAPNTCQWTYKADGETATEKTYTYGNAPVQGIN
ITKDGIQLGTDTDDQPIYADKTYQPEPQVGDAEWHDITGTDEKYGGRALKPDTKMKPC
YGSFAKPTNKEGGQANVKTGTGTTKEYDIDMAFFDNRSAAAAGLAPEIVLYTENVDLE
TPDTHIVYKAGTDDSSSSINLGQQAMPNRPNYIGFRDNFIGLMYYNSTGNMGVLAGQA
SQLNAVVDLQDRNTELSYQLLLDSLGDRTRYFSMWNQAVDSYDPDVRIIENHGVEDEL
PNYCFPLDAVGRTDTYQGIKANGTDQTTWTKDDSVNDANEIGKGNPFAMEINIQANLW
RNFLYANVALYLPDSYKYTPANVTLPTNTNTYDYMNGRVVAPSLVDSYINIGARWSLD
PMDNVNPFNHHRNAGLRYRSMLLGNGRYVPFHIQVPQKFFAIKSLLLLPGSYTYEWNF
RKDVNMILQSSLGNDLRTDGASISFTSINLYATFFPMAHNTASTLEAMLRNDTNDQSF
NDYLSAANMLYPIPANATNVPISIPSRNWAAFRGWSFTRLKTKETPSLGSGFDPYFVY
SGSIPYLDGTFYLNHTFKKVSITFDSSVSWPGNDRLLTPNEFEIKRTVDGEGYNVAQC
NMTKDWFLVQMLAHYNIGYQGFYVPEGYKDRMYSFFRNFQPMSRQVVDEVNYKDYQAV
TLAYQHNNSGFVGYLAPTMRQGQPYPAXYPYPLIGKSAVTSVTQKKFLCDRVMWRIPF
SSNFMSMGALTDLGQNMLYANSAHALDMNFEVDPMDESTLLYVVFEVFDVVRVHQPHR
GVIEAVYXRTPFSAGNATT.
Suitably, this hexon protein, or unique fragments thereof, may be utilized for a variety of purposes. Examples of suitable fragments include the C68 hexon having N-terminal and/or C-terminal truncations of about 50, 100, 150, 200, 300, 400, or 500 amino acids, based upon the amino acid numbering provided above and in SEQ ID NO:16. Other suitable fragments include shorter internal, C-terminal, or N-terminal fragments. For example, one suitable fragment the loop region (domain) of the hexon protein, designated DE1 and FG1, or a hypervariable region thereof. Such fragments include the regions spanning amino acid residues about 125 to 443; about 138 to 441, or smaller fragments, such as those spanning about residue 138 to residue 163; about 170 to about 176; about 195 to about 203; about 233 to about 246; about 253 to about 264; about 287 to about 297; and about 404 to about 430 of C68, with reference to SEQ ID NO:16. Other suitable fragments may be readily identified by one of skill in the art. Further, the hexon protein may be modified for a variety of purposes known to those of skill in the art.
In one example, it may be desirable to generate an adenovirus having an altered hexon protein utilizing the C68 hexon protein sequences of the invention. One suitable method for altering hexon proteins is described in US Patent 5,922,315. In this method, at least one loop region of the adenovirus hexon is changed with at least one loop region of another adenovirus serotype. Thus, at least one loop region of such an altered adenovirus hexon protein is a C68 hexon loop region. In one embodiment, a loop region of the C68 hexon protein is replaced by a loop region from another adenovirus serotype. In another embodiment, the loop region of the C68 hexon is used to replace a loop region from another adenovirus serotype. Suitable adenovirus serotypes may be readily selected from among human and non-human serotypes, as described herein. Where non-human adenoviruses are selected, the serotypes are preferably selected from non-human primates. However, the selection of a suitable serotype is not a limitation of the present invention. Still other uses for the C68 hexon protein sequences of the invention will be readily apparent to those of skill in the art.
The sequences of the C68 fiber protein are: SEQ ID NO:27:
MSKKRVRVDDDFDPVYPYDADNAPTVPFINPPFVSSDGFQEKPL
GVLSLRLADPVTTKNGEITLKLGEGVDLDSSGKLISNTATKAAAPLSFSNNTISLNMD
PFYTKDGKLSLQVSPPLNILRTSILNTLALGFGSGLGLRGSALAVQLVSPLTFDTDGN
IKLTLDRGLHVTTGDAIESNISWAKGLKFEDGAIATNIGNGLEFGSSSTETGVDDAY
PIQVKLGSGLSFDSTGAIMAGNKEDDKLTLWTTPDPSPNCQILAENDAKLTLCLTKCG
SQILATVSVLVVGSGNLNPITGTVSSAQVFLRFDANGVLLTEHSTLKKYWGYRQGDSI
DGTPYTNAVGFMPNLKAYPKSQSSTTKNNIVGQVYMNGDVSKPMLLTITLNGTDDSNS
TYSMSFSYTWTNGSYVGATFGANSYTFSYIAQE.
Suitably, this fiber protein, or unique fragments thereof, may be utilized for a variety of purposes. One suitable fragment is the fiber knob, which spans about amino acids 247 to 425 of SEQ ID NO: 27. Examples of other suitable fragments include the C68 fiber having N-terminal and/or C-terminal truncations of about 50, 100, 150, or 200 amino acids, based upon the amino acid numbering provided above and in SEQ ID NO:27. Still other suitable fragments include internal fragments. Further, the fiber protein may be modified using a variety of techniques known to those of skill in the art.
The amino acid sequences of other useful gene products of C68 are provided in SEQ
ID Nos. 1 - 11, 13 - 15, 17 - 26, and 28 -38 of the attached sequence listing.
More particularly, these sequences are as follows.
Regions Ad C68 - CDS, Ad C68 With ref to SEQ ID NO:33. SEQ ID NO:
Eta 1lkDa 578... 649,1236 ... 1469 1 28.2 kDa 578... 1142,1236 ... 1444 2 24.8 kDa 578.. 1049, 1236.. 1444 3 E l b 20.5kDa 1603 ...2163 4 54.7 kDa 1908... 3404 5 18.5 kDa 1908 .. 2200, 3188 ..3404 6 10.1 kDa 1908.. 2170, 3306...3324 7 IX Hexon-associated 3489..3917 8 protein - pIX
IVa2 Maturation protein - Complement (3976.. 5309, 9 pIVa2 5588 ... 5600) LI 21.9kDa 7858... 8460 10 42.9 kDa 10825.. 12000 11 L2 Penton - plII 13888... 15492 12 Major core protein - 15493 ... 16098 13 VIII
Minor Core Protein - 16120... 17190 14 V
L3 Hexon-associated 17442 ... 18215 15 protein - pVI
Hexon - p1l 18322... 21123 16 E2a DNA-Binding Protein Complement 17 Endo a tidase (21835 . . 23376 L4 Virion Complement (25529... 18 morphogenesis- 25862, 26032 .. 26366) associated protein 24.3 kDa Hexon-associated 26446 19 protein - pVIII
The amino acid sequences of other useful gene products of C68 are provided in SEQ
ID Nos. 1 - 11, 13 - 15, 17 - 26, and 28 -38 of the attached sequence listing.
More particularly, these sequences are as follows.
Regions Ad C68 - CDS, Ad C68 With ref to SEQ ID NO:33. SEQ ID NO:
Eta 1lkDa 578... 649,1236 ... 1469 1 28.2 kDa 578... 1142,1236 ... 1444 2 24.8 kDa 578.. 1049, 1236.. 1444 3 E l b 20.5kDa 1603 ...2163 4 54.7 kDa 1908... 3404 5 18.5 kDa 1908 .. 2200, 3188 ..3404 6 10.1 kDa 1908.. 2170, 3306...3324 7 IX Hexon-associated 3489..3917 8 protein - pIX
IVa2 Maturation protein - Complement (3976.. 5309, 9 pIVa2 5588 ... 5600) LI 21.9kDa 7858... 8460 10 42.9 kDa 10825.. 12000 11 L2 Penton - plII 13888... 15492 12 Major core protein - 15493 ... 16098 13 VIII
Minor Core Protein - 16120... 17190 14 V
L3 Hexon-associated 17442 ... 18215 15 protein - pVI
Hexon - p1l 18322... 21123 16 E2a DNA-Binding Protein Complement 17 Endo a tidase (21835 . . 23376 L4 Virion Complement (25529... 18 morphogenesis- 25862, 26032 .. 26366) associated protein 24.3 kDa Hexon-associated 26446 19 protein - pVIII
Regions Ad C68 - CDS, SEQ ID NO:
With ref to SEQ ID NO:33.
E3 11.6 kDa 27130... 27450 20 16 kDa (27404 ... 27477,27666.. 21 28032) 19.3 kDa 28014.. 28544 22 22.3 28572.. 29186 23 9.9 kDa 30722.. 30997 24 15.6 kDa 31003.. 31434 25 14.7 kDa 31427.. 31834 26 L5 Fiber - p1V 32137.. 33414 27 E4 ORF7-like protein Complement (33521.. 28 >33772) Orf 6 - 33 kDa Complement (33769..34674) 29 Orf4 - 13.2 kDa Complement (34580.. 34945) 30 Orf 3 - 12. 8 kDa Complement (34955.. 35308) 31 Orf 2 - 14.2 kDa Complement (35305.. 35694 32 Thus, the invention provides unique C68 proteins, peptides and fragments thereof, which are produced recombinantly or by other methods. Suitably, such fragments are at least 8 amino acids in length. However, fragments of other desired lengths are readily utilized. In addition, the invention encompasses such modifications as may be introduced to enhance yield and/or expression of a C68 protein or fragment, construction of a fusion molecule in which all or a fragment of the C68 protein or fragment is fused (either directly or via a linker) with a fusion partner to enhance. Other suitable modifications include, without limitation, truncation of a coding region (e.g., a protein or enzyme) to eliminate a pre-or pro-protein ordinarily cleaved to produce the mature protein or enzyme and/or mutation of a coding region to provide a secretable gene product. Still other modifications will be readily apparent to one of skill in the art. The invention further encompasses proteins having at least about 95% to 99% identity to the C68 proteins provided herein.
The term "substantial homology" or "substantial similarity," when referring to a protein or fragment thereof, indicates that, when optimally aligned with appropriate amino acid insertions or deletions with another protein, there is nucleotide sequence identity in at least about 95 to 99% of the aligned sequences .
The term "percent sequence identity" or "identical" in the context of proteins or fragments thereof refers to the amino acids in the two sequences that are the same when aligned for maximum correspondence. The length of sequence identity comparison may be over the full length of a protein, enzyme, polypeptide, peptide, or other fragment of at least about 200 to 500 amino acids, is desired. However, identity among smaller fragments, e.g. of at least about 8 amino acids, usually at least about 20 to 24 amino acids, at least about 28 to 32 amino acids, at least about 50 or more amino acids, may also be desired.
Identity is readily determined by one of skill in the art by resort to algorithms and computer programs known by those of skill in the art. As described herein, alignments are performed using any of a variety of publicly or commercially available Multiple Sequence Alignment Programs, such as "Clustal W", accessible through Web Servers on the internet.
Alternatively, Vector NTI utilities are also used. There are also a number of algorithms known in the art that can be used to measure amino acid sequence identity, including those contained in the programs described above. Generally, these programs are used at default settings, although one of skill in the art can alter these settings as needed.
Alternatively, one of skill in the art can utilize another algorithm or computer program that provides at least the level of identity or alignment as that provided by the referenced algorithms and programs.
As described herein, the C68-derived capsid proteins of the invention are particularly well suited for use in applications in which the neutralizing antibodies diminish the effectiveness of other Ad serotype based targeting proteins and vectors, as well as other viral vectors. The C68-derived constructs of the invention are particularly advantageous in readministration for repeat gene therapy or for boosting immune response (vaccine titers).
Also provided by the present invention are artificial adenoviral capsid proteins, which involve modifications and chimeric capsids constructed using the C68 adenoviral capsid proteins of the invention. Such artificial capsid proteins can be constructed using the amino acid sequences of the chimp C68 Ad hexon of the invention. Because the hexon protein is the determinant for serotype of an adenovirus, such artificial hexon proteins would result in adenoviruses having artificial serotypes. Other artificial capsid proteins can also be constructed using the chimp Ad penton sequences and/or fiber sequences of the invention and/or fragments thereof.
In one embodiment, a chimeric C68 capsid is constructed using C68 hexon and fiber and a penton from another adenovirus. Alternatively, a chimeric C68 capsid comprises a C68 hexon and a fiber and penton from one or more different adenoviruses.
Another chimeric adenovirus capsid comprises the C68 fiber and a penton and a hexon from one or more different different adenovirus serotypes. Yet another chimeric adenovirus capsid comprises the C68 penton and a fiber and hexon from one or more different adenovirus serotypes. Suitably, for such chimeric and artificial capsids constructed from C68 proteins, the non-C68 adenovirus components may be readily selected from other adenovirus serotypes.
Under certain circumstances, it may be desirable to use one or more of the C68-derived capsid proteins or a fragment thereof to generate an antibody. The term "an antibody," as used herein, refers to an immunoglobulin molecule which is able to specifically bind to an epitope. The antibodies in the present invention exist in a variety of forms including, for example, high affinity polyclonal antibodies, monoclonal antibodies, synthetic antibodies, chimeric antibodies, recombinant antibodies and humanized antibodies. Such antibodies originate from immunoglobulin classes IgG, IgM, IgA, IgD and IgE.
Such antibodies may be generated using any of a number of methods know in the art.
Suitable antibodies may be generated by well-known conventional techniques, e.g. Kohler and Milstein and the many known modifications thereof. Similarly desirable high titer antibodies are generated by applying known recombinant techniques to the monoclonal or polyclonal antibodies developed to these antigens [see, e.g., PCT Patent Application No.
PCT/GB85/00392; British Patent Application Publication No. GB2188638A; Amit et al., 1986 Science, 233:747-753; Queen et al., 1989 Proc. Nat'l. Acad. Sci. USA, 86:10029-10033;
PCT Patent Application No. PCT/W09007861; and Riechmann et al., Nature, 332:323-327 (1988); Huse et al, 1988a Science, 246:1275-1281]. Alternatively, antibodies can be produced by manipulating the complementarity determining regions of animal or human antibodies to the antigen of this invention. See, e.g., E. Mark and Padlin, "Humanization of Monoclonal Antibodies", Chapter 4, The Handbook of Experimental Pharmacology, Vol.
113, The Pharmacology of Monoclonal Antibodies, Springer-Verlag (June, 1994);
Harlow et al., 1999, Using Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press, NY; Harlow et al., 1989, Antibodies: A Laboratory Manual, Cold Spring Harbor, New York;
Houston et al., 1988, Proc. Natl. Acad. Sci. USA 85:5879-5883; and Bird et al., 1988, Science 242:423-426.
Alternatively, one or more of the C68 capsid proteins of the invention are assembled as multi-antigenic complexes [see, e.g., European Patent Application 0339695, published November 2, 1989] and employed to elicit high titer antibodies. Further provided by the present invention are anti-idiotype antibodies (Ab2) and anti-anti-idiotype antibodies (Ab3).
See, e.g., M. Wettendorff et al., "Modulation of anti-tumor immunity by anti-idiotypic antibodies." In Idiotypic Network and Diseases, ed. by J. Cerny and J.
Hiernaux, 1990 J. Am.
Soc. Microbiol., Washington DC: pp. 203-229]. These anti-idiotype and anti-anti-idiotype antibodies are produced using techniques well known to those of skill in the art. These antibodies may be used for a variety of purposes, including diagnostic and clinical methods and kits.
Under certain circumstances, it may be desirable to introduce a detectable label or a tag onto a C68 antibody or other construct of the invention. As used herein, a detectable label is a molecule which is capable, alone or upon interaction with another molecule, of providing a detectable signal. Most desirably, the label is detectable visually, e.g. by fluorescence, for ready use in immunohistochemical analyses or immunofluorescent microscopy. For example, suitable labels include fluorescein isothiocyanate (FITC), phycoerythrin (PE), allophycocyanin (APC), coriphosphine-O (CPO) or tandem dyes, PE-cyanin-5 (PC5), and PE-Texas Red (ECD). All of these fluorescent dyes are commercially available, and their uses known to the art. Other useful labels include a colloidal gold label. Still other useful labels include radioactive compounds or elements. Additionally, labels include a variety of enzyme systems that operate to reveal a colorimetric signal in an assay, e.g., glucose oxidase (which uses glucose as a substrate) releases peroxide as a product which in the presence of peroxidase and a hydrogen donor such as tetramethyl benzidine (TMB) produces an oxidized TMB that is seen as a blue color. Other examples include horseradish peroxidase (HRP) or alkaline phosphatase (AP), and hexokinase in conjunction with glucose-6-phosphate dehydrogenase which reacts with ATP, glucose, and NAD+ to yield, among other products, NADH that is detected as increased absorbance at 340 nm wavelength. Other label systems that are utilized in the methods of this invention are detectable by other means, e.g., colored latex microparticles [Bangs Laboratories, Indiana] in which a dye is embedded are used in place of enzymes to form conjugates with the target sequences provide a visual signal indicative of the presence of the resulting complex in applicable assays.
Methods for coupling or associating the label with a desired molecule are similarly conventional and known to those of skill in the art. Known methods of label attachment are described [see, for example, Handbook of Fluorescent probes and Research Chemicals, 6th Ed., R. P. M. Haugland, Molecular Probes, Inc., Eugene, OR, 1996; Pierce Catalog and Handbook, Life Science and Analytical Research Products, Pierce Chemical Company, Rockford, IL, 1994/1995]. Thus, selection of the label and coupling methods do not limit this invention.
The C68-derived proteins, peptides, and fragments described herein can be produced by any suitable means, including chemical synthesis, or other synthetic means, or by recombinant production and conventional genetic engineering methodologies. For example, peptides can be synthesized by the well known solid phase peptide synthesis methods (Merrifield, J. Am. Chem. Soc., 85:2149 (1962); Stewart and Young, Solid Phase Peptide Synthesis (Freeman, San Francisco, 1969) pp. 27-62). These and other suitable production methods are within the knowledge of those of skill in the art and are not a limitation of the present invention.
Alternatively, suitable methods for recombinant production can be used.
Selection of suitable expression systems, including expression vectors and host cells for protein expression and/or viral packaging is within the ability of one of skill in the art and is not a limitation of the present invention. See, e.g., Sambrook et al, Molecular Cloning: A
Laboratory Manual, Cold Spring Harbor Press (Cold Spring Harbor, NY).
Nucleic acid sequences for the C68 genome, which is 36521 bp in length, may be obtained using information available in US Patent 6,083,716 and from the American Type Culture Collection, 10801 University Boulevard, Manassas, Virginia 20110-2209 (Pan-9).
This sequences is also available from GenBank. Other chimpanzee adenovirus sequences are available from the American Type Culture Collection, 10801 University Boulevard, Manassas, Virginia 20110-2209, and other sources. Desirable chimpanzee strains Pan 5 [ATCC VR-591], Pan 6 [ATCC VR-592], and Pan 7 [ATCC VR-593]. Another particularly desirable chimpanzee adenovirus strain is chimpanzee adenovirus strain Bertha or Cl [ATCC
Accession No. VR-20]. The sequence of the Cl serotype, and the location of the adenovirus genes Ela, Elb, E2a, E2b, E3, E4, L1, L2, L3, L4 and L5 are provided in US
Patent 6,083,716. Optionally, non-chimpanzee simian adenoviral sequences may be used.
Such non-chimpanzee adenovirus include those obtained from baboon adenovirus strains [e.g., ATCC VR-275], adenovirus strains isolated from rhesus monkeys [e.g., ATCC VR-209, ATCC VR-275, ATCC VR-353, ATCC VR-355], and adenovirus strains isolated from African green monkeys [e.g., ATCC VR-541; ATCC VR-941; ATCC VR-942; ATCC VR-943]. Alternatively, one may readily select from among the at least 51 different human serotypes, including, without limitation, human adenovirus serotypes 1, 2, 3, 4, 5, 12, 35, 37, and 40, and other, non-human primate adenovirus serotypes. Further, the sequences of these and other suitable serotypes are available from a variety of databases including, e.g., PubMed and GenBank [see, for example, US Patent No. 5,240,846]. Selection of an appropriate adenovirus is not a limitation of the present invention.
The invention further provides molecules useful for production of the C68 and derived proteins of the invention, including such molecules which carry polynucleotides including DNA sequences. Thus, the invention further encompasses the nucleic acid sequences encoding the C68-derived constructs of the invention, and molecules and host cells useful in expression thereof, including suitable DNA molecules and vectors, which can be any suitable genetic element as defined herein. Preferably, these vectors are DNA-based (e.g., plasmids) or viral vectors.
In one embodiment, the C68-derived capsid proteins and other C68 adenovirus proteins described herein are used for non-viral, protein-based delivery of genes, proteins, and other desirable diagnostic, therapeutic and immunogenic molecules. A desired molecule for delivery to a target call may be associated with a C68-derived capsid protein or other protein by any suitable means, including, e.g., covalent or non-covalent binding. For example, the C68 penton protein may be readily utilized for such a purpose by production of a fusion protein using the C68 penton sequences of SEQ ID NO:12 in a manner analogous to that described in Medina-Kauwe LK, et al, Gene Ther. 2001 May; 8(10):795-803 and Medina-Kauwe LK, et al, Gene Ther. 2001 Dec; 8(23): 1753-1761. Alternatively, the amino acid sequences of C68 protein IX may be utilized for targeting vectors by associating the protein IX with a ligand that binds to a cell surface receptor, as described in US
Patent Appln 20010047081. Suitable ligands include a CD40 antigen, an RGD-containing or polylysine-containing sequence, and the like. Still other C68 proteins may be used for used for these and similar purposes.
Further, the C68 adenovirus proteins of the invention are particularly well suited for use in producing viral vectors in C68-derived capsids. Suitably, these adenoviruses are pseudotyped such that a nucleic acid molecule carrying adenovirus ITRs from a non-C68 serotype and a minigene are packaged in a C68-derived adenoviral capsid of the invention.
Alternatively, adenoviruses may be generated which contain at least the 5' ITRs or the 3' ITRs from C68, in a C68-derived capsid protein. The adenoviral vectors described herein may contain adenoviral sequences derived from one, more than one adenoviral strain. In yet another alternative, other C68 elements described herein may be utilized in production of recombinant vectors, or other desirable constructs.
The C68 proteins of the invention are useful for a variety of purposes, including construction of recombinant viruses. The C68-derived capsid proteins of the invention are useful in producing hybrid vectors, including, hybrid C68-adeno-associated viruses, Epstein-Barr virus, and retroviruses [Caplen et al, Gene Ther. 6: 454-459 (1999); Tan et al, J Virol., 73:7582-7589 (1999)]. Such viruses include C68-derived capsids which encapsidated vectors with adeno-associated virus (AAV) ITRs [Lieber et al, J Virol, 73:9314-9324 (1999), Recchia et al, Proc Natl Acad Sci USA, 96:2615-2620 (1999); or lentivirus ITRs (Zheng et al, Nat Biotech, 18:176-180 (2000), using Maloney leukemia virus long terminal repeats).
In a particularly desirable embodiment, the C68-derived capsid proteins, and optionally, the other C68 sequences described herein, are used to produce recombinant adenoviruses and pseudotyped adenoviruses. However, it will be readily understood that the C68-derived capsid proteins and other novel C68 sequences can be utilized for a variety of purposes, including production of other types of viral vectors (such as, e.g., hybrid vectors) carrying the therapeutic and immunogenic transgenes described below.
Additionally, it will be readily understood that viral vectors carrying the unique C68 proteins and other sequences of the invention can be utilized for targeting and/or delivery of other types of molecules, including proteins, chemical molecules and other moieties useful for diagnostic, therapeutic and/or immunization purposes.
II. Recombinant Adenoviral Vectors The compositions of this invention include vectors that deliver a heterologous molecule to cells, either for therapeutic or vaccine purposes. As used herein, a vector may include any genetic element including, without limitation, a cosmid, episome, plasmid, or a virus. In a particularly preferred embodiment, these vectors are viral vectors having capsid proteins derived from the C68 proteins of the invention. Alternatively, these vectors may contain other C68 sequences of the invention. These viral vectors suitably contain a minigene. By "minigene" is meant the combination of a selected heterologous gene and the other regulatory elements necessary to drive translation, transcription and/or expression of the gene product in a host cell.
Typically, an adenoviral vector is designed such that the minigene is flanked on its 5' end and/or its 3' end by adenoviral sequences which include, at a minimum, the cis-elements necessary for replication and virion encapsidation. Thus, in one embodiment, the vector contains adenoviral sequences encompassing at least the 5' end of the adenoviral genome, i.e., the 5' inverted terminal repeat sequences (which functions as origins of replication) and the native 5' packaging enhancer domains (that contain sequences necessary for packaging linear Ad genomes and enhancer elements for the El promoter). The vector is also provided with the cis-acting 3' ITRs. Suitably, the minigene is located between the 5' adenoviral elements and the 3' adenoviral elements. An adenoviral vector of the invention may also contain additional adenoviral sequences. For example, the minigene may be located in the site of such as the site of a functional El deletion or functional E3 deletion, among others that may be selected. Alternatively, the minigene may be inserted into an existing gene region to disrupt the function of that region, if desired.
The term "functionally deleted" or "functional deletion" means that a sufficient amount of the gene region is removed or otherwise damaged, e.g., by mutation or modification, so that the gene region is no longer capable of producing functional products of gene expression. If desired, the entire gene region may be removed.
Suitably, these adenoviral vectors of the invention contain one or more adenoviral elements derived from C68. In one embodiment, the vectors contain adenoviral ITRs from an adenoviral serotype which differs from C68. Alternatively, C68 ITRs may be utilized in a viral vector of the invention in which the capsid is not naturally occurring, but contains one or more C68 proteins, or fragments thereof. The selection of the serotype of the ITRs and the serotype of any other adenoviral sequences present in vector is not a limitation of the present invention. A variety of adenovirus strains are described herein.
The viral sequences, helper viruses, if needed, and recombinant viral particles, and other vector components and sequences employed in the construction of the vectors described herein are obtained as described above. See, e.g., US Patent No. 5,240,846.
The DNA
sequences of the adenovirus sequences are employed to construct vectors and cell lines useful in the preparation of such vectors. See, e.g., US Patent No. 6,083,716.
Modifications of the nucleic acid sequences forming the vectors of this invention, including sequence deletions, insertions, and other mutations may be generated using standard molecular biological techniques and are within the scope of this invention.
A. The "Minigene"
The methods employed for the selection of the transgene, the cloning and construction of the "minigene" and its insertion into the viral vector are within the skill in the art given the teachings provided herein.
1. The transgene The transgene is a nucleic acid sequence, heterologous to the vector sequences flanking the transgene, which encodes a polypeptide, protein, or other product, of interest. The nucleic acid coding sequence is operatively linked to regulatory components in a manner which permits transgene transcription, translation, and/or expression in a host cell.
The composition of the transgene sequence will depend upon the use to which the resulting vector will be put. For example, one type of transgene sequence includes a reporter sequence, which upon expression produces a detectable signal. Such reporter sequences include, without limitation, DNA sequences encoding (3-lactamase, J3-galactosidase (LacZ), alkaline phosphatase, thymidine kinase, green fluorescent protein (GFP), chloramphenicol acetyltransferase (CAT), luciferase, membrane bound proteins including, for example, CD2, CD4, CD8, the influenza hemagglutinin protein, and others well known in the art, to which high affinity antibodies directed thereto exist or can be produced by conventional means, and fusion proteins comprising a membrane bound protein appropriately fused to an antigen tag domain from, among others, hemagglutinin or Myc. These coding sequences, when associated with regulatory elements which drive their expression, provide signals detectable by conventional means, including enzymatic, radiographic, colorimetric, fluorescence or other spectrographic assays, fluorescent activating cell sorting assays and immunological assays, including enzyme linked immunosorbent assay (ELISA), radioimmunoassay (RIA) and immunohistochemistry. For example, where the marker sequence is the LacZ gene, the presence of the vector carrying the signal is detected by assays for beta-galactosidase activity. Where the transgene is GFP or luciferase, the vector carrying the signal may be measured visually by color or light production in a luminometer.
However, desirably, the transgene is a non-marker sequence encoding a product which is useful in biology and medicine, such as proteins, peptides, RNA, enzymes, or catalytic RNAs. Desirable RNA molecules include tRNA, dsRNA, ribosomal RNA, catalytic RNAs, and antisense RNAs. One example of a useful RNA sequence is a sequence which extinguishes expression of a targeted nucleic acid sequence in the treated animal.
The transgene may be used for treatment, e.g., of genetic deficiencies, as a cancer therapeutic or vaccine, for induction of an immune response, and/or for prophylactic vaccine purposes. As used herein, induction of an immune response refers to the ability of a molecule (e.g., a gene product) to induce a T cell and/or a humoral immune response to the molecule. The invention further includes using multiple transgenes, e.g., to correct or ameliorate a condition caused by a multi-subunit protein. In certain situations, a different transgene may be used to encode each subunit of a protein, or to encode different peptides or proteins. This is desirable when the size of the DNA encoding the protein subunit is large, e.g., for an immunoglobulin, the platelet-derived growth factor, or a dystrophin protein. In order for the cell to produce the multi-subunit protein, a cell is infected with the recombinant virus containing each of the different subunits. Alternatively, different subunits of a protein may be encoded by the same transgene. In this case, a single transgene includes the DNA
encoding each of the subunits, with the DNA for each subunit separated by an internal ribozyme entry site (IRES). This is desirable when the size of the DNA
encoding each of the subunits is small, e.g., the total size of the DNA encoding the subunits and the IRES is less than five kilobases. As an alternative to an IRES, the DNA may be separated by sequences encoding a 2A peptide, which self-cleaves in a post-translational event. See, e.g., M.L.
Donnelly, et al, J. Gen. Virol., 78(Pt 1):13-21 (Jan 1997); Furler, S., et al, Gene Ther., 8(11):864-873 (June 2001); Klump H., et al., Gene Ther., 8(10):811-817 (May 2001). This 2A peptide is significantly smaller than an IRES, making it well suited for use when space is a limiting factor. However, the selected transgene may encode any biologically active product or other product, e.g., a product desirable for study.
Suitable transgenes may be readily selected by one of skill in the art. The selection of the transgene is not considered to be a limitation of this invention.
2. Regulatory Elements In addition to the major elements identified above for the minigene, the vector also includes conventional control elements necessary which are operably linked to the transgene in a manner that permits its transcription, translation and/or expression in a cell transfected with the plasmid vector or infected with the virus produced by the invention. As used herein, "operably linked" sequences include both expression control sequences that are contiguous with the gene of interest and expression control sequences that act in trans or at a distance to control the gene of interest.
Expression control sequences include appropriate transcription initiation, termination, promoter and enhancer sequences; efficient RNA
processing signals such as splicing and polyadenylation (polyA) signals; sequences that stabilize cytoplasmic mRNA; sequences that enhance translation efficiency (i.e., Kozak consensus sequence);
sequences that enhance protein stability; and when desired, sequences that enhance secretion of the encoded product. A great number of expression control sequences, including promoters which are native, constitutive, inducible and/or tissue-specific, are known in the art and may be utilized.
Examples of constitutive promoters include, without limitation, the retroviral Rous sarcoma virus (RSV) LTR promoter (optionally with the RSV
enhancer), the cytomegalovirus (CMV) promoter (optionally with the CMV enhancer) [see, e.g., Boshart et al, Cell, 41:521-530 (1985)], the SV40 promoter, the dihydrofolate reductase promoter, the J3-actin promoter, the phosphoglycerol kinase (PGK) promoter, and the EF1a promoter [Invitrogen].
Inducible promoters allow regulation of gene expression and can be regulated by exogenously supplied compounds, environmental factors such as temperature, or the presence of a specific physiological state, e.g., acute phase, a particular differentiation state of the cell, or in replicating cells only. Inducible promoters and inducible systems are available from a variety of commercial sources, including, without limitation, Invitrogen, Clontech and Ariad. Many other systems have been described and can be readily selected by one of skill in the art. For example, inducible promoters include the zinc-inducible sheep metallothionine (MT) promoter and the dexamethasone (Dex)-inducible mouse mammary tumor virus (MMTV) promoter. Other inducible systems include the T7 polymerase promoter system [WO 98/10088]; the ecdysone insect promoter [No et al, Proc.
Natl. Acad.
Sci. USA, 93:3346-3351 (1996)], the tetracycline-repressible system [Gossen et al, Proc. Natl.
Acad. Sci. USA, 89:5547-5551 (1992)], the tetracycline-inducible system [Gossen et al, Science, 268:1766-1769 (1995), see also Harvey et al, Curr. Opin. Chem. Biol., 2:512-518 (1998)]. Other systems include the FK506 dimer, VP16 or p65 using castradiol, diphenol murislerone, the RU486-inducible system [Wang et al, Nat. Biotech., 15:239-243 (1997) and Wang et al, Gene Ther., 4:432-441 (1997)] and the rapamycin-inducible system [Magari et al, J. Clin. Invest., 100:2865-2872 (1997)]. The effectiveness of some inducible promoters increases over time. In such cases one can enhance the effectiveness of such systems by inserting multiple repressors in tandem, e.g., TetR linked to a TetR by an IRES.
Alternatively, one can wait at least 3 days before screening for the desired function. Once can enhance expression of desired proteins by known means to enhance the effectiveness of this system. For example, using the Woodchuck Hepatitis Virus Posttranscriptional Regulatory Element (WPRE).
In another embodiment, the native promoter for the transgene will be used. The native promoter may be preferred when it is desired that expression of the transgene should mimic the native expression. The native promoter may be used when expression of the transgene must be regulated temporally or developmentally, or in a tissue-specific manner, or in response to specific transcriptional stimuli. In a further embodiment, other native expression control elements, such as enhancer elements, polyadenylation sites or Kozak consensus sequences may also be used to mimic the native expression.
Another embodiment of the transgene includes a transgene operably linked to a tissue-specific promoter. For instance, if expression in skeletal muscle is desired, a promoter active in muscle should be used. These include the promoters from genes encoding skeletal (3-actin, myosin light chain 2A, dystrophin, muscle creatine kinase, as well as synthetic muscle promoters with activities higher than naturally occurring promoters (see Li et al., Nat. Biotech., 17:241-245 (1999)). Examples of promoters that are tissue-specific are known for liver (albumin, Miyatake et al., J. Virol., 71:5124-32 (1997);
hepatitis B virus core promoter, Sandig et al., Gene Ther., 3:1002-9 (1996); alpha-fetoprotein (AFP), Arbuthnot et al., Hum. Gene Ther., 7:1503-14 (1996)), bone osteocalcin (Stein et al., Mol.
Biol. Rep., 24:185-96 (1997)); bone sialoprotein (Chen et al., J. Bone Miner.
Res., 11:654-64 (1996)), lymphocytes (CD2, Hansal et al., J. Immunol., 161:1063-8 (1998);
immunoglobulin heavy chain; T cell receptor chain), neuronal such as neuron-specific enolase (NSE) promoter (Andersen el al., Cell. Mol. Neurobiol., 13:503-15 (1993)), neurofilament light-chain gene (Piccioli et al., Proc. Natl. Acad. Sci. USA, 88:5611-5 (1991)), and the neuron-specific vgf gene (Piccioli et al., Neuron, 15:373-84 (1995)), among others.
Optionally, vectors carrying transgenes encoding therapeutically useful or immunogenic products may also include selectable markers or reporter genes may include sequences encoding geneticin, hygromicin or purimycin resistance, among others. Such selectable reporters or marker genes (preferably located outside the viral genome to be packaged into a viral particle) can be used to signal the presence of the plasmids in bacterial cells, such as ampicillin resistance. Other components of the vector may include an origin of replication. Selection of these and other promoters and vector elements are conventional and many such sequences are available [see, e.g., Sambrook et al.].
These vectors are generated using the techniques and sequences provided herein, in conjunction with techniques known to those of skill in the art.
Such techniques include conventional cloning techniques of cDNA such as those described in texts [Sambrook et al, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Press, Cold Spring Harbor, NY], use of overlapping oligonucleotide sequences of the adenovirus genomes, polymerase chain reaction, and any suitable method which provides the desired nucleotide sequence.
III. Production of the Recombinant Viral Particle In one embodiment, the chimpanzee adenoviral plasmids (or other vectors) are used to produce recombinant adenoviral particles. In one embodiment, the recombinant adenoviruses are functionally deleted in the E 1 a or E 1 b genes, and optionally bearing other mutations, e.g., temperature-sensitive mutations or deletions in other genes. In other embodiments, it is desirable to retain an intact El a and/or El b region in the recombinant adenoviruses. Such an intact El region may be located in its native location in the adenoviral genome or placed in the site of a deletion in the native adenoviral genome (e.g., in the E3 region).
In the construction of useful chimpanzee adenovirus vectors for delivery of a gene to the human (or other mammalian) cell, a range of adenovirus nucleic acid sequences can be employed in the vectors. For example, all or a portion of the adenovirus delayed early gene E3 may be eliminated from the C68 adenovirus sequence which forms a part of the recombinant virus. The function of adenovirus E3 is believed to be irrelevant to the function and production of the recombinant virus particle. Adenovirus vectors may also be constructed having a deletion of at least the ORF6 region of the E4 gene, and more desirably because of the redundancy in the function of this region, the entire E4 region. Still another vector of this invention contains a deletion in the delayed early gene E2a.
Deletions may also be made in any of the late genes L1 through L5 of the chimpanzee adenovirus genome.
Similarly, deletions in the intermediate genes IX and IVa2 may be useful for some purposes.
Other deletions may be made in the other structural or non-structural adenovirus genes. The above discussed deletions may be used individually, i.e., an adenovirus sequence for use in the present invention may contain deletions in only a single region.
Alternatively, deletions of entire genes or portions thereof effective to destroy their biological activity may be used in any combination. For example, in one exemplary vector, the adenovirus sequence may have deletions of the E 1 genes and the E4 gene, or of the E 1, E2a and E3 genes, or of the E 1 and E3 genes, or of E1, E2a and E4 genes, with or without deletion of E3, and so on. As discussed above, such deletions may be used in combination with other mutations, such as temperature-sensitive mutations, to achieve a desired result.
An adenoviral vector lacking any essential adenoviral sequences (e.g., El a, Elb, E2a, E2b, E4 ORF6, L1, L2, L3, L4 and L5) may be cultured in the presence of the missing adenoviral gene products which are required for viral infectivity and propagation of an adenoviral particle. These helper functions may be provided by culturing the adenoviral vector in the presence of one or more helper constructs (e.g., a plasmid or virus) or a packaging host cell. See, for example, the techniques described for preparation of a "minimal" human Ad vector in International Patent Application W096/13597, published May 9, 1996.
1. Helper Viruses Thus, depending upon the chimpanzee adenovirus gene content of the viral vectors employed to carry the minigene, a helper adenovirus or non-replicating virus fragment may be necessary to provide sufficient chimpanzee adenovirus gene sequences necessary to produce an infective recombinant viral particle containing the minigene. Useful helper viruses contain selected adenovirus gene sequences not present in the adenovirus vector construct and/or not expressed by the packaging cell line in which the vector is transfected. In one embodiment, the helper virus is replication-defective and contains a variety of adenovirus genes in addition to the sequences described above. Such a helper virus is desirably used in combination with an E l -expressing cell line.
Helper viruses may also be formed into poly-cation conjugates as described in Wu et a!, J. Biol. Chem., 264:16985-16987 (1989); K. J. Fisher and J. M.
Wilson, Biochem. J., 299:49 (April 1, 1994). Helper virus may optionally contain a second reporter minigene. A number of such reporter genes are known to the art. The presence of a reporter gene on the helper virus which is different from the transgene on the adenovirus vector allows both the Ad vector and the helper virus to be independently monitored. This second reporter is used to enable separation between the resulting recombinant virus and the helper virus upon purification.
2. Complementation Cell Lines To generate recombinant chimpanzee adenoviruses (Ad) deleted in any of the genes described above, the function of the deleted gene region, if essential to the replication and infectivity of the virus, must be supplied to the recombinant virus by a helper virus or cell line, i.e., a complementation or packaging cell line. In many circumstances, a cell line expressing the human E1 can be used to transcomplement the chimp Ad vector. This is particularly advantageous because, due to the diversity between the chimp Ad sequences of the invention and the human AdEI sequences found in currently available packaging cells, the use of the current human E 1-containing cells prevents the generation of replication-competent adenoviruses during the replication and production process. However, in certain circumstances, it will be desirable to utilize a cell line which expresses the E1 gene products can be utilized for production of an El-deleted chimpanzee adenovirus. Such cell lines have been described. See, e.g., US Patent 6,083,716.
If desired, one may utilize the sequences provided herein to generate a packaging cell or cell line that expresses, at a minimum, the adenovirus El gene under the transcriptional control of a promoter for expression in a selected parent cell line. Inducible or constitutive promoters may be employed for this purpose. Examples of such promoters are described in detail elsewhere in this specification. A parent cell is selected for the generation of a novel cell line expressing any desired Ad gene. Without limitation, such a parent cell line may be HeLa [ATCC Accession No. CCL 2], A549 [ATCC Accession No. CCL
185], KB [CCL 17], Detroit [e.g., Detroit 510, CCL 72] and WI-38 [CCL 75] cells, among others.
These cell lines are all available from the American Type Culture Collection, University Boulevard, Manassas, Virginia 20110-2209. Other suitable parent cell lines may be obtained from other sources.
Such E 1-expressing cell lines are useful in the generation of recombinant chimpanzee adenovirus El deleted vectors. Additionally, or alternatively, the invention provides cell lines that express one or more chimpanzee adenoviral gene products, e.g., Ela, Elb, E2a, and/or E4 ORF6, can be constructed using essentially the same procedures for use in the generation of recombinant chimpanzee viral vectors.
Such cell lines can be utilized to transcomplement adenovirus vectors deleted in the essential genes that encode those products, or to provide helper functions necessary for packaging of a helper-dependent virus (e.g., adeno-associated virus). The preparation of a host cell according to this invention involves techniques such as assembly of selected DNA sequences.
This assembly may be accomplished utilizing conventional techniques. Such techniques include cDNA and genomic cloning, which are well known and are described in Sambrook et al., cited above, use of overlapping oligonucleotide sequences of the adenovirus genomes, combined with polymerase chain reaction, synthetic methods, and any other suitable methods which provide the desired nucleotide sequence.
In still another alternative, the essential adenoviral gene products are provided in trans by the adenoviral vector and/or helper virus. In such an instance, a suitable host cell can be selected from any biological organism, including prokaryotic (e.g., bacterial) cells, and eukaryotic cells, including, insect cells, yeast cells and mammalian cells.
Particularly desirable host cells are selected from among any mammalian species, including, without limitation, cells such as A549, WEHI, 3T3, IOTI/2, 293 cells (which express functional adenoviral El), Saos, C2C12, L cells, HT1080, HepG2 and primary fibroblast, hepatocyte and myoblast cells derived from mammals including human, monkey, mouse, rat, rabbit, and hamster. The selection of the mammalian species providing the cells is not a limitation of this invention; nor is the type of mammalian cell, i.e., fibroblast, hepatocyte, tumor cell, etc.
3. Assembly of Viral Particle and Transfection of a Cell Line Generally, when delivering the vector comprising the minigene by transfection, the vector is delivered in an amount from about 5 g to about 100 .tg DNA, and preferably about 10 to about 50 g DNA to about 1 x 104 cells to about 1 x 1013 cells, and preferably about 105 cells. However, the relative amounts of vector DNA to host cells may be adjusted, taking into consideration such factors as the selected vector, the delivery method and the host cells selected.
The vector may be any vector known in the art or disclosed above, including naked DNA, a plasmid, phage, transposon, cosmids, viruses, etc.
Introduction into the host cell of the vector may be achieved by any means known in the art or as disclosed above, including transfection, and infection. One or more of the adenoviral genes may be stably integrated into the genome of the host cell, stably expressed as episomes, or expressed transiently. The gene products may all be expressed transiently, on an episome or stably integrated, or some of the gene products may be expressed stably while others are expressed transiently. Furthermore, the promoters for each of the adenoviral genes may be selected independently from a constitutive promoter, an inducible promoter or a native adenoviral promoter. The promoters may be regulated by a specific physiological state of the organism or cell (i.e., by the differentiation state or in replicating or quiescent cells) or by exogenously-added factors, for example.
Introduction of the molecules (as plasmids or viruses) into the host cell may also be accomplished using techniques known to the skilled artisan and as discussed throughout the specification. In preferred embodiment, standard transfection techniques are used, e.g., CaPO4 transfection or electroporation.
Assembly of the selected DNA sequences of the adenovirus (as well as the transgene and other vector elements into various intermediate plasmids, and the use of the plasmids and vectors to produce a recombinant viral particle are all achieved using conventional techniques. Such techniques include conventional cloning techniques of cDNA
such as those described in texts [Sambrook et al, cited above], use of overlapping oligonucleotide sequences of the adenovirus genomes, polymerase chain reaction, and any suitable method which provides the desired nucleotide sequence. Standard transfection and co-transfection techniques are employed, e.g., CaPO4 precipitation techniques.
Other conventional methods employed include homologous recombination of the viral genomes, plaquing of viruses in agar overlay, methods of measuring signal generation, and the like.
For example, following the construction and assembly of the desired minigene-containing viral vector, the vector is transfected in vitro in the presence of a helper virus into the packaging cell line. Homologous recombination occurs between the helper and the vector sequences, which permits the adenovirus-transgene sequences in the vector to be replicated and packaged into virion capsids, resulting in the recombinant viral vector particles. The current method for producing such virus particles is transfection-based.
However, the invention is not limited to such methods.
The resulting recombinant chimpanzee adenoviruses are useful in transferring a selected transgene to a selected cell. In in vivo experiments with the recombinant virus grown in the packaging cell lines, the E1-deleted recombinant chimpanzee adenoviral vectors of the invention demonstrate utility in transferring a transgene to a non-chimpanzee, preferably a human, cell.
IV. Use of Non-Viral C68 Proteins and C68-derived Adenoviruses The recombinant adenovirus vectors of the invention are useful for gene transfer to a human or non-chimpanzee veterinary patient in vitro, ex vivo, and in vivo. In addition, a variety of C68 proteins described herein are useful in non-viral targeting of transgenes, proteins, chemical molecules, and other moieties or molecules to cells.
Suitable methods of delivery and dosing regimens are readily determined based upon the targeted molecule and targeting protein. Examples of suitable genes and sources of proteins for protein-mediated delivery are provided in the sections below relating to viral delivery of therapeutic and immunogenic molecules. While the discussion below focuses on viral vectors, it will be appreciated that the C68-derived proteins of the invention may be formulated as described herein for the C68-derived viral vectors and the same routes of administration and regimens may be utilized.
The recombinant adenovirus vectors described herein can be used as expression vectors for the production of the products encoded by the heterologous genes in vitro. For example, the recombinant adenoviruses containing a gene inserted into the location of an El deletion may be transfected into an E1-expressing cell line as described above. Alternatively, replication-competent adenoviruses may be used in another selected cell line.
The transfected cells are then cultured in the conventional manner, allowing the recombinant adenovirus to express the gene product from the promoter. The gene product may then be recovered from the culture medium by known conventional methods of protein isolation and recovery from culture.
A C68-derived vector or C68-derived protein of the invention provides an efficient gene transfer vehicle that can deliver a selected transgene or other molecule to a selected host cell in vivo or ex vivo even where the organism has neutralizing antibodies to one or more AAV serotypes. In one embodiment, the rAAV and the cells are mixed ex vivo;
the infected cells are cultured using conventional methodologies; and the transduced cells are re-infused into the patient. These compositions are particularly well suited to gene delivery for therapeutic purposes and for immunization, including inducing protective immunity.
More commonly, the C68-derived vectors and C68-derived proteins of the invention will be utilized for delivery of therapeutic or immunogenic molecules, as described below. It will be readily understood for both applications, that the C68-derived constructs of the invention are useful for use in regimens involving single administrations, as well as in .regimens involving repeat delivery of adenoviral vectors or non-viral targeted delivery, or repeat delivery of the transgene or other molecule to the cells.
Such regimens typically involve delivery of a series of viral vectors in which the viral capsids are alternated. The viral capsids may be changed for each subsequent administration, or after a pre-selected number of administrations of a particular serotype capsid (e.g., one, two, three, four or more). For example, a regimen may involve delivery of a rAd with a C68-derived capsid and delivery with a rAd with another human or non-human primate adenovirus serotype. Optionally, these regimens may involve administration of rAd with capsids of other non-human primate adenoviruses, human adenoviruses, or artificial serotypes such as are described herein. Alternativley, the regimens involve administration of C68-derived proteins for non-viral targeting with repeat administrations of C68-derived proteins, or with other protein-based delivery systems. Each phase of these regimens can involve administration of a series of injections (or other delivery routes) with a single C68-derived construct followed by a series with another Ad serotype construct.
Alternatively, the C68-derived vectors and proteins of the invention may be utilized in regimens involving other non-adenoviral-mediated delivery systems, including other viral systems, non-viral delivery systems, protein, peptides, and other biologically active molecules.
The following sections will focus on exemplary molecules which may be delivered via the adenoviral vectors of the invention.
A. Ad-Mediated Delivery of Therapeutic Molecules In one embodiment, the above-described C68-derived constructs are administered to humans according to published methods for gene therapy. A C68-derived construct bearing a transgene can be administered to a patient, preferably suspended in a biologically compatible solution or pharmaceutically acceptable delivery vehicle. A suitable vehicle includes sterile saline. Other aqueous and non-aqueous isotonic sterile injection solutions and aqueous and non-aqueous sterile suspensions known to be pharmaceutically acceptable carriers and well known to those of skill in the art may be employed for this purpose.
The C68-derived adenoviral vectors are administered in sufficient amounts to transduce the target cells and to provide sufficient levels of gene transfer and expression to provide a therapeutic benefit without undue adverse or with medically acceptable physiological effects, which can be determined by those skilled in the medical arts.
Conventional and pharmaceutically acceptable routes of administration include, but are not limited to, direct delivery to the retina and other intraocular delivery methods, direct delivery to the liver, intranasal, intravenous, intramuscular, intratracheal, subcutaneous, intradermal, rectal, oral and other parenteral routes of administration. Routes of administration may be combined, if desired, or adjusted depending upon the transgene or the condition. The route of administration primarily will depend on the nature of the condition being treated.
Dosages of the viral vector will depend primarily on factors such as the condition being treated, the age, weight and health of the patient, and may thus vary among patients. For example, a therapeutically effective adult human or veterinary dosage of the viral vector is generally in the range of from about 100 L to about 100 mL of a carrier containing concentrations of from about 1 x 106 to about 1 x 1015 particles, about 1 x 1011 to 1 x 1013 particles, or about 1 x 109 to IX 1012 particles virus. Dosages will range depending upon the size of the animal and the route of administration. For example, a suitable human or veterinary dosage (for about an 80 kg animal) for intramuscular injection is in the range of about 1 x 109 to about 5 x 1012 particles per mL, for a single site.
Optionally, multiple sites of administration may be delivered. In another example, a suitable human or veterinary dosage may be in the range of about 1 x 1011 to about 1 x 1015 particles for an oral formulation.
When C68 proteins of the invention are utilized for targeted delivery, suitable dosage ranges, a therapeutically effective adult human or veterinary dosage of the construct is generally in the range of from about 100 L to about 100 mL of a carrier containing concentrations of from about 0.01 g to about 100 mg protein, about 0.1 g to about 10 mg, about I pg to about I mg protein. Dosages will range depending upon the size of the animal and the route of administration. Routes of administration may be readily selected from any suitable route including, without limitation, the routes described above.
One of skill in the art may adjust these doses, depending the route of administration, and the therapeutic or vaccinal application for which the C68-derived construct is employed. The levels of expression of the transgene, or for an immunogen, the level of circulating antibody, can be monitored to determine the frequency of dosage administration. Yet other methods for determining the timing of frequency of administration will be readily apparent to one of skill in the art.
An optional method step involves the co-administration to the patient, either concurrently with, or before or after administration of the C68-derived construct, of a suitable amount of a short acting immune modulator. The selected immune modulator is defined herein as an agent capable of inhibiting the formation of neutralizing antibodies directed against the recombinant vector of this invention or capable of inhibiting cytolytic T
lymphocyte (CTL) elimination of the vector. The immune modulator may interfere with the interactions between the T helper subsets (TH} or TH2) and B cells to inhibit neutralizing antibody formation. Alternatively, the immune modulator may inhibit the interaction between THI cells and CTLs to reduce the occurrence of CTL elimination of the vector. A
variety of useful immune modulators and dosages for use of same are disclosed, for example, in Yang et al., J. Virol., 70(9) (Sept., 1996); International Patent Application No.
W096/12406, published May 2, 1996; and International Patent Application No. PCT/US96/03035.
1. Therapeutic Transgenes Useful therapeutic products encoded by the transgene include hormones and growth and differentiation factors including, without limitation, insulin, glucagon, growth hormone (GH), parathyroid hormone (PTH), growth hormone releasing factor (GRF), follicle stimulating hormone (FSH), luteinizing hormone (LH), human chorionic gonadotropin (hCG), vascular endothelial growth factor (VEGF), angiopoietins, angiostatin, granulocyte colony stimulating factor (GCSF), erythropoietin (EPO), connective tissue growth factor (CTGF), basic fibroblast growth factor (bFGF), acidic fibroblast growth factor (aFGF), epidermal growth factor (EGF), transforming growth factor (TGF), platelet-derived growth factor (PDGF), insulin growth factors I and II (IGF-I and IGF-II), any one of the transforming growth factor superfamily, including TGF, activins, inhibins, or any of the bone morphogenic proteins (BMP) BMPs 1-15, any one of the heregluin/neuregulin/ARIA/neu differentiation factor (NDF) family of growth factors, nerve growth factor (NGF), brain-derived neurotrophic factor (BDNF), neurotrophins NT-3 and NT-4/5, ciliary neurotrophic factor (CNTF), glial cell line derived neurotrophic factor (GDNF), neurturin, agrin, any one of the family of semaphorins/collapsins, netrin-1 and netrin-2, hepatocyte growth factor (HGF), ephrins, noggin, sonic hedgehog and tyrosine hydroxylase.
Other useful transgene products include proteins that regulate the immune system including, without limitation, cytokines and lymphokines such as thrombopoietin (TPO), interleukins (IL) IL-1 through IL-18, monocyte chemoattractant protein, leukemia inhibitory factor, granulocyte-macrophage colony stimulating factor, Fas ligand, tumor necrosis factors and, interferons, and, stem cell factor, flk-2/flt3 ligand. Gene products produced by the immune system are also useful in the invention. These include, without limitations, immunoglobulins IgG, IgM, IgA, IgD and IgE, chimeric immunoglobulins, humanized antibodies, single chain antibodies, T cell receptors, chimeric T
cell receptors, single chain T cell receptors, class I and class II MHC
molecules, as well as engineered immunoglobulins and MHC molecules. Useful gene products also include complement regulatory proteins such as complement regulatory proteins, membrane cofactor protein (MCP), decay accelerating factor (DAF), CR1, CF2 and CD59.
Still other useful gene products include any one of the receptors for the hormones, growth factors, cytokines, lymphokines, regulatory proteins and immune system proteins. The invention encompasses receptors for cholesterol regulation, including the low density lipoprotein (LDL) receptor, high density lipoprotein (HDL) receptor, the very low density lipoprotein (VLDL) receptor, and the scavenger receptor. The invention also encompasses gene products such as members of the steroid hormone receptor superfamily including glucocorticoid receptors and estrogen receptors, Vitamin D receptors and other nuclear receptors. In addition, useful gene products include transcription factors such as jun, fos, max, mad, serum response factor (SRF), AP-1, AP2, myb, MyoD and myogenin, ETS-box containing proteins, TFE3, E2F, ATF1, ATF2, ATF3, ATF4, ZF5, NFAT, CREB, HNF-4, C/EBP, SP 1, CCAAT-box binding proteins, interferon regulation factor (IRF-1), Wilms tumor protein, ETS-binding protein, STAT, GATA-box binding proteins, e.g., GATA-3, and the forkhead family of winged helix proteins.
Other useful gene products include, carbamoyl synthetase I, ornithine transcarbamylase, arginosuccinate synthetase, arginosuccinate lyase, arginase, fumarylacetacetate hydrolase, phenylalanine hydroxylase, alpha-1 antitrypsin, glucose-6-phosphatase, porphobilinogen deaminase, factor VIII, factor IX, cystathione beta-synthase, branched chain ketoacid decarboxylase, albumin, isovaleryl-coA dehydrogenase, propionyl CoA carboxylase, methyl malonyl CoA mutase, glutaryl CoA dehydrogenase, insulin, beta-glucosidase, pyruvate carboxylate, hepatic phosphorylase, phosphorylase kinase, glycine decarboxylase, H-protein, T-protein, a cystic fibrosis transmembrane regulator (CFTR) sequence, and a dystrophin cDNA sequence.
Other useful gene products include non-naturally occurring polypeptides, such as chimeric or hybrid polypeptides having a non-naturally occurring amino acid sequence containing insertions, deletions or amino acid substitutions.
For example, single-chain engineered immunoglobulins could be useful in certain immunocompromised patients. Other types of non-naturally occurring gene sequences include antisense molecules and catalytic nucleic acids, such as ribozymes, which could be used to reduce overexpression of a target.
Reduction and/or modulation of expression of a gene are particularly desirable for treatment of hyperproliferative conditions characterized by hyperproliferating cells, as are cancers and psoriasis. Target polypeptides include those polypeptides which are produced exclusively or at higher levels in hyperproliferative cells as compared to normal cells. Target antigens include polypeptides encoded by oncogenes such as myb, myc, fyn, and the translocation gene bcr/abl, ras, src, P53, neu, trk and EGRF. In addition to oncogene products as target antigens, target polypeptides for anti-cancer treatments and protective regimens include variable regions of antibodies made by B cell lymphomas and variable regions of T cell receptors of T cell lymphomas which, in some embodiments, are also used as target antigens for autoimmune disease. Other tumor-associated polypeptides can be used as target polypeptides such as polypeptides which are found at higher levels in tumor cells including the polypeptide recognized by monoclonal antibody 17-1A and folate binding polypeptides. Such target polypeptides and their ligands are also useful in forming fusion partners with a C68 protein of the invention.
Other suitable therapeutic polypeptides and proteins include those which may be useful for treating individuals suffering from autoimmune diseases and disorders by conferring a broad based protective immune response against targets that are associated with autoimmunity including cell receptors and cells which produce self-directed antibodies. T cell mediated autoimmune diseases include Rheumatoid arthritis (RA), multiple sclerosis (MS), Sjogren's syndrome, sarcoidosis, insulin dependent diabetes mellitus (IDDM), autoimmune thyroiditis, reactive arthritis, ankylosing spondylitis, scleroderma, polymyositis, dermatomyositis, psoriasis, vasculitis, Wegener's granulomatosis, Crohn's disease and ulcerative colitis. Each of these diseases is characterized by T
cell receptors (TCRs) that bind to endogenous antigens and initiate the inflammatory cascade associated with autoimmune diseases.
The C68-derived constructs of the invention are particularly well suited for therapeutic regimens in which multiple deliveries of transgenes is desired, e.g., in regimens involving redelivery of the same transgene or in combination regimens involving delivery of other transgenes. Such regimens may involve administration of a C68-derived construct, followed by re-administration with a vector from the same serotype adenovirus.
Particularly desirable regimens involve administration of a C68-derived construct of the invention, in which the serotype of the viral vector delivered in the first administration differs from the serotype of the viral vector utilized in one or more of the subsequent administrations. For example, a therapeutic regimen involves administration of a C68-derived vector and repeat administration with one or more adenoviral vectors of the same or different serotypes. In another example, a therapeutic regimen involves administration of an adenoviral vector followed by repeat administration with a C68-derived vector of the invention which differs from the serotype of the first delivered adenoviral vector, and optionally further administration with another vector which is the same or, preferably, differs from the serotype of the vector in the prior administration steps. These regimens are not limited to delivery of adenoviral vectors constructed using the C68-derived capsids of the invention. Rather, these regimens can readily utilize constructs, including non-viral targeting proteins and viral vectors, from other adenoviral serotypes, including, without limitation, other chimpanzee adenoviral serotypes (e.g., Cl, etc), other non-human primate adenoviral serotypes, or human adenoviral serotypes, in combination with one or more of the C68-derived constructs of the invention. Examples of such chimpanzee, other non-human primate and human adenoviral serotypes are discussed elsewhere in this document.
Further, these therapeutic regimens may involve either simultaneous or sequential delivery of C68-derived constructs of the invention in combination with non-adenoviral vectors, non-viral vectors, and/or a variety of other therapeutically useful compounds or molecules. The present invention is not limited to these therapeutic regimens, a variety of which will be readily apparent to one of skill in the art.
B. Ad-Mediated Delivery of Immunogenic Transgenes The C68-derived constructs of the invention, including viral vectors and proteins, may also be employed as immunogenic compositions. As used herein, an immunogenic composition is a composition to which a humoral (e.g., antibody) or cellular (e.g., a cytotoxic T cell) response is mounted to a transgene product delivered by the immunogenic composition following delivery to a mammal, and preferably a primate. The present invention provides a recombinant C68-derived Ad that can contain in any of its adenovirus sequence deletions a gene encoding a desired immunogen, or a C68 protein capable of targeting an immunogenic molecule. The C68-derived adenovirus is well suited for use as a live recombinant virus vaccine in different animal species compared to an adenovirus of human origin, but is not limited to such a use. The recombinant adenoviruses and C68 proteins can be used as prophylactic or therapeutic vaccines against any pathogen for which the antigen(s) crucial for induction of an immune response and able to limit the spread of the pathogen has been identified and for which the cDNA is available.
Such vaccinal (or other immunogenic) compositions are formulated in a suitable delivery vehicle, as described above. Generally, doses for the immunogenic compositions are in the range defined above for therapeutic compositions. The levels of immunity of the selected gene can be monitored to determine the need, if any, for boosters.
Following an assessment of antibody titers in the serum, optional booster immunizations may be desired.
Optionally, a vaccinal composition of the invention may be formulated to contain other components, including, e.g. adjuvants, stabilizers, pH
adjusters, preservatives and the like. Such components are well known to those of skill in the vaccine art. Examples of suitable adjuvants include, without limitation, liposomes, alum, monophosphoryl lipid A, and any biologically active factor, such as cytokine, an interleukin, a chemokine, a ligands, and optimally combinations thereof. Certain of these biologically active factors can be expressed in vivo, e.g., via a polynucleotide, plasmid or viral vector. For example, such an adjuvant can be administered with a priming DNA vaccine encoding an antigen to enhance the antigen-specific immune response compared with the immune response generated upon priming with a DNA vaccine encoding the antigen only.
The recombinant adenoviruses are administered in a "an immunogenic amount", that is, an amount of recombinant adenovirus that is effective in a route of administration to transfect the desired cells and provide sufficient levels of expression of the selected gene to induce an immune response. Where protective immunity is provided, the recombinant adenoviruses are considered to be vaccine compositions useful in preventing infection and/or recurrent disease.
Alternatively, or in addition, the vectors of the invention may contain, or capsid or other protein can be utilized to target a transgene encoding a peptide, polypeptide or protein which induces an immune response to a selected immunogen. The C68-derived viruses of this invention are expected to be highly efficacious at inducing cytolytic T cells and antibodies to the inserted heterologous antigenic protein expressed by the vector.
1. Immunogenic Transgenes For example, immunogens may be selected from a variety of viral families. Example of desirable viral families against which an immune response would be desirable include, the picornavirus family, which includes the genera rhinoviruses, which are responsible for about 50% of cases of the common cold; the genera enteroviruses, which include polioviruses, coxsackieviruses, echoviruses, and human enteroviruses such as hepatitis A virus; and the genera apthoviruses, which are responsible for foot and mouth diseases, primarily in non-human animals. Within the picornavirus family of viruses, target antigens include the VP1, VP2, VP3, VP4, and VPG. Another viral family includes the calcivirus family, which encompasses the Norwalk group of viruses, which are an important causative agent of epidemic gastroenteritis. Still another viral family desirable for use in targeting antigens for inducing immune responses in humans and non-human animals is the togavirus family, which includes the genera alphavirus, which include Sindbis viruses, RossRiver virus, and Venezuelan, Eastern & Western Equine encephalitis, and rubivirus, including Rubella virus. The flaviviridae family includes dengue, yellow fever, Japanese encephalitis, St. Louis encephalitis and tick borne encephalitis viruses.
Other target antigens may be generated from the Hepatitis C or the coronavirus family, which includes a number of non-human viruses such as infectious bronchitis virus (poultry), porcine transmissible gastroenteric virus (pig), porcine hemagglutinating encephalomyelitis virus (pig), feline infectious peritonitis virus (cats), feline enteric coronavirus (cat), canine coronavirus (dog), and human respiratory coronaviruses, which may cause the common cold and/or non-A, B or C hepatitis. Within the coronavirus family, target antigens include the El (also called M or matrix protein), E2 (also called S or Spike protein), E3 (also called HE or hemagglutin-elterose) glycoprotein (not present in all coronaviruses), or N
(nucleocapsid).
Still other antigens may be targeted against the rhabdovirus family, which includes the genera vesiculovirus (e.g., Vesicular Stomatitis Virus), and the general lyssavirus (e.g., rabies).
Within the rhabdovirus family, suitable antigens may be derived from the G
protein or the N
protein. The family filoviridae, which includes hemorrhagic fever viruses such as Marburg and Ebola virus may be a suitable source of antigens. The paramyxovirus family includes parainfluenza Virus Type 1, parainfluenza Virus Type 3, bovine parainfluenza Virus Type 3, rubulavirus (mumps virus), parainfluenza Virus Type 2, parainfluenza virus Type 4, Newcastle disease virus (chickens), rinderpest, morbillivirus, which includes measles and canine distemper, and pneumovirus, which includes respiratory syncytial virus.
The influenza virus is classified within the family orthomyxovirus and is a suitable source of antigen (e.g., the HA protein, the N 1 protein). The bunyavirus family includes the genera bunyavirus (California encephalitis, La Crosse), phlebovirus (Rift Valley Fever), hantavirus (puremala is a hemahagin fever virus), nairovirus (Nairobi sheep disease) and various unassigned bungaviruses. The arenavirus family provides a source of antigens against LCM
and Lassa fever virus. The reovirus family includes the genera reovirus, rotavirus (which causes acute gastroenteritis in children), orbiviruses, and cultivirus (Colorado Tick fever, Lebombo (humans), equine encephalosis, blue tongue).
The retrovirus family includes the sub-family oncorivirinal which encompasses such human and veterinary diseases as feline leukemia virus, HTLVI
and HTLVII, lentivirinal (which includes human immunodeficiency virus (HIV), simian immunodeficiency virus (SIV), feline immunodeficiency virus (FIV), equine infectious anemia virus, and spumavirinal). Among the lentiviruses, many suitable antigens have been described and can readily be selected. Examples of suitable HIV and SIV
antigens include, without limitation the gag, pol, Vif, Vpx, VPR, Env, Tat, Nef, and Rev proteins, as well as various fragments thereof. For example, suitable fragments of the Env protein may include any of its subunits such as the gp120, gp160, gp41, or smaller fragments thereof, e.g., of at least about 8 amino acids in length. Similarly, fragments of the tat protein may be selected.
[See, US Patent 5,891,994 and US Patent 6,193,981.] See, also, the HIV and SIV
proteins described in D.H. Barouch et al, J. Virol., 75(5):2462-2467 (March 2001), and R.R. Amara, et al, Science, 292:69-74 (6 April 2001). In another example, the HIV and/or SIV
immunogenic proteins or peptides may be used to form fusion proteins or other immunogenic molecules.
See, e.g., the HIV-1 Tat and/or Nef fusion proteins and immunization regimens described in WO 01/54719, published August 2, 2001, and WO 99/16884, published April 8, 1999. The invention is not limited to the HIV and/or SIV immunogenic proteins or peptides described herein. In addition, a variety of modifications to these proteins have been described or could 3o readily be made by one of skill in the art. See, e.g., the modified gag protein that is described in US Patent 5,972,596. Further, any desired HIV and/or SIV immunogens may be delivered alone or in combination. Such combinations may include expression from a single vector or from multiple vectors. Optionally, another combination may involve delivery of one or more expressed immunogens with delivery of one or more of the immunogens in protein form.
Such combinations are discussed in more detail below.
The papovavirus family includes the sub-family polyomaviruses (BKU
and JCU viruses) and the sub-family papillomavirus (associated with cancers or malignant progression of papilloma). The adenovirus family includes viruses (EX, AD7, ARD, O.B.) which cause respiratory disease and/or enteritis. The parvovirus family feline parvovirus (feline enteritis), feline panleucopeniavirus, canine parvovirus, and porcine parvovirus. The herpesvirus family includes the sub-family alphaherpesvirinae, which encompasses the genera simplexvirus (HSVI, HSVII), varicellovirus (pseudorabies, varicella zoster) and the sub-family betaherpesvirinae, which includes the genera cytomegalovirus (HCMV, muromegalovirus) and the sub-family gammaherpesvirinae, which includes the genera lymphocryptovirus, EBV (Burkitts lymphoma), infectious rhinotracheitis, Marek's disease virus, and rhadinovirus. The poxvirus family includes the sub-family chordopoxvirinae, which encompasses the genera orthopoxvirus (Variola (Smallpox) and Vaccinia (Cowpox)), parapoxvirus, avipoxvirus, capripoxvirus, leporipoxvirus, suipoxvirus, and the sub-family entomopoxvirinae. The hepadnavirus family includes the Hepatitis B virus. One unclassified virus which may be suitable source of antigens is the Hepatitis delta virus.
Still other viral sources may include avian infectious bursal disease virus and porcine respiratory and reproductive syndrome virus. The alphavirus family includes equine arteritis virus and various Encephalitis viruses.
The present invention may also encompass immunogens which are useful to immunize a human or non-human animal against other pathogens including bacteria, fungi, parasitic microorganisms or multicellular parasites which infect human and non-human vertebrates, or from a cancer cell or tumor cell. Examples of bacterial pathogens include pathogenic gram-positive cocci include pneumococci; staphylococci; and streptococci.
Pathogenic gram-negative cocci include meningococcus; gonococcus. Pathogenic enteric gram-negative bacilli include enterobacteriaceae; pseudomonas, acinetobacteria and eikenella; melioidosis; salmonella; shigella; haemophilus; moraxella; H.
ducreyi (which causes chancroid); brucella; Franisella tularensis (which causes tularemia);
yersinia (pasteurella); streptobacillus moniliformis and spirillum; Gram-positive bacilli include listeria monocytogenes; erysipelothrix rhusiopathiae; Corynebacterium diphtheria (diphtheria);
cholera; B. anthracis (anthrax); donovanosis (granuloma inguinale); and bartonellosis.
Diseases caused by pathogenic anaerobic bacteria include tetanus; botulism;
other clostridia;
tuberculosis; leprosy; and other mycobacteria. Pathogenic spirochetal diseases include syphilis; treponematoses: yaws, pinta and endemic syphilis; and leptospirosis.
Other infections caused by higher pathogen bacteria and pathogenic fungi include actinomycosis;
nocardiosis; cryptococcosis, blastomycosis, histoplasmosis and coccidioidomycosis;
candidiasis, aspergillosis, and mucormycosis; sporotrichosis;
paracoccidiodomycosis, petriellidiosis, torulopsosis, mycetoma and chromomycosis; and dermatophytosis. Rickettsial infections include Typhus fever, Rocky Mountain spotted fever, Q fever, and Rickettsialpox.
Examples of mycoplasma and chlamydial infections include: mycoplasma pneumoniae;
lymphogranuloma venereum; psittacosis; and perinatal chlamydial infections.
Pathogenic eukaryotes encompass pathogenic protozoans and helminths and infections produced thereby include: amebiasis; malaria; leishmaniasis; trypanosomiasis; toxoplasmosis;
Pneumocystis carinii; Trichans; Toxoplasma gondii; babesiosis; giardiasis; trichinosis;
filariasis;
schistosomiasis; nematodes; trematodes or flukes; and cestode (tapeworm) infections.
Many of these organisms and/or toxins produced thereby have been identified by the Centers for Disease Control [(CDC), Department of Heath and Human Services, USA], as agents which have potential for use in biological attacks.
For example, some of these biological agents, include, Bacillus anthracis (anthrax), Clostridium botulinum and its toxin (botulism), Yersiniapestis (plague), variola major (smallpox), Francisella tularensis (tularemia), and viral hemorrhagic fever, all of which are currently classified as Category A agents; Coxiella burnetti (Q fever); Brucella species (brucellosis), Burkholderia mallei (glanders), Ricinus communis and its toxin (ricin toxin), Clostridium perfringens and its toxin (epsilon toxin), Staphylococcus species and their toxins (enterotoxin B), all of which are currently classified as Category B agents; and Nipan virus, multidrug-resistant tuberculosis, yellow fever, tickborne hemorrhagic fever viruses, tickborne encephalitis viruses, and hantaviruses, which are currently classified as Category C
agents. In addition, other organisms, which are so classified or differently classified, may be identified and/or used for such a purpose in the future. It will be readily understood that the viral vectors and other constructs described herein are useful to deliver antigens from these organisms, viruses, their toxins or other by-products, which will prevent and/or treat infection or other adverse reactions with these biological agents.
Administration of the vectors and proteins of the invention to deliver immunogens against the variable region of the T cells elicit an immune response including CTLs to eliminate those T cells. In RA, several specific variable regions of TCRs which are involved in the disease have been characterized. These TCRs include V-3, V-14, V-17 and Va-17. Thus, delivery of a nucleic acid sequence that encodes at least one of these polypeptides will elicit an immune response that will target T cells involved in RA. In MS, several specific variable regions of TCRs which are involved in the disease have been characterized. These TCRs include V-7 and Va-10. Thus, delivery of a nucleic acid sequence that encodes at least one of these polypeptides will elicit an immune response that will target T cells involved in MS. In scleroderma, several specific variable regions of TCRs which are involved in the disease have been characterized. These TCRs include V-6, V-8, V-14 and Va-16, Va-3C, Va-7, Va-14, Va-15, Va-16, Va-28 and Va-12. Thus, delivery of a recombinant chimpanzee adenovirus that encodes at least one of these polypeptides will elicit an immune response that will target T cells involved in scleroderma.
C. Ad-Mediated Delivery Methods The therapeutic levels, or levels of immunity, of the selected gene can be monitored to determine the need, if any, for boosters. Following an assessment of CD8+ T
cell response, or optionally, antibody titers, in the serum, optional booster immunizations may be desired. Optionally, the C68-derived constructs of the invention may be delivered in a single administration or in various combination regimens, e.g., in combination with a regimen or course of treatment involving other active ingredients or in a prime-boost regimen. A
variety of such regimens have been described in the art and may be readily selected.
For example, prime-boost regimens may involve the administration of a DNA
(e.g., plasmid) based vector to prime the immune system to second, booster, administration with a traditional antigen, such as a protein or a recombinant virus carrying the sequences encoding such an antigen. See, e.g., WO 00/11140, published March 2, 2000.
Alternatively, an immunization regimen may involve the administration of a recombinant chimpanzee adenoviral vector of the invention to boost the immune response to a vector (either viral or DNA-based) carrying an antigen, or a protein. In still another alternative, an immunization regimen involves administration of a protein followed by booster with a vector encoding the antigen.
In one embodiment, the invention provides a method of priming and boosting an immune response to a selected antigen by delivering a plasmid DNA vector carrying said antigen, followed by boosting with a recombinant chimpanzee adenoviral vector of the invention. In one embodiment, the prime-boost regimen involves the expression of multiproteins from the prime and/or the boost vehicle. See, e.g., R.R. Amara, Science, 292:69-74 (6 April 2001) which describes a multiprotein regimen for expression of protein subunits useful for generating an immune response against HIV and SIV. For example, a DNA prime may deliver the Gag, Pol, Vif, VPX and Vpr and Env, Tat, and Rev from a single transcript. Alternatively, the SIV Gag, Pol and HIV-l Env is delivered in a recombinant adenovirus construct of the invention. Still other regimens are described in and WO 01/54719.
However, the prime-boost regimens are not limited to immunization for HIV
or to delivery of these antigens. For example, priming may involve delivering with a first chimp vector of the invention followed by boosting with a second chimp vector, or with a composition containing the antigen itself in protein form. In one example, the prime-boost regimen can provide a protective immune response to the virus, bacteria or other organism from which the antigen is derived. In another desired embodiment, the prime-boost regimen provides a therapeutic effect that can be measured using convention assays for detection of the presence of the condition for which therapy is being administered.
The priming composition may be administered at various sites in the body in a dose dependent manner, which depends on the antigen to which the desired immune response is being targeted. The invention is not limited to the amount or situs of injection(s) or to the pharmaceutical carrier. Rather, the regimen may involve a priming and/or boosting step, each of which may include a single dose or dosage that is administered hourly, daily, weekly or monthly, or yearly. As an example, the mammals may receive one or two doses containing between about 10 g to about 50 g of plasmid in carrier. A desirable amount of a DNA
composition ranges between about 1 .tg to about 10,000 .tg of the DNA vector.
Dosages may vary from about 1 g to 1000 g DNA per kg of subject body weight. The amount or site of delivery is desirably selected based upon the identity and condition of the mammal.
The dosage unit of the vector suitable for delivery of the antigen to the mammal is described herein. The vector is prepared for administration by being suspended or dissolved in a pharmaceutically or physiologically acceptable carrier such as isotonic saline;
isotonic salts solution or other formulations that will be apparent to those skilled in such administration. The appropriate carrier will be evident to those skilled in the art and will depend in large part upon the route of administration. The compositions of the invention may be administered to a mammal according to the routes described above, in a sustained release formulation using a biodegradable biocompatible polymer, or by on-site delivery using micelles, gels and liposomes. Optionally, the priming step of this invention also includes administering with the priming composition, a suitable amount of an adjuvant, such as are defined herein.
Preferably, a boosting composition is administered about 2 to about 27 weeks after administering the priming composition to the mammalian subject. The administration of the boosting composition is accomplished using an effective amount of a boosting composition containing or capable of delivering the same antigen as administered by the priming DNA vaccine. The boosting composition may be composed of a recombinant viral vector derived from the same viral source (e.g., adenoviral sequences of the invention) or from another source. Alternatively, the "boosting composition" can be a composition containing the same antigen as encoded in the priming DNA vaccine, but in the form of a protein or peptide, which composition induces an immune response in the host.
In another embodiment, the boosting composition contains a DNA sequence encoding the antigen under the control of a regulatory sequence directing its expression in a mammalian cell, e.g., vectors such as well-known bacterial or viral vectors. The primary requirements of the boosting composition are that the antigen of the composition is the same antigen, or a cross-reactive antigen, as that encoded by the priming composition.
In another embodiment, the chimpanzee adenoviral vectors and C68 targeting proteins of the invention are also well suited for use in a variety of other immunization and therapeutic regimens. Such regimens may involve delivery of C68 constructs of the invention simultaneously or sequentially with Ad constructs of different serotype capsids, regimens in which C68-derived constructs of the invention are delivered simultaneously or sequentially with non-Ad vectors, regimens in which the adenoviral vectors of the invention are delivered simultaneously or sequentially with proteins, peptides, and/or other biologically useful therapeutic or immunogenic compounds. Such uses will be readily apparent to one of skill in the art.
V. Method for Rapid Screening of Bacterial Transformants An elegant selection method is provided by the present invention, which permits the rapid screening of constructs produced by homologous recombination or direct cloning methods. As used herein, these constructs are preferably viruses, but may include other types of vectors, such as a cosmid, episome, plasmid, or other genetic element that delivers a heterologous molecule to cells.
In one desired embodiment, the method utilizes the gene encoding green fluorescent protein (GFP), to provide a green-white selection method in which the presence of a recombinant is detected by the absence of GFP expression (i.e., the recombinants are observed as white in a green background). Alternatively, the method may utilize another suitable marker genes, including, without limitation, other fluorescent proteins and luciferase.
In one example, the method is used for production of a recombinant construct from homologous recombination of co-transected vectors into a selected host cell.
As used herein, a host cell may be readily selected from an biological organism, including prokaryotic and eukaryotic cells, such as those discussed in the section related to production of a recombinant viral particle. Selection of the host cell is not a limitation of the present invention.
Suitably, each of the vectors contains the marker gene (e.g., GFP) under the control of a promoter that directs expression thereof in a host cell. Alternatively, each of the parental vectors may contain a different marker gene that allows them to be distinguished not only from the recombinant construct produced, but also from each other. Preferably, where prokaryotic GFP is utilized, it is under the control of a prokaryotic promoter such as the promoter from lacZ. However, other suitable prokaryotic or non-prokaryotic promoters may be readily selected from among the promoters described herein and known to those of skill in the art. Advantageously, the GFP protein is placed in the portion of the vectors that are eliminated during homologous recombination and thus, the GFP protein is absent from the recombinant vector produced. In this manner, the presence of unrecombined parental vectors are readily detected under a phase contrast fluorescent microscope (or other suitable detection means) as expressing the marker gene and the recombinant constructs lack expression of the marker. In the methods in which both parent vectors utilize GFP, the recombinant appears as white in a background of green.
In another example, the method is used for production of a recombinant construct involving homologous recombination, in which the host cell stably contains at least one of the parental constructs to be utilized for production of the recombinant construct. In this embodiment, the host cell can be subjected to a single transfection. In still other embodiments, the method of the invention may be utilized for triple transfections. As with the double transfection described above, the parental constructs may contain the same marker gene or may contain different marker genes.
In another example, the method of the invention is used from production of a recombinant construct by direct cloning. Suitably, in this embodiment, the marker gene is present is that portion of the parent construct which is deleted during the cloning process. For example, the marker gene expression cassette (i.e., the gene, promoter, and any other necessary regulatory sequences) is engineered into the E1- or E3-region of an adenoviral vector, into which a transgene or minigene cassette will be cloned. The success of direct cloning into the target region can be readily detected by the absence of marker gene expression.
Optionally, the method of the invention can be readily assembled in the form of a kit which is available in a commercially useful format for production of recombinant constructs, 3o e.g., recombinant adenoviruses. Typically, such kits will include plasmid backbones containing a desired viral genome containing a marker gene inserted at a point upstream or downstream of the recombination site, as appropriate, or a plasmid backbone containing the marker gene inserted at the splice site for direct cloning of a heterologous gene. Such a kit can further include appropriate culture media, host cells, a test control, instructions, and other suitable materials.
In the examples below, this method is used in production of adenoviruses.
However, it will be readily understood that this method may be readily adapted for use in generating other types of adenoviral, or non-adenoviral viral vectors.
The following examples are provided to illustrate the invention and do not limit the scope thereof One skilled in the art will appreciate that although specific reagents and conditions are outlined in the following examples, modifications can be made which are meant to be encompassed by the spirit and scope of the invention.
Example 1 - Creation of an El deleted vector based on Chimpanzee Adenovirus C68 Using Green-white Selection Of Recombinants A replication defective version of C68 was isolated for use in gene transfer.
The classic strategy of creating a recombinant with El deleted, by homologous recombination in an E 1 expressing cell line was pursued. The first step was creation of a plasmid containing m.u. 0 through 1.3 followed by addition of a minigene expressing enhanced green fluorescent protein (GFP) from a CMV promoter and C68 sequence spanning 9-16.7 m.u. This linearized plasmid was cotransfected into an E 1 expressing cell line with Ssp I-digested C68 plasmid (SspI cuts at 3.6 m.u. leaving 4644 bp for homologous recombination).
Experiments were initially conducted with 293 cells which harbor E 1 from human Ad5 with the hope that this would suffice for transcomplementation. Indeed, plaques formed which represented the desired recombinant. The resulting vector was called C68-CMV-GFP.
The strategy for generating recombinants was modified to enable efficient and rapid isolation of recombinants. First, the alkaline phosphatase DNA in the initial shuttle vector was replaced with a prokaryotic GFP gene driven by the prokaryotic promoter from lacZ.
This allowed efficient screening of bacterial transformations when attempting to incorporate a desired eukaryotic RNA po1 II transcriptional unit into the shuttle vector.
The resulting transformation can be screened for expression of GFP; white colonies are recombinants while green colonies are residual parental plasmid.
A green-white selection has been used to screen the products of cotransfection for the isolation of human Ad5 recombinants (A.R. Davis et al, Gene Thera., 5:1148-1152 (1998)).
In the present system, and in contrast to Davis, the initial shuttle vector was revised to include extended 3' sequences from 9 to 26 MU. This vector was cotransfected with viral DNA from the original C68-CMV-GFP isolate that had been restricted with Xba I, which cuts at MU
16.5 allowing for 9.5 Kb of overlap for homologous recombination. The resulting plaques were screened under a phase contrast fluorescent microscope for non-fluorescing isolates that represent the desired recombinants. This greatly simplified screening in comparison to the standard methods based on structure or transgene expression. Thus, this method may be readily adapted for use in generating other types of adenoviral, or non-adenoviral viral vectors.
A. Shuttle Plasmid To construct a plasmid shuttle vector for creation of recombinant C68 virus, the plasmid pSP72 (Promega, Madison, WI) was modified by digestion with Bgl II
followed by filling-in of the ends with Klenow enzyme (Boehringer Mannheim, Indianapolis, IN) and ligation with a synthetic 12 bp Pac I linker (New England Biolabs, Beverly, MA) to yield pSP72-Pac. A 456 bp Pac I/SnaB I fragment spanning map unit (m.u. or MU) 0-1.3 of the C68 genome was isolated from the pNEB-BamE plasmid containing BamHI E fragment of the C68 genome and cloned into Pac I and EcoR V treated pSP72-Pac to yield pSP-0-1.3. A minigene cassette consisting of the cytomegalovirus early promoter driving lacZ with a SV40 poly A signal was separated from pCMV(3 (Clontech, Palo Alto, CA) as a 4.5 kb EcoRI/SaII fragment and ligated to pSP-C68-MU 0-1.3 restricted with the same set of enzymes, resulting in pSP-C68-MU 0-1.3-CMVLacZ.
For the initial step in the isolation of the 9-16.7 MU region of C68, both pGEM-3Z (Promega, Madison, MI) and pBS-C68-BamF were double-digested with BamHI
and Sph I enzymes. Then the 293 bp fragment from pBS-C68-BamF was ligated with pGEM-3Z backbone to form pGEM-C68-MU 9-9.8. A 2.4 kb fragment including the C68 MU
9.8-16.7 was obtained from the pBS-C68 BamHB clone after Xbal digestion, filling in reaction and subsequent BamHI treatment and cloned into BamHI/Smal double digested pGEM-MU 9-9.8 to generate pGEM-C68-MU 9-16.7. The C68 9-16.7 m.u. region was isolated from pGEM-C68-MU 9-16.7 by digestion with EcoRl, filling in of the ends with Klenow enzyme (Boehringer Mannheim, Indianapolis, IN), ligation of a synthetic 12 bp HindI1l linker (NEB) and then digestion with HindIII. This 2.7 kb fragment spanning the C68 MU 9-16.7 was cloned into the HindIII site of pSP-C68-MU 0- 1.3-CMVIacZ to form the final shuttle plasmid pC68-CMV-LacZ. In addition, an 820 bp alkaline phosphatase (AP) cDNA fragment was isolated from pAdCMVALP (K. J. Fisher, et al., J. Virol., 70:520-532 (1996)) and exchanged for lacZ at Not I sites of pC68-CMV-lacZ, resulting in pC68-CMV-AP.
B. Construction of Recombinant Virus To create the E1-deleted recombinant C68-CMVEGFP vector, a pC68-CMV-EGFP shuttle plasmid was first constructed by replacing the lacZ transgene in pC68-CMV-lacZ with the enhanced green fluorescent protein (EGFP) gene. The replacement cloning process was carried out as the follows. An additional Notl restriction site was introduced into the 5' end of the EGFP coding sequence in the pEGFP-1 (Clontech, Palo Alto, CA) by BamHI digestion, filling in reaction and ligation of a 8 bp synthetic NotI
linker (NEB). After NotI restriction of both constructs, the EGFP sequence was isolated from the modified pEGFP-l and used to replace the lacZ gene in the pC68-CMV-LacZ. The pC68-CMVEGFP
construct (3 pg) was co-transfected with Ssp I-digested C68 genomic DNA (1 jig) into 293 cells for homologous recombination as previously described (G. Gao, et al, J.
Virol, 70:8934-8943 (1996)). Green plaques visualized by fluorescent microscopy were isolated for 2 rounds of plaque purification, expansion and purification by CsCI gradient sedimentation (G. Gao, et al, cited above).
The invention provides a uniquely modified version of the green/white selection process (A. R. Davis, et al., Gene Thera., 5:1148-1152 (1998)). The present example illustrates use of this method for construction of recombinant C68 vectors. A 7.2 kb fragment spanning 9 to 36 MU was isolated from the pBSC68-BamB plasmid by treatment with Agel and Bsiwl restriction endonucleases and cloned into Asp718 and Agel sites of pC68-CMV-AP shuttle plasmid, resulting in a new plasmid called pC68CMV-AP-MU36. A
further modification was made to remove 26 to 36 m.u. from pC68CMV-AP-MU36 by Eco47111 and Nrul digestions. The new shuttle plasmid called pC68CMV-AP-MU26 has a shorter region for homologous recombination (i.e., 16.7-26 MU) 3' to the minigene. To make a recombinant C68 vector, alkaline phosphatase (AP) is replaced with the gene of interest. The resulting pC68CMV-Nugene-MU26 construct is co-transfected with Xba I (16.5 MU) restricted C68-CMVGFP viral DNA into 293 cells, followed by top agar overlay. The recombinant virus plaques (white) are generated through the homologous recombination in the region of 16.7-26 MU which is shared between pC68CMV-Nugene construct and viral backbone; the recombinants which form white plaques are selected from green plaques of uncut C68-CMVGFP virus.
The green/white selection mechanism was also introduced to the process of cloning of the gene of interest into the pC68 shuttle plasmid. The AP gene in both pC68CMV-AP-MU36 and pC68CMV-AP-MU26 was replaced with a cassette of prokaryotic GFP gene driven by the lacZ promoter isolated from pGFPMU31 (Clontech, Palo Alto, CA).
Thus, white colonies of bacterial transformants will contain the recombinant plasmid. This green/white selection process for bacterial colonies circumvented the need for making and characterizing large numbers of minipreped DNAs and so further enhanced the efficiency in creating recombinant C68 vectors.
Example 2 - Chimpanzee C68 Virus Stock and Replication Examples 3 through 5 which follow provide additional characterization of the chimpanzee C68. It will be appreciated by one of skill in the art that this information can be readily used in the construction of novel recombinant chimpanzee adenoviral constructs.
The C68 virus stock was obtained from ATCC (Rockville, MD) and propagated in 293 cells (ATCC) cultured in DMEM (Sigma, St. Louis, MO) supplemented with 10%
fetal calf serum (FCS; Sigma or Hyclone, Logan, UT) and 1% Penicillin-Streptomycin (Sigma).
Infection of 293 cells was carried out in DMEM supplemented with 2% FCS for the first 24 hours, after which FCS was added to bring the final concentration to 10%.
Infected cells were harvested when 100% of the cells exhibited virus-induced cytopathic effect (CPE), collected, and concentrated by centrifugation. Cell pellets were resuspended in 10mM Tris (pH 8.0), and lysed by 3 cycles of freezing and thawing. Virus preparations were obtained following 2 ultra centrifuge steps on cesium chloride density gradients and stocks of virus were diluted to 1 x 1012 particles/ml in 10mM Tris/I OOmM NaC1/50% glycerol and stored at -70 C.
Example 3 - Cloning and sequencing of viral genomic DNA
Genomic DNA was isolated from the purified virus preparation following standard methods and digested with a panel of 16 restriction enzymes following the manufacturer's recommendations. Except as noted, all restriction and modifying enzymes were obtained from Boehringer Mannheim, Indianapolis, IN. Genomic DNA was digested with BamHI, PstI, Sall, HindIII or XbaI and the fragments were subcloned into plasmids (K.
L. Berkner and P.A. Sharp, Nuci. Acids Res., 11:6003-20 (1983)). After deproteination, synthetic 10bp PacI linkers (New England Biolabs, Beverly, MA) were double digested with PacI
and BamHI, or Pstl.
The PstI, BamHI and HindIII clones generated from C68 are illustrated in Figure 1, parts C, D and E, respectively. The fragments indicated by the shaded boxes were not cloned, but the sequence of the entire genome has been determined through sequencing overlapping clones and viral DNA directly (unshaded boxes). The cloned fragments and insert sizes are described in Table 1. In the following table, pBS = pBluescript SK+ clone;
pNEB = pNEB
193 clone; pBR = pBR322 clone; No prefix = fragment not cloned Table 1. C68 plasmid clones and insert sizes Construct Name Insert Size Fragment Fragment 5' End 3' End (base 5' End 3' End Map Unit Map Unit pairs) Pst-I Fragments C68-Pst-A 6768 24784 31551 67.9% 86.4%
pBS:C68-Pst-B 6713 4838 11550 13.2% 31.6%
pBS:C68-Pst-C 5228 14811 20038 40.6% 54.9%
pBS:C68-Pst-D 2739 12072 14810 33.1% 40.6%
pBS:C68-Pst-E 2647 20039 22685 54.9% 32.1%
pBS:C68-Pst-F 1951 32046 33996 87.8% 93.1%
pNEB:C68-Pst-G 1874 1 1874 0.0% 5.1%
pBS:C68-Pst-H 1690 23094 24783 63.2% 67.9%
pBS:C68-Pst-I 1343 33997 35339 93.1% 96.8%
pNEB:C68-Pst-J 1180 35340 36519 96.8% 100.0%
pBS:C68-Pst-K 1111 2763 3873 7.6% 10.6%
pBS:C68-Pst-L 964 3874 4837 10.6% 13.2%
pBS:C68-Pst-M 888 1875 2762 5.1% 7.6%
pBS:C68-Pst-N 408 22686 23093 62.1% 63.2%
C68-Pst-O 380 31666 32045 86.7% 87.7%
pBS:C68-Pst-P 285 11551 11835 31.6% 32.4%
C68-Pst-Q 236 11836 12071 32.4% 33.1%
pBS:C68-Pst-R 114 31552 31665 86.4% 86.7%
BamHI Fragments C68-Bam-A 16684 19836 36519 54.3% 100.0%
pBS:C68-Bam-B 8858 3582 12439 9.8% 34.1%
pBS:C68-Bam-C 4410 12440 16849 34.1% 46.1%
pBS:C68-Bam-D 2986 16850 19835 46.1% 54.3%
pNEB:C68-Bam-E 2041 1 2041 0.0% 5.6%
pBS:C68-Bam-F 1540 2042 3581 5.6% 9.8%
Hindlll Fragments pBR:C68-Hind-B 9150 23471 32620 64.3% 89.3%
Chimpanzee adenovirus, C68, was obtained from ATCC and propagated in human 293 cells. Viral genomic DNA was isolated from purified virions using established procedures (A. R. Davis, et al., Gene Thera., 5:1148-1152 (1998)) and digested with a panel of restriction enzymes; the data were consistent with previous studies (data not shown) (G. R.
Kitchingman, Gene, 20:205-210 (1982); Q. Li and G. Wadell, Arch Virol. 101:65-77 (1998);
R. Wigand, et al., Intervirology. 30:1-9 (1989)). Restriction fragments spanning the entire genome of C68 were subcloned into plasmids. A schematic drawing of the C68 genome is shown in Figure IA, and the Pst-I, BamHI and HindIll fragments that were cloned into plasmid vectors are indicated by the unshaded boxes, in Figs. 1 B, 1 C, and 1 D, respectively.
The cloned fragments, fragment sizes and genomic position are also listed in Table 1. Both plasmid clones and genomic DNA were used as templates for sequencing. The genome was sequenced by primer walking in both directions and each base was included in an average of approximately four reactions.
The C68 genome is 36521 bp in length [see, US Patent 6,083,716]. Preliminary comparison with GenBank sequences indicated varying degrees of similarity with other human and animal adenoviruses along the entire length of the viral genome.
Regions with homology to all of the previously described adenoviral genetic units, early regions 1-4 and the major late genes, were found in the C68 genome (Fig. IA). DNA homology between and the human adenoviruses that have been completely sequenced, Ad2 (NC001405), Ad5 (N0001405), Ad12 (N0001460), Ad17 (N0002067) and Ad40 (NCO1464), was used to order the clones. The open reading frames (ORF) were determined and the genes were identified based on homology to other human adenoviruses. All of the major adenoviral early and late genes are present in C68. The inverted terminal repeats (ITR=s) are 130 bp in length.
Example 4 - Analysis of C68 sequence The complete nucleotide sequence of every member of the Mastadenovirus genus accessible from GenBank, including isolates from different species, were screened for identity to C68. The Ad4 minigenome was assembled from the following GenBank sequences: Left-hand ITR (JO 1964); E 1 A region (M 14918); DNA pol and pTP
(X74508, 74672); VA RNA-I, II (U10682); 52, 55K (U52535); pVII (U70921); hexon (X84646);
endoprotease (M16692); DNA-binding protein (M12407); fiber (X76547); right-hand ITR
(JO1965). The Adz composite genome was created from the following sequence data: Mu 3-21 (X03000); VA RNA-1, II, pTP & 52, 55K (U52574); penton (AD001675); pVI, hexon and endoprotease (AF065065); DNA-binding protein (K02530); E3 and fiber region (AF104384);
right-hand ITR (V00037).
The amino acid sequence alignment was generated with Clustal X, edited with Jalview and analyzed with Boxshade. Publicly available hexon protein sequences from all human adenovirus serotypes were initially aligned to identify the set showing the highest homology to C68.
The nucleotide sequence and predicted amino acid sequences of all significant open reading frames in the C68 genome were compared to known DNA and protein sequences.
The nucleotide sequence of C68 was compared to sequences of Ad 2, 4, 5, 7, 12, 17 and 40.
In agreement with previous restriction analysis (Kitchingman, cited above) C68 is most similar to human Ad4 (subgroup E).
The EIA region of C68 extends from the TATA box at nt 480 to the poly A
addition site at 1521. The consensus splice donor and acceptor sites are in the analogous position of the human Ad counterparts, and the 28.2K and 24.8K proteins are similar in size to the human Ad proteins. The ORF for the smallest EIA protein of C68 is predicted to encode 101 residues as opposed to approximately 60 amino acids for other adenoviruses.
There is a TTA
codon at residue 60 for C68 where other adenoviruses often have a TGA stop codon. The first 60 residues of C68 ElA I00R protein have 85% identity to the Ad4 homolog.
The C68 genome encodes genes for the four E I B proteins, 20.5K, 54.7K, 10.1 K
and 18.5K as well as pIX. All five C68 encoded proteins are similar in size to that of other Ad E I B and pIX proteins. The Ad4 homolog of the E 1 B 21 K protein has only 142 amino acids, where C68 has 186 residues and other human adenoviruses have 163-178 residues.
The C68 and Ad4 proteins share 95% identity over the first 134 aa, then the similarity ends and the Ad4 protein terminates at 142 amino acids.
The C68 genome encodes homologs of the E2A 55K DNA binding protein and the Iva2 maturation protein, as well as the E2B terminal protein and the DNA
polymerase. All of the E2 region proteins are similar in size to their human Ad counterparts, and the E2B
proteins are particularly well conserved. The C68 E2B 123.6K DNA polymerase is predicted to be 1124 residues, while Ad4 is predicted to have 1193 although the other human adenoviruses have smaller polymerases. Residues 1-71 of the Ad4 polymerase have no similarity to any other Ad polymerase, and it is possible that this protein actually initiates at an internal ATG codon. From amino acids 72-1193, Ad4 and C68 polymerases have 96%
amino acid identity.
The E3 regions of human adenoviruses sequenced so far exhibit considerable sequence and coding capacity variability. Ad40 has five E3 region genes, Ad12 has six, C68 and Ad5 have seven, Ad38 has eight and Ad3 as well as Adz (subgroup B human adenoviruses) have nine putative E3 region genes. The Ad4 E3 region has not yet been sequenced. In comparison with the E3 region of Ad35, all 7 E3 gene homologs were identified in the C68 genome (C. F. Basler and M.S. Horwitz, Virology, 215:
(1996)).
The C68 E4 region has 6 ORFs and each is homologous to proteins in the human Ad5, 12 and 40 E4 region. The E4 nomenclature is confusing because the ORF2 homologs of C68, Ad12 and Ad40 are approximately 130 residues, while in Ad5 there are two ORFs encoding proteins of 64 and 67 residues with homology, respectively, to the amino and carboxy terminal ends of the larger ORF2 proteins. ORF5 has been omitted in our nomenclature because the 5th ORF in the E4 region is homologous to the widely studied ORF6 protein of human Ad5.
The major late promoter and the tri-partite leader sequences of the C68 genome were located. ORFs with the potential to encode the 15 major late proteins were located. All of the C68 late proteins are similar in size to their human Ad counterparts. The percent amino acid identity between chimpanzee and human Ad late proteins varies considerably. The C68 fiber protein is predicted to have 90% amino acid identity with the Ad4 protein, but much less similarity to the other human Ad fiber proteins. The CAR binding site in the fiber knob is present in C68.
Example 5 - Virus neutralizing antibody assays Several studies were performed to determine if there is cross-reactivity between type specific antisera of C68 and human adenovirus. The neutralizing activity of sera was tested as follows. Panels of sera from normal human subjects (N=50), rhesus monkeys (N=52) and chimpanzees (N=20) were evaluated for neutralizing antibodies against Ad5 and C68 based vectors using 293 cells as an indicator cell line. Sera collected from individual humans, rhesus monkeys, or chimpanzees were inactivated at 56 C for 30 minutes. A
serial dilution of each sample (1:10, 1:20, 1:40, 1:80, 1:160, 1:320 in 100p1 of DMEM
containing 10%
FCS) was added to equal amounts of H5.010CMVEGFP (1000 PFU/well) or C68CMVEGFP
virus and incubated at 4 C for two hrs. One hundred and fifty microliters of the mixture were transferred onto 2 x 10 293 cells in 96 well flat bottom plates. Control wells were infected with equal amounts of virus (without addition of serum). Samples were incubated at 37 C in 5% CO2 for 48 hrs and examined under a fluorescent microscope. Sample dilutions that showed >50% reduction of green-fluorescent foci as compared to infected controls were scored positive for neutralizing antibodies.
As expected, approximately 35% of normal human subjects demonstrated neutralizing antibody against Ad5, a frequency much higher than observed in sera of rhesus monkeys and chimpanzee. Neutralizing antibody to C68 was observed in 80% of chimpanzee and only 2%
of normal human subjects or rhesus monkeys. Titers of neutralizing antibodies in the non-target species were generally low.
To further evaluate cross-reactivity of C68 with human adenovirus vectors, mice were immunized with 2 x 107 plaque forming units (pfu) of Ad 2, 4, 5, 7 and 12 as well as C68.
Sera were harvested 2 weeks later and tested for antibodies that neutralized either Ad5 or C68 vectors. Neutralizing antibody to Ad5 vector was only detected in animals immunized with Ad5. Importantly, the only animals with neutralizing antibody to C68 vector were those immunized with C68 vector; none of the human serotypes tested, including Ad4, generated antibodies in mice that neutralized C68 in vitro.
Important to the utility of C68 vector in human trials is the absence of neutralizing antibody in the human population. In our study, a screen of 50 normal human subjects failed to detect any significant neutralizing antibodies (>1:10) using the same assay that showed neutralizing antibodies in >50% of chimpanzees. Furthermore, sera of mice immunized with multiple human Ad serotypes including Ad4, did not neutralize infection with C68.
Example 6 - Structural analysis of hexon proteins The absence of neutralizing antibodies between C68 and human serotypes compelled us to more carefully evaluate structural differences in the regions of hexon presumed to harbor type specific epitopes. Previous studies have suggested that these epitopes are located within the 7 hypervariable regions of hexon determined by Crawford-Miksza and Schnurr (J
Virol, 70:1836-1844 (1996)). A comparison of the amino acid sequences of hexon proteins between C68 and several human adenoviruses is shown in Figure 3. Indeed, C68 is substantially dissimilar in significant regions of these hypervariable sequences.
Example 7 - Construction of C68-Derived Capsid Containing a Human Fiber Gene To generate a C68-derived vector with an altered tropism, the chimeric fiber gene construct containing the Ad5 fiber knob fused to the C68 tail and shaft is incorporated into a plasmid carrying the C68 genome. For the precise replacement of the wild-type C68 fiber gene, a plasmid carrying the green fluorescent protein driven by a CMV
promoter is used for modification of C68 fiber. The resulting transfer vector contains a CMV
promoter driven green fluorescent protein (GFP) expression cassette inserted into the E3 region, the chimeric C68/Ad5 fiber gene, and E4. This transfer vector was used for incorporation of GFP cassette and modified fiber gene into the backbone of an E3 deleted C68 infectious plasmid via homologous recombination in E. coli. The viral genome was released from the plasmid by PacI digestion and used to transfect 293 cells. The chimeric C68-derived virus is produced about 3 weeks following transfection using techniques described herein.
Similar techniques can be readily utilized to construct other C68-derived capsids.
While the invention has been described with reference to a particularly preferred embodiment, it will be appreciated that modifications can be made without departing from the spirit of the invention. Such modifications are intended to fall within the scope of the appended claims.
SEQUENCE LISTING
<110> The Trustees of the University of Pennsylvania <120> Method for Rapid Screening of Bacterial Transformants and Novel Simian Adenovirus Proteins <130> 08899274CA
<140>
<141> 2002-06-20 <150> US 60/300,501 <151> 2001-06-22 <150> US 60/385,632 <151> 2002-06-04 <160> 41 <170> Patentln version 3.1 <210> 1 <211> 101 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 1 Met Arg His Leu Arg Asp Leu Pro Asp Glu Lys Ile Ile Ile Ala Ser Gly Asn Glu Ile Leu Glu Leu Val Val Pro Ser Leu Thr Gln Met Met Arg Pro Pro Leu Gln Ser Pro Leu Arg His Pro Gln Lys Leu Ala His Leu His Leu Arg Ile Leu Leu Asp Gln Phe Leu Leu Glu Pro Leu Gly Gly Glu Gln Leu Trp Asn Val Trp Met Thr Cys Tyr Arg Val Gly Leu Asn Leu Trp Thr Cys Val Pro Giy Asn Ala Pro Gly Thr Lys Cys His Thr Cys Val Phe Thr <210> 2 <211> 257 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 2 Met Arg His Leu Arg Asp Leu Pro Asp Glu Lys Ile Ile Ile Ala Ser Gly Asn Glu Ile Leu Glu Leu Val Val Asn Ala Met Met Gly Asp Asp Pro Pro Glu Pro Pro Thr Pro Phe Glu Thr Pro Ser Leu His Asp Leu Tyr Asp Leu Glu Val Asp Val Pro Glu Asp Asp Pro Asn Glu Glu Ala Val Asn Asp Phe Phe Ser Asp Ala Ala Leu Leu Ala Ala Glu Glu Ala Ser Ser Ser Ser Ser Asp Ser Asp Ser Ser Leu His Thr Pro Arg Pro Gly Arg Gly Glu Lys Lys Ile Pro Glu Leu Lys Gly Glu Glu Met Asp Leu Arg Cys Tyr Glu Glu Cys Leu Pro Pro Ser Asp Asp Glu Asp Glu Gln Ala Ile Gln Asn Ala Ala Ser Gln Gly Val Gln Ala Ala Ser Glu Ser Phe Ala Leu Asp Cys Pro Pro Leu Pro Gly His Gly Cys Lys Ser Cys Glu Phe His Arg Met Asn Thr Gly Asp Lys Ala Val Leu Cys Ala Leu Cys Tyr Met Arg Ala Tyr Asn His Cys Val Tyr Ser Pro Val Ser Asp Ala Asp Asp Glu Thr Pro Thr Thr Lys Ser Thr Ser Ser Pro Pro Glu Ile Gly Thr Ser Pro Pro Glu Asn Ile Val Arg Pro Val Pro Val Arg Ala Thr Gly Arg Arg Ala Ala Val Glu Cys Leu Asp Asp Leu Leu Gln Gly Gly Val Glu Pro Leu Asp Leu Cys Thr Arg Lys Arg Pro Arg His <210> 3 <211> 226 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 3 Met Arg His Leu Arg Asp Leu Pro Asp Glu Lys Ile Ile Ile Ala Ser Gly Asn Glu Ile Leu Glu Leu Val Val Asn Ala Met Met Gly Asp Asp Pro Pro Glu Pro Pro Thr Pro Phe Glu Thr Pro Ser Leu His Asp Leu Tyr Asp Leu Glu Val Asp Val Pro Glu Asp Asp Pro Asn Glu Glu Ala Val Asn Asp Phe Phe Ser Asp Ala Ala Leu Leu Ala Ala Glu Glu Ala Ser Ser Ser Ser Ser Asp Ser Asp Ser Ser Leu His Thr Pro Arg Pro Gly Arg Gly Glu Lys Lys Ile Pro Glu Leu Lys Gly Glu Glu Met Asp Leu Arg Cys Tyr Glu Glu Cys Leu Pro Pro Ser Asp Asp Giu Asp Glu Gln Ala Ile Gln Asn Ala Ala Ser Gln Gly Val Gln Ala Ala Ser Glu Ser Phe Ala Leu Asp Cys Pro Pro Leu Pro Gly His Gly Cys Pro Val Ser Asp Ala Asp Asp Glu Thr Pro Thr Thr Lys Ser Thr Ser Ser Pro Pro Glu Ile Gly Thr Ser Pro Pro Glu Asn Ile Val Arg Pro Val Pro Val Arg Ala Thr Gly Arg Arg Ala Ala Val Glu Cys Leu Asp Asp Leu Leu Gln Gly Gly Val Glu Pro Leu Asp Leu Cys Thr Arg Lys Arg Pro Arg His <210> 4 <211> 186 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 4 Met Glu Ile Trp Thr Val Leu Glu Asp Phe His Lys Thr Arg Gln Leu Leu Glu Asn Ala Ser Asn Gly Val Ser Tyr Leu Trp Arg Phe Cys Phe Gly Gly Asp Leu Ala Arg Leu Val Tyr Arg Ala Lys Gin Asp Tyr Ser Glu Gln Phe Glu Val Ile Leu Arg Glu Cys Ser Gly Leu Phe Asp Ala Leu Asn Leu Gly His Gln Ser His Phe Asn Gln Arg Ile Ser Arg Ala Leu Asp Phe Thr Thr Pro Gly Arg Thr Thr Ala Ala Val Ala Phe Phe Ala Phe Ile Leu Asp Lys Trp Ser Gln Glu Thr His Phe Ser Arg Asp Tyr Gln Leu Asp Phe Leu Ala Val Ala Leu Trp Arg Thr Trp Lys Cys Gln Arg Leu Asn Ala Ile Ser Gly Tyr Leu Pro Val Gln Pro Leu Asp Thr Leu Arg Ile Leu Asn Leu Gln Glu Ser Pro Arg Ala Arg Gln Arg Arg Gln Gln Gln Gln Gln Glu Glu Asp Gln Glu Glu Asn Pro Arg Ala Gly Leu Asp Pro Pro Ala Glu Glu Glu Glu <210> 5 <211> 498 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 5 Met Glu Ser Arg Asn Pro Phe Gln Gln Gly Leu Pro Ala Gly Phe Leu Ser Ser Ser Phe Val Glu Asn Met Glu Val Pro Ala Pro Glu Cys Asn Leu Arg Leu Leu Ala Gly Thr Ala Ala Arg His Ser Glu Asp Pro Glu Ser Pro Gly Glu Ser Gln Gly Thr Pro Thr Ser Pro Ala Ala Ala Ala Gly Gly Gly Ser Arg Arg Glu Pro Glu Ser Arg Pro Gly Pro Ser Gly Gly Gly Gly Gly Val Ala Asp Leu Phe Pro Glu Leu Arg Arg Val Leu Thr Arg Ser Ser Ser Gly Arg Glu Arg Gly Ile Lys Arg Glu Arg His Asp Glu Thr Asn His Arg Thr Glu Leu Thr Val Gly Leu Met Ser Arg Lys Arg Pro Glu Thr Val Trp Trp His Glu Val Gln Ser Thr Gly Thr Asp Glu Val Ser Val Met His Glu Arg Phe Ser Leu Glu Gin Val Lys Thr Cys Trp Leu Glu Pro Glu Asp Asp Trp Glu Val Ala Ile Arg Asn Tyr Ala Lys Leu Ala Leu Arg Pro Asp Lys Lys Tyr Lys Ile Thr Lys Leu Ile Asn Ile Arg Asn Ala Cys Tyr Ile Ser Gly Asn Gly Ala Glu Val Glu Ile Cys Leu Gln Glu Arg Val Ala Phe Arg Cys Cys Met Met Asn Met Tyr Pro Gly Val Val Gly Met Asp Gly Val Thr Phe Met Asn Met Arg Phe Arg Gly Asp Gly Tyr Asn Gly Thr Val Phe Met Ala Asn Thr Lys Leu Thr Val His Gly Cys Ser Phe Phe Gly Phe Asn Asn Thr Cys Ile Glu Ala Trp Gly Gln Val Gly Val Arg Gly Cys Ser Phe Ser Ala Asn Trp Met Gly Val Val Gly Arg Thr Lys Ser Met Leu Ser Val Lys Lys Cys Leu Phe Glu Arg Cys His Leu Gly Val Met Ser Glu Gly Glu Ala Arg Ile Arg His Cys Ala Ser Thr Glu Thr Gly Cys Phe Val Leu Cys Lys Giy Asn Ala Lys Ile Lys His Asn Met Ile Cys Gly Ala Ser Asp Glu Arg Gly Tyr Gln Met Leu Thr Cys Ala Gly Gly Asn Ser His Met Leu Ala Thr Val His Val Ala Ser His Ala Arg Lys Pro Trp Pro Glu Phe Glu His Asn Val Met Thr Arg Cys Asn Met His Leu Gly Ser Arg Arg Gly Met Phe Met Pro Tyr Gln Cys Asn Leu Asn Tyr Val Lys Val Leu Leu Glu Pro Asp Ala Met Ser Arg Val Ser Leu Thr Gly Val Phe Asp Met Asn Val Glu Val Trp Lys Ile Leu Arg Tyr Asp Glu Ser Lys Thr Arg Cys Arg Ala Cys Glu Cys Gly Gly Lys His Ala Arg Phe Gln Pro Val Cys Val Asp Val Thr Glu Asp Leu Arg Pro Asp His Leu Val Leu Pro Cys Thr Gly Thr Glu Phe Gly Ser Ser Gly Glu Glu Ser Asp <210> 6 <211> 169 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 6 Met Glu Ser Arg Asn Pro Phe Gin Gln Gly Leu Pro Ala Gly Phe Leu Ser Ser Ser Phe Val Glu Asn Met Glu Val Pro Ala Pro Glu Cys Asn Leu Arg Leu Leu Ala Gly Thr Ala Ala Arg His Ser Glu Asp Pro Glu Ser Pro Gly Glu Ser Gln Gly Thr Pro Thr Ser Pro Ala Ala Ala Ala Gly Gly Gly Ser Arg Arg Glu Pro Glu Ser Arg Pro Gly Pro Ser Gly Gly Gly Gly Gly Val Ala Asp Leu Phe Pro Glu Leu Arg Arg Val Leu Thr Arg Val Ser Leu Thr Gly Val Phe Asp Met Asn Val Glu Val Trp Lys Ile Leu Arg Tyr Asp Glu Ser Lys Thr Arg Cys Arg Ala Cys Glu Cys Gly Gly Lys His Ala Arg Phe Gln Pro Val Cys Val Asp Val Thr Glu Asp Leu Arg Pro Asp His Leu Val Leu Pro Cys Thr Gly Thr Glu Phe Gly Ser Ser Gly Glu Glu Ser Asp <210> 7 <211> 93 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 7 Met Glu Ser Arg Asn Pro Phe Gln Gln Gly Leu Pro Ala Gly Phe Leu Ser Ser Ser Phe Val Glu Asn Met Glu Val Pro Ala Pro Glu Cys Asn Leu Arg Leu Leu Ala Gly Thr Ala Ala Arg His Ser Glu Asp Pro Glu Ser Pro Gly Glu Ser Gln Gly Thr Pro Thr Ser Pro Ala Ala Ala Ala Gly Gly Gly Ser Arg Arg Glu Pro Glu Ser Arg Pro Gly Pro Ser Gly Gly Gly Gly Gly Val Ala Asp Leu Pro Cys Val Trp Met <210> 8 <211> 142 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 8 Met Ser Gly Ser Gly Ser Phe Glu Gly Gly Val Phe Ser Pro Tyr Leu Thr Gly Arg Leu Pro Ser Trp Ala Gly Val Arg Gln Asn Val Met Gly Ser Thr Val Asp Gly Arg Pro Val Gln Pro Ala Asn Ser Ser Thr Leu Thr Tyr Ala Thr Leu Ser Ser Ser Ser Leu Asp Ala Ala Ala Ala Ala Ala Ala Ala Ser Ala Ala Ser Ala Val Arg Gly Met Ala Met Gly Ala Gly Tyr Tyr Gly Thr Leu Val Ala Asn Ser Ser Ser Thr Asn Asn Pro Ala Ser Leu Asn Glu Glu Lys Leu Leu Leu Leu Met Ala Gln Leu Glu Ala Leu Thr Gln Arg Leu Gly Glu Leu Thr Gln Gln Val Ala Gln Leu Gln Glu Gln Thr Arg Ala Ala Val Ala Thr Val Lys Ser Lys <210> 9 <211> 448 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 9 Met Glu Thr Lys Gly Arg Arg Ser Gly Ala Val Phe Asp Gln Pro Asp Glu Pro Glu Ala His Pro Arg Lys Arg Pro Ala Arg Arg Ala Pro Leu His Arg Asp Gly Asp His Pro Asp Ala Asp Ala Ala Thr Leu Glu Gly Pro Asp Pro Gly Cys Ala Gly Arg Pro Ser Ser Gly Ala Ile Leu Pro Gln Pro Ser Gln Pro Ala Lys Arg Gly Gly Leu Leu Asp Arg Asp Ala Val Glu His Ile Thr Glu Leu Trp Asp Arg Leu Glu Leu Leu Gln Gln Thr Leu Ser Lys Met Pro Met Ala Asp Gly Leu Lys Pro Leu Lys Asn Phe Ala Ser Leu Gln Glu Leu Leu Ser Leu Gly Gly Glu Arg Leu Leu Ala Glu Leu Val Arg Glu Asn Met His Val Arg Glu Met Met Asn Glu Val Ala Pro Leu Leu Arg Glu Asp Gly Ser Cys Leu Ser Leu Asn Tyr His Leu Gln Pro Val Ile Gly Val Ile Tyr Gly Pro Thr Gly Cys Gly Lys Ser Gln Leu Leu Arg Asn Leu Leu Ser Ala Gln Leu Ile Ser Pro Ala Pro Glu Thr Val Phe Phe Ile Ala Pro Gln Val Asp Met Ile Pro Pro Ser Glu Leu Lys Ala Trp Glu Met Gln Ile Cys Glu Gly Asn Tyr Ala Pro Gly Ile Glu Gly Thr Phe Val Pro Gln Ser Gly Thr Leu Arg Pro Lys Phe Ile Lys Met Ala Tyr Asp Asp Leu Thr Gln Asp His Asn Tyr Asp Val Ser Asp Pro Arg Asn Val Phe Ala Gln Ala Ala Ala His Gly Pro Ile Ala Ile Ile Met Asp Glu Cys Met Glu Asn Leu Gly Gly His Lys Gly Val Ala Lys Phe Phe His Ala Phe Pro Ser Lys Leu His Asp Lys Phe Pro Lys Cys Thr Gly Tyr Thr Val Leu Val Val Leu His Asn Met Asn Pro Arg Arg Asp Leu Gly Gly Asn Ile Ala Asn Leu Lys Ile Gln Ala Lys Met His Leu Ile Ser Pro Arg Met His Pro Ser Gln Leu Asn Arg Phe Val Asn Thr Tyr Thr Lys Gly Leu Pro Val Ala Ile Ser Leu Leu Leu Lys Asp Ile Val Gln His His Ala Leu Arg Pro Cys Tyr Asp Trp Val Ile Tyr Asn Thr Thr Pro Glu His Glu Ala Leu Gln Trp Ser Tyr Leu His Pro Arg Asp Gly Leu Met Pro Met Tyr Leu Asn Ile Gln Ala His Leu Tyr Arg Val Leu Glu Lys Ile His Arg Val Leu Asn Asp Arg Asp Arg Trp Ser Arg Ala Tyr Arg Ala Arg Lys Ile Lys <210> 10 <211> 200 <212> PRT
<213> chimpanzee C68 adenovirus protein <220>
<221> MISC FEATURE
<222> (137)..(137) <223> xaa can be any amino acid <220>
<221> MISC FEATURE
<222> (155)..(155) <223> xaa can be any amino acid <400> 10 Met Arg Ala Asp Gly Glu Glu Leu Asp Leu Leu Pro Pro Ile Gly Gly Met Ala Val Asp Val Met Glu Val Glu Met Pro Thr Ala Arg Arg Thr Leu Val Leu Val Phe Ile Gln Ala Ala Thr Val Leu Ala Thr Leu His Gly Met His Val Leu His Glu Leu Tyr Leu Ser Ser Phe Asp Glu Glu Phe Gln Trp Glu Val Glu Ser Trp Arg Leu His Leu Val Leu Tyr Tyr Val Val Val Val Gly Leu Ala Leu Phe Cys Leu Asp Gly Gly His Ala Asp Glu Pro Ala Arg Glu Ala Gly Pro Asp Leu Gly Ala Ser Gly Ser Glu Ser Glu Asp Glu Gly Ala Gln Ala Gly Ala Val Gln Gly Pro Glu Thr Leu Arg Ser Gin Val Ser Gly Xaa Arg Arg Arg Ala Val Asp Leu Gln Glu Phe Phe Gln Gly Ala Arg Glu Val Xaa Met Val Leu Asp Leu His Arg Ala Ile Gly Gly Glu Leu His Gly Leu Gln Gly Pro Val Pro Leu Gly Cys Asp His Arg Pro Pro Phe Leu Leu Gly Arg Leu Gly Arg Arg Gly Arg Cys Leu Phe His Gly <210> 11 <211> 391 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 11 Met His Pro Val Leu Arg Gln Met Arg Pro His His Pro Pro Pro Gln Gln Gln Pro Pro Pro Gln Pro Ala Leu Leu Pro Pro Pro Gln Gln Gln Leu Pro Ala Thr Thr Ala Ala Ala Ala Val Ser Gly Ala Gly Gin Ser Tyr Asp His Gln Leu Ala Leu Glu Glu Gly Glu Gly Leu Ala Arg Leu Gly Ala Ser Ser Pro Glu Arg His Pro Arg Val Gln Met Lys Arg Asp Ala Arg Glu Ala Tyr Val Pro Lys Gln Asn Leu Phe Arg Asp Arg Ser Gly Glu Glu Pro Glu Glu Met Arg Ala Ala Arg Phe His Ala Gly Arg Glu Leu Arg Arg Gly Leu Asp Arg Lys Arg Val Leu Arg Asp Glu Asp Phe Glu Ala Asp Glu Leu Thr Gly Ile Ser Pro Ala Arg Ala His Val Ala Ala Ala Asn Leu Val Thr Ala Tyr Glu Gln Thr Val Lys Glu Glu Ser Asn Phe Gln Lys Ser Phe Asn Asn His Val Arg Thr Leu Ile Ala Arg Glu Glu Val Thr Leu Gly Leu Met His Leu Trp Asp Leu Leu Glu Ala Ile Val Gln Asn Pro Thr Ser Lys Pro Leu Thr Ala Gln Leu Phe Leu Val Val Gln His Ser Arg Asp Asn Glu Ala Phe Arg Glu Ala Leu Leu Asn Ile Thr Glu Pro Glu Gly Arg Trp Leu Leu Asp Leu Val Asn Ile Leu Gln Ser Ile Val Val Gln Glu Arg Gly Leu Pro Leu Ser Glu Lys Leu Ala Ala Ile Asn Phe Ser Val Leu Ser Leu Gly Lys Tyr Tyr Ala Arg Lys Ile Tyr Lys Thr Pro Tyr Val Pro Ile Asp Lys Glu Val Lys Ile Asp Gly Phe Tyr Met Arg Met Thr Leu Lys Val Leu Thr Leu Ser Asp Asp Leu Gly Val Tyr Arg Asn Asp Arg Met His Arg Ala Val Ser Ala Ser Arg Arg Arg Glu Leu Ser Asp Gln Glu Leu Met His Ser Leu Gln Arg Ala Leu Thr Gly Ala Gly Thr Glu Gly Glu Ser Tyr Phe Asp Met Gly Ala Asp Leu His Trp Gln Pro Ser Arg Arg Ala Leu Glu Ala Ala Ala Gly Pro Tyr Val Glu Glu Val Asp Asp Glu Val Asp Giu Glu Gly Glu Tyr Leu Glu Asp <210> 12 <211> 534 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 12 Met Met Arg Arg Ala Tyr Pro Glu Gly Pro Pro Pro Ser Tyr Glu Ser Val Met Gln Gln Ala Met Ala Ala Ala Ala Met Gln Pro Pro Leu Glu Ala Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr Glu Gly Arg Asn Ser Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Arg Leu Tyr Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Ser Leu Asn Tyr Gln Asn Asp His Ser Asn Phe Leu Thr Thr Val Val Gln Asn Asn Asp Phe Thr Pro Thr Glu Ala Ser Thr Gln Thr Ile Asn Phe Asp Glu Arg Ser Arg Trp Gly Gly Gln Leu Lys Thr Ile Met His Thr Asn Met Pro Asn Val Asn Glu Phe Met Tyr Ser Asn Lys Phe Lys Ala Arg Val Met Val Ser Arg Lys Thr Pro Asn Gly Val Thr Val Thr Glu Asp Tyr Asp Gly Ser Gln Asp Glu Leu Lys Tyr Glu Trp Val Glu Phe Glu Leu Pro Glu Gly Asn Phe Ser Val Thr Met Thr Ile Asp Leu Met Asn Asn Ala Ile Ile Asp Asn Tyr Leu Ala Val Gly Arg Gln Asn Gly Val Leu Glu Ser Asp Ile Gly Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Trp Asp Pro Val Thr Glu Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe His Pro Asp Ile Val Leu Leu Pro Gly Cys Gly Val Asp Phe Thr Glu Ser Arg Leu Ser Asn Leu Leu Gly Ile Arg Lys Arg Gln Pro Phe Gln Glu Gly Phe Gln Ile Met Tyr Glu Asp Leu Glu Gly Gly Asn Ile Pro Ala Leu Leu Asp Val Asp Ala Tyr Glu Lys Ser Lys Glu Asp Ala Ala Ala Glu Ala Thr Ala Ala Val Ala Thr Ala Ser Thr Glu Val Arg Gly Asp Asn Phe Ala Ser Ala Ala Ala Val Ala Ala Ala Glu Ala Ala Glu Thr Glu Ser Lys Ile Val Ile Gln Pro Val Glu Lys Asp Ser Lys Asn Arg Ser Tyr Asn Val Leu Pro Asp Lys Ile Asn Thr Ala Tyr Arg Ser Trp Tyr Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser Trp Thr Leu Leu Thr Thr Ser Asp Val Thr Cys Gly Val Glu Gln Val Tyr Trp Ser Leu Pro Asp Met Met Gln Asp Pro Val Thr Phe Arg Ser Thr Arg Gln Val Ser Asn Tyr Pro Val Val Gly Ala Glu Leu Leu Pro Val Tyr Ser Lys Ser Phe Phe Asn Glu Gln Ala Val Tyr Ser Gln Gln Leu Arg Ala Phe Thr Ser Leu Thr His Val Phe Asn Arg Phe Pro Glu Asn Gln Ile Leu Val Arg Pro Pro Ala Pro Thr Ile Thr Thr Val Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg Ser Ser Ile Arg Gly Val Gln Arg Val Thr Val Thr Asp Ala Arg Arg Arg Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly Ile Val Ala Pro Arg Val Leu Ser Ser Arg Thr Phe <210> 13 <211> 201 <212> PRT
<213> chimpanzee C68 adenovirus protein <220>
<221> MISC FEATURE
<222> (70)_.(70) <223> Xaa can be any amino acid <400> 13 Met Ser Ile Leu Ile Ser Pro Ser Asn Asn Thr Gly Trp Gly Leu Arg Ala Pro Ser Lys Met Tyr Gly Gly Ala Arg Gln Arg Ser Thr Gln His Pro Val Arg Val Arg Gly His Phe Arg Ala Pro Trp Gly Ala Leu Lys Gly Arg Val Arg Ser Arg Thr Thr Val Asp Asp Val Ile Asp Gln Val Val Ala Asp Ala Arg Xaa Tyr Thr Pro Ala Ala Ala Pro Val Ser Thr Val Asp Ala Val Ile Asp Ser Val Val Ala Asp Ala Arg Arg Tyr Ala Arg Ala Lys Ser Arg Arg Arg Arg Ile Ala Arg Arg His Arg Ser Thr Pro Ala Met Arg Ala Ala Arg Ser Leu Val Ala Gln Gly Gln Ala His Gly Thr Gln Gly His Val Gln Gly Gly Gln Thr Arg Gly Phe Arg Arg Gln Arg Arg Gln Asp Pro Glu Thr Arg Gly His Gly Gly Gly Ser Gly His Arg Gln His Val Pro Pro Ala Ala Arg Glu Arg Val Leu Gly Ala Arg Arg Arg His Arg Cys Ala Arg Ala Arg Ala His Pro Pro Pro Ser His Leu Lys Met Phe Thr Ser Arg Cys <210> 14 <211> 356 <212> PRT
<213> chimpanzee C68 adenovirus protein <220>
<221> MISC FEATURE
<222> (111)..(111) <223> Xaa can be any amino acid <220>
<221> MISC FEATURE
<222> (183)..(183) <223> Xaa can be any amino acid <220>
<221> MISC_FEATURE
<222> (212)..(212) <223> Xaa can be any amino acid <220>
<221> MISC_FEATURE
<222> (220)..(220) <223> Xaa can be any amino acid <400> 14 Met Ser Lys Arg Lys Phe Lys Glu Glu Met Leu Gln Val Ile Ala Pro Glu Ile Tyr Gly Pro Ala Val Val Lys Glu Glu Arg Lys Pro Arg Lys Ile Lys Arg Val Lys Lys Asp Lys Lys Glu Glu Glu Ser Asp Val Asp Gly Leu Val Glu Phe Val Arg Glu Phe Ala Pro Arg Arg Arg Val Gln Trp Arg Gly Arg Lys Val Gln Pro Val Leu Arg Pro Gly Thr Thr Val Val Phe Thr Pro Gly Glu Arg Ser Gly Thr Ala Ser Lys Arg Ser Tyr Asp Glu Val Tyr Gly Asp Asp Asp Ile Leu Glu Gln Ala Ala Xaa Arg Leu Gly Glu Phe Ala Tyr Gly Lys Arg Ser Arg Ser Ala Pro Lys Glu Glu Ala Val Ser Ile Pro Leu Asp His Gly Asn Pro Thr Pro Ser Leu Lys Pro Val Thr Leu Gln Gln Val Leu Pro Thr Ala Ala Pro Arg Arg Gly Phe Lys Arg Glu Gly Glu Asp Leu Tyr Pro Thr Met Gln Leu Met Val Pro Lys Arg Gln Lys Xaa Glu Asp Val Leu Glu Thr Met Lys Val Asp Pro Asp Val Gln Pro Glu Val Lys Val Arg Pro Ile Lys Gln Val Ala Pro Gly Xaa Gly Val Gln Thr Val Asp Ile Xaa Ile Pro Thr Glu Pro Met Glu Thr Gln Thr Glu Pro Met Ile Lys Pro Ser Thr Ser Thr Met Glu Val Gln Thr Asp Pro Trp Met Pro Ser Ala Pro Ser Arg Arg Pro Arg Arg Lys Tyr Gly Ala Ala Ser Leu Leu Met Pro Asn Tyr Ala Leu His Pro Ser Ile Ile Pro Thr Pro Gly Tyr Arg Gly Thr Arg Phe Tyr Arg Gly His Thr Ser Ser Arg Arg Arg Lys Thr Thr Thr Arg Arg Ser Pro Ser Pro His Arg Arg Cys Asn His Pro Cys Arg Pro Gly Ala Glu Ser Val Pro Pro Arg Pro Arg Thr Ser Asp Pro Ala Ala Arg Ala Leu Pro Pro Glu His Arg His Leu Asn Phe Arg Gln Leu Cys Arg Ser Met Ala Leu Thr <210> 15 <211> 257 <212> PRT
<213> chimpanzee C68 adenovirus protein <220>
<221> MISC FEATURE
<222> (202)..(202) <223> Xaa can be any amino acid <220>
<221> MISC FEATURE
<222> (210)..(210) <223> Xaa can be any amino acid <220>
<221> MISC_FEATURE
<222> (256)..(256) <223> Xaa can be any amino acid <400> 15 Met Asp Ser Asp Ala Pro Gly Pro Val Met Cys Phe Arg Arg Gln Met Glu Asp Ile Asn Phe Ser Ser Leu Ala Pro Arg His Gly Thr Arg Pro Phe Met Gly Thr Trp Ser Asp Ile Gly Thr Ser Gln Leu Asn Gly Gly Ala Phe Asn Trp Ser Ser Leu Trp Ser Gly Leu Lys Asn Phe Gly Ser Thr Leu Lys Thr Tyr Gly Ser Lys Ala Trp Asn Ser Thr Thr Gly Gln Ala Leu Arg Asp Lys Leu Lys Glu Gln Asn Phe Gln Gln Lys Val Val Asp Gly Leu Ala Ser Gly Ile Asn Gly Val Val Asp Leu Ala Asn Gln Ala Val Gln Arg Gln Ile Asn Ser Arg Leu Asp Pro Val Pro Pro Ala Gly Ser Val Glu Met Pro Gln Val Glu Glu Glu Leu Pro Pro Leu Asp Lys Arg Gly Glu Lys Arg Pro Arg Pro Asp Ala Glu Glu Thr Leu Leu Thr His Thr Asp Glu Pro Pro Pro Tyr Glu Glu Ala Val Lys Leu Gly Leu Pro Thr Thr Arg Pro Ile Ala Pro Leu Ala Thr Gly Val Leu Lys Pro Glu Lys Pro Ala Thr Leu Asp Leu Xaa Pro Pro Gln Pro Ser Arg Pro Xaa Thr Val Ala Lys Pro Leu Pro Pro Val Ala Val Ala Arg Ala Arg Pro Gly Gly Thr Ala Arg Pro His Ala Asn Trp Gln Ser Thr Leu Asn Ser Ile Val Gly Leu Gly Val Gln Ser Val Lys Arg Arg Arg Xaa Tyr <210> 16 <211> 933 <212> PRT
<213> chimpanzee C68 adenovirus protein <220>
<221> MISC FEATURE
<222> (826)..(826) <223> Xaa can be any amino acid <220>
<221> MISC FEATURE
<222> (922)..(922) <223> Xaa can be any amino acid <400> 16 Met Ala Thr Pro Ser Met Leu Pro Gln Trp Ala Tyr Met His Ile Ala Gly Gln Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gln Phe Ala Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gln Arg Leu Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Asn Thr Tyr Ser Tyr Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro Ser Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly Ala Pro Asn Thr Cys Gln Trp Thr Tyr Lys Ala Asp Gly Glu Thr Ala Thr Glu Lys Thr Tyr Thr Tyr Gly Asn Ala Pro Val Gln Gly Ile Asn Ile Thr Lys Asp Gly Ile Gln Leu Gly Thr Asp Thr Asp Asp Gln Pro Ile Tyr Ala Asp Lys Thr Tyr Gln Pro Glu Pro Gln Val Gly Asp Ala Glu Trp His Asp Ile Thr Gly Thr Asp Glu Lys Tyr Gly Gly Arg Ala Leu Lys Pro Asp Thr Lys Met Lys Pro Cys Tyr Gly Ser Phe Ala Lys Pro Thr Asn Lys Glu Gly Gly Gln Ala Asn Val Lys Thr Gly Thr Gly Thr Thr Lys Glu Tyr Asp Ile Asp Met Ala Phe Phe Asp Asn Arg Ser Ala Ala Ala Ala Gly Leu Ala Pro Glu Ile Val Leu Tyr Thr Glu Asn Val Asp Leu Glu Thr Pro Asp Thr His Ile Val Tyr Lys Ala Gly Thr Asp Asp Ser Ser Ser Ser Ile Asn Leu Gly Gin Gln Ala Met Pro Asn Arg Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser Gln Leu Asn Ala Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser Tyr Gln Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asp Ala Val Gly Arg Thr Asp Thr Tyr Gln Gly Ile Lys Ala Asn Gly Thr Asp Gln Thr Thr Trp Thr Lys Asp Asp Ser Val Asn Asp Ala Asn Glu Ile Gly Lys Gly Asn Pro Phe Ala Met Glu Ile Asn Ile Gln Ala Asn Leu Trp Arg Asn Phe Leu Tyr Ala Asn Val Ala Leu Tyr Leu Pro Asp Ser Tyr Lys Tyr Thr Pro Ala Asn Val Thr Leu Pro Thr Asn Thr Asn Thr Tyr Asp Tyr Met Asn Gly Arg Val Val Ala Pro Ser Leu Val Asp Ser Tyr Ile Asn Ile Gly Ala Arg Trp Ser Leu Asp Pro Met Asp Asn Val Asn Pro Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe His Ile Gln Val Pro Gln Lys Phe Phe Ala Ile Lys Ser Leu Leu Leu Leu Pro Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg Lys Asp Val Asn Met Ile Leu Gln Ser Ser Leu Gly Asn Asp Leu Arg Thr Asp Gly Ala Ser Ile Ser Phe Thr Ser Ile Asn Leu Tyr Ala Thr Phe Phe Pro Met Ala His Asn Thr Ala Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn Asp Gln Ser Phe Asn Asp Tyr Leu Ser Ala Ala Asn Met Leu Tyr Pro Ile Pro Ala Asn Ala Thr Asn Val Pro Ile Ser Ile Pro Ser Arg Asn Trp Ala Ala Phe Arg Gly Trp Ser Phe Thr Arg Leu Lys Thr Lys Glu Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr Phe Val Tyr Ser Gly Ser Ile Pro Tyr Leu Asp Gly Thr Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser Ile Thr Phe Asp Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr Pro Asn Glu Phe Glu Ile Lys Arg Thr Val Asp Gly Glu Gly Tyr Asn Val Ala Gln Cys Asn Met Thr Lys Asp Trp Phe Leu Val Gln Met Leu Ala His Tyr Asn Ile Gly Tyr Gln Gly Phe Tyr Val Pro Glu Gly Tyr Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gin Pro Met Ser Arg Gin Val Val Asp Glu Val Asn Tyr Lys Asp Tyr Gln Ala Val Thr Leu Ala Tyr Gln His Asn Asn Ser Gly Phe Val Gly Tyr Leu Ala Pro Thr Met Arg Gln Gly Gln Pro Tyr Pro Ala Xaa Tyr Pro Tyr Pro Leu Ile Gly Lys Ser Ala Val Thr Ser Val Thr Gln Lys Lys Phe Leu Cys Asp Arg Val Met Trp Arg Ile Pro Phe Ser Ser Asn Phe Met Ser Met Gly Ala Leu Thr Asp Leu Gly Gln Asn Met Leu Tyr Ala Asn Ser Ala His Ala Leu Asp Met Asn Phe Glu Val Asp Pro Met Asp Glu Ser Thr Leu Leu Tyr Val Val Phe Glu Val Phe Asp Val Val Arg Val His Gln Pro His Arg Gly Val Ile Glu Ala Val Tyr Xaa Arg Thr Pro Phe Ser Ala Gly Asn Ala Thr Thr <210> 17 <211> 513 <212> PRT
<213> chimpanzee C68 adenovirus protein <220> <221> MISC FEATURE
<222> (511)..(511) <223> Xaa can be any amino acid <400> 17 Met Ala Gly Arg Gly Gly Ser Gln Ser Glu Arg Arg Arg Glu Arg Thr Pro Glu Arg Gly Arg Gly Ser Ala Ser His Pro Pro Ser Arg Gly Gly Glu Ser Pro Ser Pro Pro Pro Leu Pro Pro Lys Arg His Thr Tyr Arg Arg Val Ala Ser Asp Gln Glu Glu Glu Glu Ile Val Val Val Ser Glu Asn Ser Arg Ser Pro Ser Pro Ser Pro Thr Ser Pro Pro Pro Leu Pro Pro Lys Lys Lys Pro Arg Lys Thr Lys His Val Val Leu Gln Asp Val Ser Gln Asp Ser Glu Asp Glu Arg Gln Ala Glu Glu Glu Leu Ala Ala Val Gly Phe Ser Tyr Pro Pro Val Arg Ile Thr Glu Lys Asp Gly Lys Arg Ser Phe Glu Thr Leu Asp Glu Ser Asp Pro Leu Ala Ala Ala Ala Ser Ala Lys Met Met Val Lys Asn Pro Met Ser Leu Pro Ile Val Ser Ala Trp Glu Lys Gly Met Glu Ile Met Thr Met Leu Met Asp Arg Tyr Arg Val Glu Thr Asp Leu Lys Ala Asn Phe Gln Leu Met Pro Glu Gln Gly Glu Val Tyr Arg Arg Ile Cys His Leu Tyr Ile Asn Glu Glu His Arg Gly Ile Pro Leu Thr Phe Thr Ser Asn Lys Thr Leu Thr Thr Met Met Gly Arg Phe Leu Gln Gly Phe Val His Ala His Ser Gln Ile Ala His Lys Asn Trp Glu Cys Thr Gly Cys Ala Leu Trp Leu His Gly Cys Thr Glu Ala Glu Gly Lys Leu Arg Cys Leu His Gly Thr Thr Met Ile Gln Lys Glu His Met Ile Glu Met Asp Val Ala Ser Glu Asn Gly Gln Arg Ala Leu Lys Glu Asn Pro Asp Arg Ala Lys Ile Thr Gln Asn Arg Trp Gly Arg Ser Val Val Gln Leu Ala Asn Asn Asp Ala Arg Cys Cys Val His Asp Ala Gly Cys Ala Thr Asn Gln Phe Ser Ser Lys Ser Cys Gly Val Phe Phe Thr Glu Gly Ala Lys Ala Gin Gln Ala Phe Arg Gln Leu Glu Ala Phe Met Lys Ala Met Tyr Pro Gly Met Asn Ala Asp Gln Ala Gin Met Met Leu Ile Pro Leu His Cys Asp Cys Asn His Lys Pro Gly Cys Val Pro Thr Met Gly Arg Gln Thr Cys Lys Met Thr Pro Phe Gly Met Ala Asn Ala Glu Asp Leu Asp Val Glu Ser Ile Thr Asp Ala Thr Val Leu Ala Ser Val Lys His Pro Ala Leu Met Val Phe Gln Cys Cys Asn Pro Val Tyr Arg Asn Ser Arg Ala Gln Asn Ala Gly Pro Asn Cys Asp Phe Lys Ile Ser Ala Pro Asp Leu Leu Gly Ala Leu Gln Leu Thr Arg Lys Leu Trp Thr Asp Ser Phe Pro Asp Thr Pro Leu Pro Lys Leu Leu Ile Pro Glu Phe Lys Trp Leu Ala Lys Tyr Gln Phe Arg Asn Val Ser Leu Pro Ala Gly His Ala Glu Thr Arg Lys Asn Pro Xaa Asp Phe <210> 18 <211> 222 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 18 Met Pro Arg Gly Asn Lys Lys Leu Lys Val Glu Leu Pro Pro Val Glu Asp Leu Glu Glu Asp Trp Glu Asn Ser Ser Gin Ala Glu Glu Glu Glu Met Glu Glu Asp Trp Asp Ser Thr Gln Ala Glu Glu Asp Ser Leu Gln Asp Ser Leu Glu Glu Asp Glu Glu Glu Ala Glu Glu Glu Val Glu Glu Ala Ala Ala Ala Arg Pro Ser Ser Ser Ala Gly Glu Lys Ala Ser Ser Thr Asp Thr Ile Ser Ala Pro Gly Arg Gly Pro Ala Arg Pro His Ser Arg Trp Asp Glu Thr Gly Arg Phe Pro Asn Pro Thr Thr Gln Thr Ala Pro Thr Thr Ser Lys Lys Arg Gln Gln Gln Gln Lys Lys Thr Ser Arg Lys Pro Ala Ala Arg Lys Ser Thr Ala Ala Ala Ala Gly Gly Leu Arg Ile Ala Ala Asn Glu Pro Ala Gln Thr Arg Glu Leu Arg Asn Arg Ile Phe Pro Thr Leu Tyr Ala Ile Phe Gln Gln Ser Arg Gly Gln Glu Gln Glu Leu Lys Val Lys Asn Arg Ser Leu Arg Ser Leu Thr Arg Ser Cys Leu Tyr His Lys Ser Glu Asp Gln Leu Gln Arg Thr Leu Glu Asp Ala Glu Ala Leu Phe Asn Lys Tyr Cys Ala Leu Thr Leu Lys Glu <210> 19 <211> 227 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 19 Met Ser Lys Glu Ile Pro Thr Pro Tyr Met Trp Ser Tyr Gln Pro Gln Met Gly Leu Ala Ala Gly Ala Ala Gln Asp Tyr Ser Thr Arg Met Asn Trp Leu Ser Ala Gly Pro Ala Met Ile Ser Arg Val Asn Asp Ile Arg Ala His Arg Asn Gln Ile Leu Leu Glu Gln Ser Ala Leu Thr Ala Thr Pro Arg Asn His Leu Asn Pro Arg Asn Trp Pro Ala Ala Leu Val Tyr Gln Glu Ile Pro Gln Pro Thr Thr Val Leu Leu Pro Arg Asp Ala Gln Ala Glu Val Gln Leu Thr Asn Ser Gly Val Gln Leu Ala Gly Gly Ala Thr Leu Cys Arg His Arg Pro Ala Gln Gly Ile Lys Arg Leu Val Ile Arg Gly Arg Ser Thr Gln Leu Asn Asp Glu Val Val Ser Ser Ser Leu Gly Leu Arg Pro Asp Gly Val Phe Gln Leu Ala Gly Ser Gly Arg Ser Ser Phe Thr Pro Arg Gln Ala Val Leu Thr Leu Glu Ser Ser Ser Ser Gln Pro Arg Ser Gly Gly Ile Gly Thr Leu Gln Phe Val Glu Glu Phe Thr Pro Ser Val Tyr Phe Asn Pro Phe Ser Gly Ser Pro Gly His Tyr Pro Asp Glu Phe Ile Pro Asn Phe Asp Ala Ile Ser Glu Ser Val Asp Gly Tyr Asp <210> 20 <211> 106 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 20 Met Ser His Gly Giy Ala Ala Asp Leu Ala Arg Leu Arg His Leu Asp His Cys Arg Arg Phe Arg Cys Phe Ala Arg Asp Leu Ala Glu Phe Ala Tyr Phe Glu Leu Pro Glu Glu His Pro Gln Gly Pro Ala His Gly Val Arg Ile Val Val Glu Gly Gly Leu Asp Ser His Leu Leu Arg Ile Phe Ser Gln Arg Pro Ile Leu Val Glu Arg Glu Gln Gly Gln Thr Leu Leu Thr Leu Tyr Cys Ile Cys Asn His Pro Gly Leu His Glu Ser Leu Cys Cys Leu Leu Cys Thr Glu Tyr Asn Lys Ser <210> 21 <211> 146 <212> PRT
<213> chimpanzee C68 adenovirus protein <220> <221> MISC FEATURE
<222> (62)..(62) <223> Xaa can be any amino acid <400> 21 Met Lys Val Phe Val Val Cys Cys Val Leu Ser Ile Ile Lys Ala Glu Thr Ala Thr Thr Pro Asp Phe Arg Val Ser Lys Leu Gln Leu Phe Gln Pro Phe Leu Pro Gly Thr Tyr Gln Cys Val Ser Gly Pro Cys His His Thr Phe His Leu Ile Pro Asn Thr Thr Ala Ser Leu Pro Xaa Thr Asn Asn Gin Thr Asn Leu His Gln Arg His Arg Arg Asp Leu Ser Glu Ser Asn Thr Thr Thr His Thr Gly Gly Glu Leu Arg Gly Gln Pro Thr Ser Gly Ile Tyr Tyr Gly Pro Trp Glu Val Val Gly Leu Ile Thr Leu Gly Leu Val Ala Gly Gly Leu Leu Val Leu Cys Tyr Leu Tyr Leu Pro Cys Cys Ser Tyr Leu Val Val Leu Cys Cys Trp Phe Lys Lys Trp Gly Arg Ser Pro <210> 22 <211> 176 <212> PRT
<213> chimpanzee C68 adenovirus protein <220>
<221> MISC FEATURE
<222> (28)_.(28) <223> Xaa can be any amino acid <400> 22 Met Gly Lys Ile Thr Leu Val Ser Cys Gly Ala Leu Val Ala Val Leu Leu Ser Ile Val Gly Leu Gly Gly Ala Ala Val Xaa Lys Glu Lys Ala Asp Pro Cys Leu His Phe Asn Pro Asn Lys Cys Gln Leu Ser Phe Gln Pro Asp Gly Asn Arg Cys Ala Val Leu Ile Lys Cys Gly Trp Glu Cys Glu Asn Val Arg Ile Glu Tyr Asn Asn Lys Thr Arg Asn Asn Thr Leu Ala Ser Val Trp Gln Pro Gly Asp Pro Glu Trp Tyr Thr Val Ser Val Pro Gly Ala Asp Gly Ser Pro Arg Thr Val Asn Asn Thr Phe Ile Phe Ala His Met Cys Asp Thr Val Met Trp Met Ser Lys Gln Tyr Asp Met Trp Pro Pro Thr Lys Glu Asn Ile Val Val Phe Ser Ile Ala Tyr Ser Leu Cys Thr Ala Leu Ile Thr Ala Ile Val Cys Leu Ser Ile His Met Leu Ile Ala Ile Arg Pro Arg Asn Asn Ala Glu Lys Glu Lys Gln Pro <210> 23 <211> 204 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 23 Met Ala Ser Val Lys Phe Leu Leu Leu Phe Ala Ser Leu Ile Ala Val Ile His Gly Met Ser Asn Glu Lys Ile Thr Ile Tyr Thr Gly Thr Asn His Thr Leu Lys Gly Pro Glu Lys Ala Thr Glu Val Ser Trp Tyr Cys Tyr Phe Asn Glu Ser Asp Val Ser Thr Glu Leu Cys Gly Asn Asn Asn Lys Lys Asn Glu Ser Ile Thr Leu Ile Lys Phe Gln Cys Gly Ser Asp Leu Thr Leu Ile Asn Ile Thr Arg Asp Tyr Val Gly Met Tyr Tyr Gly Thr Thr Ala Gly Ile Ser Asp Met Glu Phe Tyr Gln Val Ser Val Ser Glu Pro Thr Thr Pro Arg Met Thr Thr Thr Thr Lys Thr Thr Pro Val Thr Thr Met Gln Leu Thr Thr Asn Asn Ile Phe Ala Met Arg Gln Met Val Asn Asn Ser Thr Gln Pro Thr Pro Pro Ser Glu Glu Ile Pro Lys Ser Met Ile Gly Ile Ile Val Ala Val Val Val Cys Met Leu Ile Ile Ala Leu Cys Met Val Tyr Tyr Ala Phe Cys Tyr Arg Lys His Arg Leu Asn Asp Lys Leu Glu His Leu Leu Ser Val Glu Phe <210> 24 <211> 91 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 24 Met Ile Pro Arg Gln Phe Leu Ile Thr Ile Leu Ile Cys Leu Leu Gln Val Cys Ala Thr Leu Ala Leu Val Ala Asn Ala Ser Pro Asp Cys Ile Gly Pro Phe Ala Ser Tyr Val Leu Phe Ala Phe Thr Thr Cys Ile Cys Cys Cys Ser Ile Val Cys Leu Leu Ile Thr Phe Phe Gln Phe Ile Asp Trp Ile the Val Arg Ile Ala Tyr Leu Arg His His Pro Gln Tyr Arg Asp Gln Arg Val Ala Arg Leu Leu Arg Leu Leu <210> 25 <211> 143 <212> PRT
<213> chimpanzee C68 adenovirus protein <220>
<221> MISC FEATURE
<222> (5)._(5) <223> Xaa can be any amino acid <400> 25 Met Arg Ala Val Xaa Leu Leu Ala Leu Leu Leu Leu Val Leu Pro Arg Pro Val Asp Pro Arg Ser Pro Thr Gln Ser Pro Glu Glu Val Arg Lys Cys Lys Phe Gln Glu Pro Trp Lys Phe Leu Lys Cys Tyr Arg Gln Lys Ser Asp Met His Pro Ser Trp Ile Met Ile Ile Gly Ile Val Asn Ile Leu Ala Cys Thr Leu Ile Ser Phe Val Ile Tyr Pro Cys Phe Asp Phe Gly Trp Asn Ser Pro Glu Ala Leu Tyr Leu Pro Pro Glu Pro Asp Thr Pro Pro Gln Gln Pro Gln Ala His Ala Leu Pro Pro Leu Gln Pro Arg Pro Gln Tyr Met Pro Ile Leu Asp Tyr Glu Ala Glu Pro Gln Arg Pro Met Leu Pro Ala Ile Ser Tyr Phe Asn Leu Thr Gly Gly Asp Asp <210> 26 <211> 135 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 26 Met Thr Asp Pro Leu Ala Asn Asn Asn Val Asn Asp Leu Leu Leu Asp Met Asp Gly Arg Ala Ser Glu Gln Arg Leu Ala Gln Leu Arg Ile Arg Gln Gln Gln Glu Arg Ala Val Lys Glu Leu Gln Asp Ala Val Ala Ile His Gln Cys Lys Arg Gly Ile Phe Cys Leu Val Lys Gln Ala Lys Ile Ser Tyr Glu Val Thr Pro Asn Asp His Arg Leu Ser Tyr Glu Leu Leu Gln Gln Arg Gln Lys Phe Thr Cys Leu Val Gly Val Asn Pro Ile Val Ile Thr Gln Gln Ser Gly Asp Thr Lys Gly Cys Ile His Cys Ser Cys Asp Ser Pro Asp Cys Val His Thr Leu Ile Lys Thr Leu Cys Gly Leu Arg Asp Leu Leu Pro Met Asn <210> 27 <211> 425 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 27 Met Ser Lys Lys Arg Val Arg Val Asp Asp Asp Phe Asp Pro Val Tyr Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val Pro Phe Ile Asn Pro Pro Phe Val Ser Ser Asp Gly Phe Gln Glu Lys Pro Leu Gly Val Leu Ser Leu Arg Leu Ala Asp Pro Val Thr Thr Lys Asn Gly Glu Ile Thr Leu Lys Leu Gly Glu Gly Val Asp Leu Asp Ser Ser Gly Lys Leu Ile Ser Asn Thr Ala Thr Lys Ala Ala Ala Pro Leu Ser Phe Ser Asn Asn Thr Ile Ser Leu Asn Met Asp His Pro Phe Tyr Thr Lys Asp Gly Lys Leu Ser Leu Gln Val Ser Pro Pro Leu Asn Ile Leu Arg Thr Ser Ile Leu Asn Thr Leu Ala Leu Gly Phe Gly Ser Gly Leu Gly Leu Arg Gly Ser Ala Leu Ala Val Gln Leu Val Ser Pro Leu Thr Phe Asp Thr Asp Gly Asn Ile Lys Leu Thr Leu Asp Arg Gly Leu His Val Thr Thr Gly Asp Ala Ile Glu Ser Asn Ile Ser Trp Ala Lys Gly Leu Lys Phe Glu Asp Gly Ala Ile Ala Thr Asn Ile Gly Asn Gly Leu Glu Phe Gly Ser Ser Ser Thr Glu Thr Gly Val Asp Asp Ala Tyr Pro Ile Gln Val Lys Leu Gly Ser Gly Leu Ser Phe Asp Ser Thr Gly Ala Ile Met Ala Gly Asn Lys Glu Asp Asp Lys Leu Thr Leu Trp Thr Thr Pro Asp Pro Ser Pro Asn Cys Gln Ile Leu Ala Glu Asn Asp Ala Lys Leu Thr Leu Cys Leu Thr Lys Cys Gly Ser Gln Ile Leu Ala Thr Val Ser Val Leu Val Val Gly Ser Gly Asn Leu Asn Pro Ile Thr Gly Thr Val Ser Ser Ala Gln Val Phe Leu Arg Phe Asp Ala Asn Gly Val Leu Leu Thr Glu His Ser Thr Leu Lys Lys Tyr Trp Gly Tyr Arg Gln Gly Asp Ser Ile Asp Gly Thr Pro Tyr Thr Asn Ala Val Gly Phe Met Pro Asn Leu Lys Ala Tyr Pro Lys Ser Gln Ser Ser Thr Thr Lys Asn Asn Ile Val Gly Gln Val Tyr Met Asn Gly Asp Val Ser Lys Pro Met Leu Leu Thr Ile Thr Leu Asn Gly Thr Asp Asp Ser Asn Ser Thr Tyr Ser Met Ser Phe Ser Tyr Thr Trp Thr Asn Gly Ser Tyr Val Gly Ala Thr Phe Gly Ala Asn Ser Tyr Thr Phe Ser Tyr Ile Ala Gln Glu <210> 28 <211> 83 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 28 Ile Thr Val Ile Pro Thr Thr Glu Asp Asn Pro Gln Leu Leu Ser Cys Glu Val Gln Met Arg Glu Cys Pro Glu Gly Phe Ile Ser Leu Thr Asp Pro Arg Leu Ala Arg Ser Glu Thr Val Trp Asn Val Glu Thr Lys Ser Met Ser Ile Thr Asn Gly Ile Gin Met Phe Lys Ala Val Arg Gly Glu Arg Val Val Tyr Ser Met Ser Trp Glu Gly Giy Gly Lys Ile Thr Ala Arg Ile Leu <210> 29 <211> 301 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 29 Met Ser Glu Ser Asn Cys Ile Met Thr Arg Ser Arg Thr Arg Ser Ala Ala Ser Arg His His Pro Tyr Arg Pro Ala Pro Leu Pro Arg Cys Glu Glu Thr Glu Thr Arg Ala Ser Leu Val Glu Asp His Pro Val Leu Pro Asp Cys Asp Thr Leu Ser Met His Asn Val Ser Ser Val Arg Gly Leu Pro Cys Ser Ala Gly Phe Ala Val Leu Gln Glu Phe Pro Val Pro Trp Asp Met Val Leu Thr Pro Glu Glu Leu Arg Val Leu Lys Arg Cys Met Ser Ile Cys Leu Cys Cys Ala Asn Ile Asp Leu Phe Ser Ser Gln Met Ile His Gly Tyr Glu Arg Trp Val Leu His Cys His Cys Arg Asp Pro Gly Ser Leu Arg Cys Met Ala Gly Gly Ala Val Leu Ala Leu Trp Phe Arg Arg Ile Ile Arg Gly Cys Met Phe Asn Gln Arg Val Met Trp Tyr Arg Glu Val Val Asn Arg His Met Pro Lys Glu Ile Met Tyr Val Gly Ser Val Phe Trp Arg Gly His His Leu Ile Tyr Leu Arg Ile Trp Tyr Asp Gly His Val Gly Ser Ile Leu Pro Ala Met Ser Phe Gly Trp Ser Val Leu Asn Tyr Gly Leu Leu Asn Asn Leu Val Val Leu Cys Cys Thr Tyr Cys Ser Asp Leu Ser Glu Ile Arg Met Arg Cys Cys Ala Arg Arg Thr Arg Arg Leu Met Leu Arg Ala Val Gly Ile Met Leu Arg Glu Ser Leu Asp Pro Asp Pro Leu Ser Ser Ser Leu Thr Glu Arg Arg Arg Gln Arg Leu Leu Arg Gly Leu Met Arg His His Arg Pro Ile Pro Phe Ala Asp Tyr Asp Ser His Arg Arg Ser Ser Ala Ser Ser Arg <210> 30 <211> 121 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 30 Met Val Leu Pro Val Leu Pro Ser Pro Ala Val Thr Glu Thr Gln Gln Asn Cys Ile Ile Trp Leu Gly Leu Ala His Ser Thr Val Val Asp Val Ile Arg Ala Ile Arg His Asp Gly Ile Phe Ile Thr Pro Glu Ala Leu Asp Leu Leu His Gly Leu Arg Glu Trp Leu Phe Tyr Asn Phe Asn Thr Glu Arg Ser Lys Arg Arg Asp Arg Arg Arg Arg Ser Val Cys Ser Ala Arg Thr Arg Phe Cys Tyr Ser Lys Tyr Glu Asn Val Arg Lys Gln Leu His His Asp Thr Val Ala Asn Thr Ile Ser Arg Val Pro Pro Ser Pro Val Ser Ala Gly Pro Leu Thr Thr Leu <210> 31 <211> 117 <212> PRT
<213> chimpanzee C68 adenovirus protein <220> <221> MISC FEATURE
<222> (45)..(45) <223> Xaa can be any amino acid <400> 31 Met Arg Val Cys Leu Arg Met Pro Val Glu Gly Ala Leu Arg Glu Leu Phe Ile Met Ala Gly Leu Asp Leu Pro His Glu Leu Val Arg Ile Ile Gln Gly Trp Lys Asn Glu Asn Tyr Leu Gly Met Val Xaa Glu Cys Asn Met Met Ile Glu Glu Leu Glu Asn Pro Pro Ala Phe Ala Ile Val Leu Phe Leu Asp Val Arg Val Glu Ala Leu Leu Glu Ala Thr Val Glu His Leu Glu Asn Arg Ile Thr Phe Asp Leu Ala Val Ile Phe His Gln His Ser Gly Gly Glu Arg Cys His Leu Arg Asp Leu His Phe Glu Val Leu Arg Asp Arg Leu Asp <210> 32 <211> 129 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 32 Met Leu Glu Arg Thr Ala Cys Ile Tyr Phe Ile Val Val Pro Glu Ala Leu Asn Val His Leu Glu Asp Phe Ser Phe Val Asp Phe Leu Lys Asn Cys Leu Gly Asp Phe Leu Ser Ser Tyr Leu Glu Asp Ile Thr Gly Ser Ser Gln His Ala Tyr Ser Ser Leu Ala Phe Gly Asn Ala His Trp Gly Gly Leu Arg Phe Ile Cys Thr Val Ala Cys Pro Asn Leu Ile Pro Gly Gly Pro Met Ala Lys Asn Phe Gly Glu Asp Met Lys Glu Tyr Leu Gln Leu Leu Leu Arg Glu Glu Leu Arg Asp Arg Gly Arg Asp Phe Asp Ile Pro Leu Val Asn Leu Leu Gln Val Asn Gln Glu Gln Asn Ile Leu Glu Leu <210> 33 <211> 36521 <212> DNA
<213> chimpanzee C68 adenovirus <220>
<221> misc feature <222> (8268)..(8268) <223> can be a or c or g or t <220>
<221> misc feature <222> (8322)..(8322) <223> can be a or c or g or t <220>
<221> misc feature <222> (8535)..(8535) <223> can be a or c or g or t <220>
<221> misc feature <222> (16753)..(16753) <223> can be a or c or g or t <220>
<221> misc feature <222> (28095)..(28095) <223> can be a or c or g or t <220>
<221> misc feature <222> (29373)..(29373) <223> can be a or c or g or t <220>
<221> misc feature <222> (30447)..(30447) <223> can be a or c or g or t <220>
<221> misc feature <222> (31015)..(31015) <223> can be a or c or g or t <400> 33 ccttcttcaa taatatacct tcaaactttt tgtgcgcgtt aatatgcaaa tgaggcgttt 60 gaatttgggg aggaagggcg gtgattggtc gagggatgag cgaccgttag gggcggggcg 120 agtgacgttt tgatgacgtg gttgcgagga ggagccagtt tgcaagttct cgtgggaaaa 180 gtgacgtcaa acgaggtgtg gtttgaacac ggaaatactc aattttcccg cgctctctga 240 caggaaatga ggtgtttctg ggcggatgca agtgaaaacg ggccattttc gcgcgaaaac 300 tgaatgagga agtgaaaatc tgagtaattt cgcgtttatg gcagggagga gtatttgccg 360 agggccgagt agactttgac cgatcacgtg ggggtttcga ttaccgtgtt tttcacctaa 420 atttccgcgt acggtgtcaa agtccggtgt ttttacgtag gtgtcagctg atcgccaggg 480 tatttaaacc tgcgctctcc agtcaagagg ccactcttga gtgccagcga gaagagtttt 540 ctcctccgcg ccgcgagtca gatctacact ttgaaagatg aggcacctga gagacctgcc 600 cgatgagaaa atcatcatcg cttccgggaa cgagattctg gaactggtgg taaatgccat 660 gatgggcgac gaccctccgg agccccccac cccatttgag acaccttcgc tgcacgattt 720 gtatgatctg gaggtggatg tgcccgagga cgatcccaat gaggaggcgg taaatgattt 780 ttttagcgat gccgcgctgc tagctgccga ggaggcttcg agctctagct cagacagcga 840 ctcttcactg cataccccta gacccggcag aggtgagaaa aagatccccg agcttaaagg 900 ggaagagatg gacttgcgct gctatgagga atgcttgccc ccgagcgatg atgaggacga 960 gcagggaatc cagaacgcag cgagccaggg agtgcaagcc gccagcgaga gctttgcgct 1020 ggactgcccg cctctgcccg gacacggctg taagtcttgt gaatttcatc gcatgaatac 1080 tggagataaa gctgtgttgt gtgcactttg ctatatgaga gcttacaacc attgtgttta 1140 cagtaagtgt gattaagttg aactttagag ggaggcagag agcagggtga ctgggcgatg 1200 actggtttat ttatgtatat atgttcttta tataggtccc gtctctgacg cagatgatga 1260 gacccccact acaaagtcca cttcgtcacc cccagaaatt ggcacatctc cacctgagaa 1320 tattgttaga ccagttcctg ttagagccac tgggaggaga gcagctgtgg aatgtttgga 1380 tgacttgcta cagggtgggg ttgaaccttt ggacttgtgt acccggaaac gccccaggca 1440 ctaagtgcca cacatgtgtg tttacttgag gtgatgtcag tatttatagg gtgtggagtg 1500 caataaaaaa tgtgttgact ttaagtgcgt ggtttatgac tcaggggtgg ggactgtgag 1560 tatataagca ggtgcagacc tgtgtggtta gctcagagcg gcatggagat ttggacggtc 1620 ttggaagact ttcacaagac tagacagctg ctagagaacg cctcgaacgg agtctcttac 1680 ctgtggagat tctgcttcgg tggcgaccta gctaggctag tctacagggc caaacaggat 1740 tatagtgaac aatttgaggt tattttgaga gagtgttctg gtctttttga cgctcttaac 1800 ttgggccatc agtctcactt taaccagagg atttcgagag cccttgattt tactactcct 1860 ggcagaacca ctgcagcagt agcctttttt gcttttattc ttgacaaatg gagtcaagaa 1920 acccatttca gcagggatta ccagctggat ttcttagcag tagctttgtg gagaacatgg 1980 aagtgccagc gcctgaatgc aatctccggc tacttgccgg tacagccgct agacactctg 2040 aggatcctga atctccagga gagtcccagg gcacgccaac gtcgccagca gcagcagcag 2100 gaggaggatc aagaagagaa cccgagagcc ggcctggacc ctccggcgga ggaggaggag 2160 tagctgacct gtttcctgaa ctgcgccggg tgctgactag gtcttcgagt ggtcgggaga 2220 gggggattaa gcgggagagg catgatgaga ctaatcacag aactgaactg actgtgggtc 2280 tgatgagtcg caagcgccca gaaacagtgt ggtggcatga ggtgcagtcg actggcacag 2340 atgaggtgtc ggtgatgcat gagaggtttt ctctagaaca agtcaagact tgttggttag 2400 agcctgagga tgattgggag gtagccatca ggaattatgc caagctggct ctgaggccag 2460 acaagaagta caagattact aagctgataa atatcagaaa tgcctgctac atctcaggga 2520 atggggctga agtggagatc tgtctccagg aaagggtggc tttcagatgc tgcatgatga 2580 atatgtaccc gggagtggtg ggcatggatg gggttacctt tatgaacatg aggttcaggg 2640 gagatgggta taatggcacg gtctttatgg ccaataccaa gctgacagtc catggctgct 2700 ccttctttgg gtttaataac acctgcatcg aggcctgggg tcaggtcggt gtgaggggct 2760 gcagtttttc agccaactgg atgggggtcg tgggcaggac caagagtatg ctgtccgtga 2820 agaaatgctt gtttgagagg tgccacctgg gggtgatgag cgagggcgaa gccagaatcc 2880 gccactgcgc ctctaccgag acgggctgct ttgtgctgtg caagggcaat gctaagatca 2940 agcataatat gatctgtgga gcctcggacg agcgcggcta ccagatgctg acctgcgccg 3000 gcgggaacag ccatatgctg gccaccgtac atgtggcttc ccatgctcgc aagccctggc 3060 ccgagttcga gcacaatgtc atgaccaggt gcaatatgca tctggggtcc cgccgaggca 3120 tgttcatgcc ctaccagtgc aacctgaatt atgtgaaggt gctgctggag cccgatgcca 3180 tgtccagagt gagcctgacg ggggtgtttg acatgaatgt ggaggtgtgg aagattctga 3240 gatatgatga atccaagacc aggtgccgag cctgcgagtg cggagggaag catgccaggt 3300 tccagcccgt gtgtgtggat gtgacggagg acctgcgacc cgatcatttg gtgttgccct 3360 gcaccgggac ggagttcggt tccagcgggg aagaatctga ctagagtgag tagtgttctg 3420 gggcggggga ggacctgcat gagggccaga ataactgaaa tctgtgcttt tctgtgtgtt 3480 gcagcagcat gagcggaagc ggctcctttg agggaggggt attcagccct tatctgacgg 3540 ggcgtctccc ctcctgggcg ggagtgcgtc agaatgtgat gggatccacg gtggacggcc 3600 ggcccgtgca gcccgcgaac tcttcaaccc tgacctatgc aaccctgagc tcttcgtcgt 3660 tggacgcagc tgccgccgca gctgctgcat ctgCCgccag ccccgtgcgc ggaatggcca 3720 tgggcgccgg ctactacggc actctggtgg ccaactcgag ttccaccaat aatcccgcca 3780 gcctgaacga ggagaagctg ttgctgctga tggcccagct cgaggccttg acccagcgcc 3840 tgggcgagct gacccagcag gtggctcagc tgcaggagca gacgcgggcc gcggttgcca 3900 cggtgaaatc caaataaaaa atgaatcaat aaataaacgg agacggttgt tgattttaac 3960 acagagtctg aatctttatt tgatttttcg cgcgcggtag gccctggacc accggtctcg 4020 atcattgagc acccggtgga tcttttccag gacccggtag aggtgggctt ggatgttgag 4080 gtacatgggc atgagcccgt cccgggggtg gaggtagctc cattgcaggg cctcgtgctc 4140 gggggtggtg ttgtaaatca cccagtcata gcaggggcgc agggcatggt gttgcacaat 4200 atctttgagg aggagactga tggccacggg cagccctttg gtgtaggtgt ttacaaatct 4260 gttgagctgg gagggatgca tgcgggggga gatgaggtgc atcttggcct ggatcttgag 4320 attggcgatg ttaccgccca gatcccgcct ggggttcatg ttgtgcagga ccaccagcac 4380 ggtgtatccg gtgcacttgg ggaatttatc atgcaacttg gaagggaagg cgtgaaagaa 4440 tttggcgacg cctttgtgcc cgcccaggtt ttccatgcac tcatccatga tgatggcgat 4500 gggcccgtgg gcggcggcct gggcaaagac gtttcggggg tcggacacat catagttgtg 4560 gtcctgggtg aggtcatcat aggccatttt aatgaatttg gggcggaggg tgccggactg 4620 ggggacaaag gtaccctcga tcccgggggc gtagttcccc tcacagatct gcatctccca 4680 ggctttgagc tcggaggggg ggatcatgtc cacctgcggg gcgataaaga acacggtttc 4740 cggggcgggg gagatgagct gggccgaaag caagttccgg agcagctggg acttgccgca 4800 gccggtgggg ccgtagatga ccccgatgac cggctgcagg tggtagttga gggagagaca 4860 gctgccgtcc tcccggagga ggggggccac ctcgttcatc atctcgcgca cgtgcatgtt 4920 ctcgcgcacc agttccgcca ggaggcgctc tccccccagg gataggagct cctggagcga 4980 ggcgaagttt ttcagcggct tgagtccgtc ggccatgggc attttggaga gggtttgttg 5040 caagagttcc aggcggtccc agagctcggt gatgtgctct acggcatctc gatccagcag 5100 acctcctcgt ttcgcgggtt gggacggctg cgggagtatg gcaccagacg atgggcgtcc 5160 agcgcagcca gggtccggtc cttccagggt cgcagcgtcc gcgtcagggt ggtctccgtc 5220 acggtgaagg ggtgcgcgcc gggctgggcg cttgcgaggg tgcgcttcag gctcatccgg 5280 ctggtcgaaa accgctcccg atcggcgccc tgcgcgtcgg ccaggtagca attgaccatg 5340 agttcgtagt tgagcgcctc ggccgcgtgg cctttggcgc ggagcttacc tttggaagtc 5400 tgcccgcagg cgggacagag gagggacttg agggcgtaga gcttgggggc gaggaagacg 5460 gactcggggg cgtaggcgtc cgcgccgcag tgggcgcaga cggtctcgca ctccacgagc 5520 caggtgaggt cgggctggtc ggggtcaaaa accagtttcc cgccgttctt tttgatgcgt 5580 ttcttacctt tggtctccat gagctcgtgt ctccgctggg tgacaaagag ctgtccgtgt 5640 ccccgtagac cgactttatg ggccggtcct cgagcggtgt gccgcggtcc tcctcgtaga 5700 ggaaccccgc ccactccgag acgaaagccc gggtccaggc cagcacgaaw gaggccacgt 5760 gggacgggta tcggtcgttg tccaccagcg ggtccacctt ttccagggta tgcaaacaca 5820 tgtccccctc gtccacatcc aggaaggtga ttggcttgta agtgtakgcc acgtgaccgg 5880 gggtcccggc cgggggggta taaaagggtg cgggtccctg ctcgtcctca ctgtcttccg 5940 gatcgctgtc cacgagcgcc agctgttggg gtaggtattc cctctcgaag gcgggcatga 6000 cctcggcact caggttgtca gtttctagaa acgaggagga tttgatattg acggtgccgg 6060 cggagatgcc tttcaagagc ccctcgtcca tctggtcaga aaagacgatc tttttgttgt 6120 cgagcttggt ggcgaaggag ccgtakaggg cgttggagag gagcttggcg atggagcgca 6180 tggtctggtt tttttccttg tcggcgcgct ccttggcggc gatgttgagc tgcacgtact 6240 cgcgcgccac gcacttccat tcggggaaga cgtggtcagc tcgtcgggca cgattctgac 6300 ctgccagccc cgattatgca gggtgatgag gtccacactg gtggccacct cgccgcgcag 6360 gggctcatta ktccagcaga ggcgtccgcc cttgcgcgag cagaaggggg gcagggggtc 6420 cagcatgacc tcgtcggggg ggtcggcatc gatggtgaag atgccgggca ggaggtcggg 6480 gtcaaagtag ctgatggaag tggccagatc gtccagggca gcttgccatt cgcgcacggc 6540 cagcgcgcgc tcgtagggac tgaggggcgt gccccagggc atgggatggg taagcgcgga 6600 ggcgtacatg ccgcagatgt cgtagacgta gaggggctcc tcgaggatgc cgatgtaggt 6660 ggggtagcag cgccccccgc ggatgctggc gcgcacgtag tcatacagct cgtgcgaggg 6720 ggcgaggagc cccgggccca ggttggtgcg actgggcttt tcggcgcggt agacgatctg 6780 gcggaaaatg gcatgcgagt tggaggagat ggtgggcctt tggaagatgt tgaagtgggc 6840 gtggggcagt ccgaccgagt cgcggatgaa gtgggcgtag gagtcttgca gcttggcgac 6900 gagctcggcg gtgactagga cgtccagagc gcagtagtcg agggtctcct ggatgatgtc 6960 atacttgagc tgtccctttt gtttccacag ctcgcggttg agaaggaact cttcgcggtc 7020 cttccagtac tcttcgaggg ggaacccgtc ctgatctgca cggtaagagc ctagcatgta 7080 gaactggttg acggccttgt aggcgcagca gccyttctcc acggggargg cgtaggcctg 7140 ggcggccttg cgcagggagg tgtgcgtgag ggcgaaagtg tccctgacca tgaccttgag 7200 gaactggtgc ttgaagtcga tattgtcgca gcccccccgc tcccagagct ggaagtccgt 7260 gcgcttcttg taggcggggt tgggcaaagc gaaagtaaca tcgttgaaga ggatcttgcc 7320 cgcgcggggc ataaagttgc gagtgatgcg gaaaggttgg ggcacctcgg cccggttgtt 7380 gatgacctgg gcggcgagca cgatctcgtc gaagccgttg atgttgtggc ccacgatgta 7440 gagttccacg aatcgcggac ggcccttgac gtggggcagt ttcttgagct cctcgtaggt 7500 gagctcgtcg gggtcgctga gcccgtgctg ctcgagcgcc cagtcggcga gatgggggtt 7560 ggcgcggagg aaggaagtcc agagatccac ggccagggcg gtttgcagac ggtcccggta 7620 ctgacggaac tgctgcccga cggccatttt ttcgggggtg acgcagtaga aggtgcgggg 7680 gtccacgtgc cagcgatccc atttgagctg gagggcgaga tcgagggcga gctcgacgag 7740 ccggtcgtcc ccggagagtt tcatgaccag catgaagggg acgagctgct tgccgaagga 7800 ccccatccag gtgtaggttt ccacatcgta ggtgaggaag agcctttcgg tgcgaggatg 7860 cgagccgatg gggaagaact ggatctcctg ccaccaattg gaggaatggc tgttgatgtg 7920 atggaagtag aaatgccgac ggcgcgccga acactcgtgc ttgtgtttat acaagcggcc 7980 acagtgctcg caacgctgca cgggatgcac gtgctgcacg agctgtacct gagttccttt 8040 gacgaggaat ttcagtggga agtggagtcg tggcgcctgc atctcgtgct gtactacgtc 8100 gtggtggtcg gcctggccct cttctgcctc gatggtggtc atgctgacga gcccccgcgg 8160 gaggcaggtc cagacctcgg cgcgagcggg tcggagagcg aagacgaagg cgcgcaggcc 8220 ggagctgtcc agggtcctga gacgctgcgg agtcaggtca gtgggcancg gcggcgcgcg 8280 gttgacttgc argagttttt ccagggcgcg cgggaggtcc anatggtact tgatctccac 8340 cgcgccattg gtggcgaact ccatggcttg cagggtcccg tgcccctggg gtgtgaccac 8400 cgtcccccgt ttcttcttgg gcggctgggg cgacgggggc ggtgcctctt ccatggttag 8460 aascggcggc gaagacgcgc gccgggcggc aggggcggct cggggcccgg atgcaggggc 8520 ggcaggggca cttcngcgcc gcgcgcgggt aggttctggt actgcgcccg gagaaaactg 8580 gcgtgagcga cgacgcgacg gttgacgtcc tggatctgac gcctctgggt gaaggccacg 8640 ggacccgtga gtttgaacct gaaagaaagt tcgacagaat caatctcggt atcgttgacg 8700 gcggcctgcc gcaagatctc ttgcacgtcc cccgagttgt cctggtatgc gatctcggtc 8760 atgaactgct cgatctcctc ctcttgaagg tctccgcggc cggcgcgctc cacggtggcc 8820 gcgaagtcgt tggagatgcg gcccatgagc tgcgagaagg cgttcatgcc cgcctcgttc 8880 cagacgcggc tgtagaccac gacgccctcg ggatcgcggg cgcgcatgac cacctgggcg 8940 aggttgagct ccacgtggcg cgtgaagacc gcgtagttgc agaggcgctg gtagaggtag 9000 ttgagcgtgg tggcgatgtg ctcggtgacr aagaaataca tgatccagcg gcggagcggc 9060 atctcgctga cgtcgcccag cgcctccaaa cgttccatgg cctcgtaaaa gtccacggcg 9120 aagttgaaaa actgggagtt gcgcgccgag acggtcaact cctcctccag aagacggatg 9180 agctcggcga tggtggcgcg cacctcgcgc tcgaaggccc ccgggagttc ctccacttcc 9240 tcttcttcct cctccactaa catctcttct acttcctcct caggcggcag tggtggcggg 9300 ggagggggcc tgcgtcgccg gcggcgcacg ggcagacggt cgatgaagcg ctcgatggtc 9360 tcgccgcgcc ggcgtcgcat ggtctcggtg acggcgcgcc cgtcctcgcg gggccgcakc 9420 gtgaagacgc cgccgcgcat ctccaggtgg ccgggggggt cccccgttgg gcagggagag 9480 ggcgctgacg atgcatctta tcaattgccc cgtagggact ccgcgcaagg acctgagcgt 9540 ctcgagatcc acgggatctg aaaaccgctg aacgaaggct tcgaagccag tcgcagtcgc 9600 aaggtakgct gagcacggtt tcttctggcg ggtcatgttg gttgggagcg gggcgggcga 9660 tgctgctggt gatgaagttg aaataggcgg ttctgagacg gcggatggtg gcgargagca 9720 ccaggtcttt gggcccggct tgctggatgc gcagacggtc ggccatgccc caggcgtggt 9780 cctgacacct ggccaggtcc ttgtagtagt cctgcatgag ccgctccaac gggcacctcc 9840 tcctcgcccg cgcggccgtg catgcgcgtg agcccgaagc cgcgctgggg ctggacgagc 9900 gccaggtcgg cgacgacgcg ctcggcgagg atggcttgct ggatctgggt gagggtggtc 9960 tggaagtcat caaagtcgac gaagcggtgg taggctccgg tgttgatggt ggaggagcag 10020 ttggccatga cggaccagtt gacggtctgg tggcccggac gcacgagctc gtggtacttg 10080 aggcgcgagt aggcgcgcgt gtcgaagatg tagtcgttgc aggtgcgcac caggtactgg 10140 tagccgatga ggaagtgcgg cggcggctgg cggtagagcg gccatcgctc ggtggcgggg 10200 gcgccgggcg cgaggtcctc gagcatggtg cggtggtagc cgtagatgta cctggacatc 10260 caggtgatgc cggcggcggt ggtggaggcg cgcgggaact cgcggacgcg ttccagatgt 10320 tgcgcagcgg caggaagtag ttcatggtgg gcacggtctg gcccgtgagg cgcgcgcagt 10380 cgtggatgct ctatacgggc aaaaacgaaa gcggtcagcg gctcgactcc gtggcctgga 10440 ggctaagcga acgggttggg ctgcgcgtgt accccggttc gaatctcgaa tcaggctgga 10500 gccgcagcta acgtggtatt ggcactcccg tctmgaccca agcbtgcacc aaccctccag 10560 gatacggagg cgggtcgttt tgcaactttt ttttggaggc cggatgagac tagtaagcgc 10620 ggaaagcggc cgaccgcgat ggctcgtctg ccgtagtctg gagaagaatc gccagggttg 10680 cgttgcggtg tgccccggtt cgaggccggc cggattccgc ggctaacgag ggcgtggctg 10740 ccccgtcgtt tccaagaccc catagccagc cgacttctcc agttacggag cgaggtcctc 10800 ttttgttttg tttgtttttg ccagatgcat cccgtactgc ggcagatgcg cccccaccac 10860 cctccaccgc aacaacagcc ccctccacag ccggcgcttc tgcccccgcc ccagcagcaa 10920 cttccagcca cgaccgccgc ggccgccgtg agcggggctg gacagagtta tgatcaccag 10980 ctggccttgg aagagggcga ggggctggcg cgcctggggg cgtcgtcgcc ggagcggcac 11040 ccgcgcgtgc agatgaaaag ggacgctcgc gaggcatacg tgcccaagca gaacctgttc 11100 agagacagga gcggcgagga gcccgaggag atgcgcgcgg cccggttcca cgcggggcgg 11160 gagctgcggc gcggcctgga ccgaaagagg gtgctgaggg acgaggattt cgaggcggac 11220 gagctgacgg ggatcagccc cgcgcgcgcg cacgtggccg cggccaacct ggtcacggcg 11280 tacgagcaga ccgtgaagga ggagagcaac ttccaaaaat ccttcaacaa ccacgtgcgc 11340 accctgatcg cgcgcgagga ggtgaccctg ggcctgatgc acctgtggga cctgctggag 11400 gccatcgtgc agaaccccac cagcaagccg ctgacggcgc agctgttcct ggtggtgcag 11460 catagtcggg acaacgaagc gttcagggag gcgctgctga atatcaccga gcccgagggc 11520 cgctggctcc tggacctggt gaacattctg cagagcatcg tggtgcagga gcgcgggctg 11580 ccgctgtccg agaagctggc ggccatcaac ttctcggtgc tgagtttggg caagtactac 11640 gctaggaaga tctacaagac cccgtacgtg cccatagaca aggaggtgaa gatcgacggg 11700 ttttacatgc gcatgaccct gaaagtgctg accctgaggg acgatctggg ggtgtaccgc 11760 aacgacagga tgcaccgtgc ggtgagcgcc agcaggcggc gcgagctgag cgaccaggag 11820 ctgatgcata gtctgcagcg ggccctgacc ggggccggga ccgaggggga gagctacttt 11880 gacatgggcg cggacctgca ctggcagccc agccgccggg ccttggaggc ggcggcagga 11940 ccctacgtag aagaggtgga cgatgaggtg gacgaggagg gcgagtacct ggaagactga 12000 tggcgcgacc gtatttttgc tagatgcaac aacaacagcc acctcctgat cccgcgatgc 12060 gggcggcgct gcagagccag ccgtccggca ttaactcctc ggacgattgg acccaggcca 12120 tgcaacgcat catggcgctg acgacccgca accccgaagc ctttagacag cagccccagg 12180 ccaaccggct ctcggccatc ctggaggccg tggtgccctc gggctccaac cccacgcacg 12240 agaaggtcct ggccatcgtg aacgcgctgg tggagaacaa ggccatccgc ggcgacgagg 12300 ccggcctggt gtacaacgcg ctgctggagc gcgtggcccg ctacaacagc accaacgtgc 12360 agaccaacct ggaccgcatg gtgaccgacg tgcgcgaggc cgtggcccag cgcgagcggt 12420 tccaccgcga gtccaacctg ggatccatgg tggcgctgaa cgccttcctc agcacccagc 12480 ccgccaacgt gccccggggc caggaggact acaccaactt catcagcgcc ctgcgcctga 12540 tggtgaccga ggtgccccag agcgaggtgt accagtccgg gccggactac ttcttccaga 12600 ccagtcgcca gggcttgcag accgtgaacc tgagccaggc tttcaagaac ttgcagggcc 12660 tgtggggcgt gcaggccccg gtcggggacc gcgcgacggt gtcgagcctg ctgacgccga 12720 actcgcgcct gctgctgctg ctggtggccc ccttcacgga cagcggcagc atcaaccgca 12780 actcgtacct gggctacctg attaacctgt accgcgaggc catcggccag gcgcacgtgg 12840 acgagcagac ctaccaggag atcacccacg tgagccgcgc cctgggccag gacgacccgg 12900 gcaacctgga agccaccctg aactttttgc tgaccaaccg gtcgcagaag atcccgcccc 12960 agtacgcgct cagcaccgag gaggagcgca tcctgcgtta cgtgcagcag aagcgtgggc 13020 ctgttcctga tgcaggaggg ggccaccccc agcgccgcgc tcgacatgac cgcgcgcaac 13080 atggtagccc atcatgtacg ccagcaaccg cccgttcatc aataaactga tggactactt 13140 gcatcgggcg gccgccatga actctgacta tttcaccaac gccatcctga atccccactg 13200 gctcccgccg ccggggttct acacgggcga gtacgacatg cccgacccca atgacgggtt 13260 cctgtgggac gatgtggaca gcagcgtgtt ctccccccga ccgggtgcta acgagcgccc 13320 cttgtggaag aaggaaggca gcgaccgacg cccgtcctcg gcgctgtccg gccgcgaggg 13380 tgctgccgcg gcgctgtccg aggccgccag tcctttcccg agcttgccct tctcgctgaa 13440 cagtatccgc agcagcgagc tgggcaggat cacgcgcccg cgcttgctgg gcgaagagga 13500 gtacttgaat gactcgctgt tgagacccga gcgggagaag aacttcccca ataacgggat 13560 agaaagcctg gtggacaaga tgagccgctg gaagacgtat gcgcaggagc acagggacga 13620 tccccgggcg tcgcaggggg ccacgagccg gggcagcgcc gcccgtaaac gccggtggca 13680 cgacaggcag cggggacaga tgtgggacga tgaggactcc gccgacgaca gcagcgtgtt 13740 ggacttgggt gggagtggta acccgttcgc tcacctgcgc ccccgtatcg ggcgcatgat 13800 gtaagagaaa ccgaaaataa atgatactca ccaaggccat ggcgaccagc gtgcgttcgt 13860 ttcttctctg ttgttgttgt atctagtatg atgaggcgtg cgtacccgga gggtcctcct 13920 ccctcgtacg agagcgtgat gcagcaggcg atggcggcgg cggcgatgca gcccccgctg 13980 gaggctcctt acgtgccccc gcggtacctg gcgcctacgg aggggcggaa cagcattcgt 14040 tactcggagc tggcaccctt gtacgatacc acccggttgt acctggtgga caacaagtcg 14100 gcggacatcg cctcgctgaa ctaccagaac gaccacagca acttcctgac caccgtggtg 14160 cagaacaatg acttcacccc cacggaggcc agcacccaga ccatcaactt tgacgagcgc 14220 tcgcggtggg gcggccagct gaaaaccatc atgcacacca acatgcccaa cgtgaacgag 14280 ttcatgtaca gcaacaagtt caaggcgcgg gtgatggtct cccgcaagac ccccaatggg 14340 gtgacagtga cagaggatta tgatggtagt caggatgagc tgaagtatga atgggtggaa 14400 tttgagctgc ccgaaggcaa cttctcggtg accatgacca tcgacctgat gaacaacgcc 14460 atcatcgaca attacttggc ggtggggcgg cagaacgggg tgctggagag cgacatcggc 14520 gtgaagttcg acactaggaa cttcaggctg ggctgggacc ccgtgaccga gctggtcatg 14580 cccggggtgt acaccaacga ggctttccat cccgatattg tcttgctgcc cggctgcggg 14640 gtggacttca ccgagagccg cctcagcaac ctgctgggca ttcgcaagag gcagcccttc 14700 caggaaggct tccagatcat gtacgaggat ctggaggggg gcaacatccc cgcgctcctg 14760 gatgtcgacg cctatgagaa aagcaaggag gatgcagcag ctgaagcaac tgcagccgta 14820 gctaccgcct ctaccgaggt caggggcgat aattttgcaa gcgccgcagc agtggcagcg 14880 gccgaggcgg ctgaaaccga aagtaagata gtcattcagc cggtggagaa ggatagcaag 14940 aacaggagct acaacgtact accggacaag ataaacaccg cctaccgcag ctggtaccta 15000 gcctacaact atggcgaccc cgagaagggc gtgcgctcct ggacgctgct caccacctcg 15060 gacgtcacct gcggcgtgga gcaagtctac tggtcgctgc ccgacatgat gcaagacccg 15120 gtcaccttcc gctccacgcg tcaagttagc aactacccgg tggtgggcgc cgagctcctg 15180 cccgtctact ccaagagctt cttcaacgag caggccgtct actcgcagca gctgcgcgcc 15240 ttcacctcgc ttacgcacgt cttcaaccgc ttccccgaga accagatcct cgtccgcccg 15300 cccgcgccca ccattaccac cgtcagtgaa aacgttcctg ctctcacaga tcacgggacc 15360 ctgccgctgc gcagcagtat ccggggagtc cagcgcgtga ccgttactga cgccagacgc 15420 cgcacctgcc cctacgtcta caaggccctg ggcatagtcg cgccgcgcgt cctctcgagc 15480 cgcaccttct aaatgtccat tctcatctcg cccagtaata acaccggttg gggcctgcgc 15540 gcgcccagca agatgtacgg aggcgctcgc caacgctcca cgcaacaccc cgtgcgcgtg 15600 cgcgggcact tccgcgctcc ctggggcgcc ctcaaaggcc gcgtgcggtc gcgcaccacc 15660 gtcgacgacg tgatcgacca ggtggtggcs gacgcgcgca amtacacccc cgccgcggcg 15720 cctgtttcca cygtggacgc cgtcatcgac agcgtggtgg cggacgcgcg ccggtacgcc 15780 cgcgccaaga gccggcggcg gcgcatcgcc cggcggcacc ggagcacccc cgccatgcgc. 15840 gcggcgcgga gccttgttgc gcagggccag gcgcacggga cgcagggcca tgttcagggc 15900 ggccagacgc gcggcttcag gcgccagcgc cggcaggacc cggagacgcg cggccacggc 15960 ggcggcagcg gccatcgcca gcatgtcccg cccgcggcga gggaacgtgt actgggtgcg 16020 cgacgccgcc accggtgtgc gcgtgcccgt gcgcacccgc ccccctcgca cttgaagatg 16080 ttcacttcgc gatgttgatg tgtcccagcg gcgaggagga tgtccaagcg caaattcaag 16140 gaagagatgc tccaggtcat cgcgcctgag atatacggcc ctgcggtggt gaaggaggaa 16200 agaaagcccc gcaaaatcaa gcgggtcaaa aaggacaaaa aggaagaaga aagtgatgtg 16260 gacggattgg tggagtttgt gcgcgagttc gccccccggc ggcgcgtgca gtggcgcggg 16320 cggaaggtgc aaccggtgct gagacccggc accaccgtgg tcttcacgcc cggcgagcgc 16380 tccggcaccg cttccaagcg ctcctacgac gaggtgtacg gggatgatga tattctggag 16440 caggcggccg akcgcctggg cgagtttgct tacggcaagc gcagccgttc cgcaccgaag 16500 gaagaggcgg tgtccatccc gctggaccac ggcaacccca cgccgagcct caagcccgtg 16560 accttgcagc aggtgctgcc gaccgcggcg ccgcgccggg ggttcaagcg cgagggcgag 16620 gatctgtacc ccaccatgca gctgatggtg cccaagcgcc agaaghtgga agacgtgctg 16680 gagaccatga aggtggaccc ggacgtgcag cccgaggtca aggtgcggcc catcaagcag 16740 gtggccccgg gcntgggcgt gcagaccgtg gacatcwaga ttcccacgga gcccatggaa 16800 acgcagaccg agcccatgat caagcccagc accagcacca tggaggtgca gacggatccc 16860 tggatgccat cggctcctag tcgaagaccc cggcgcaagt acggcgcggc cagcctgctg 16920 atgcccaact acgcgctgca tccttccatc atccccacgc cgggctactg cggcacgcgc 16980 ttctaccgcg gtcataccag cagccgccgc cgcaagacca ccactcgccg ctcgccgtcg 17040 ccgcaccgcc gctgcaacca cccctgccgc cctggtgcgg agagtgtacc gccgcggccg 17100 cgcacctctg accctgccgc gcgcgcgcta ccacccgagc atcgccattt aaactttcgc 17160 cagctttgca gatcaatggc cctcacatga ccgccttcgc gttcccatta cgggctaccg 17220 aggaagaaaa ccgcgccgta gaaggctggc ggggaacggg atgcgtcgcc accaccaccg 17280 gcggcggcgc gccatcagca agcggttggg gggaggcttc ctgCCCgcgc tgatccccat 17340 catcgccgcg gcgatcgggg cgatccccgg cattgcttcc gtggcggtgc aggcctctca 17400 gcgccactga gacacacttg gaaacatctt gtaatagacc ratggactct gacgctcctg 17460 gtcctgtgat gtgttttcgt agacagatgg aagacatcaa tttttcgtcc ctggctccgc 17520 gacacggcac gcggccgttc atgggcacct ggagcgacat cggcaccagc caactgaacg 17580 ggggcgcctt caattggagc agtctctgga gcgggcttaa gaatttcggg tccacgctta 17640 aaacctatgg cagcaaggcg tggaacagca ccacagggca ggcgctgagg gataagctga 17700 aagagcagaa cttccagcag aaggtggtcg atgggctcgc ctcgggcatc aacggggtgg 17760 tggacctggc caaccaggcc gtgcagcggc agatcaacag ccgcctggac ccggtgccgc 17820 ccgccggctc cgtggagatg ccgcaggtgg aggaggagct gcctcccctg gacaagcggg 17880 gcgagaagcg accccgcccc gatgcggagg agacgctgct gacgcacacg gacgagccgc 17940 ccccgtacga ggaggcggtg aaactgggtc tgcccaccac gcggcccatc gcgcccctgg 18000 ccaccggggt gctgaaaccc gaaaagcccg cgaccctgga cttgcytcct ccccagcctt 18060 cccgcccatv tacagtggct aagcccctgc cgccggtggc cgtggcccgc gcgcgacccg 18120 ggggcaccgc ccgccctcat gcgaactggc agagcactct gaacagcatc gtgggtctgg 18180 gagtgcagag tgtgaagcgc cgccgctgmt attaaaccta ccgtagcgct taacttgctt 18240 gtctgtgtgt gtatgtatta tgtcgccgcc gcygctgtcc accagaagga ggagtgaaga 18300 ggggcggtgc cgagttgcra gatggccacc ccatcgatgc tgccccagtg ggcgtacatg 18360 cacatcgccg gacaggacgc ttcggagtac ctgagtccgg gtctggtgaa gtttgcccgc 18420 gccacagaca cctacttcag tctggggaac aagtttagga accccacggt ggcgcccacg 18480 caygatgtga ccaccgaccg cagccagcgg ctgacgctgc gcttcgtgcc cgtggaccgc 18540 gaggacaaca cctacttgta caaagtgcgc tacacgctgg ccgtgggcga caaccgcgtg 18600 ctggacatgg ccagcaccta ctttgacatc cgcggcgtgc tggatcgggg ccctagcttc 18660 aaaccctact ccggcaccgc ctacaacagt ctggccccca agggagcacc caacacttgt 18720 cagtggacat ataaagccga tggtgaaact gccacagaaa aaacctatac atatggaaat 18780 gcacccgtgc agggcattaa catcacaaaa gatggtattc aacttggaac tgacaccgat 18840 gatcagccaa tctacgcaga taaaacctat cagcctgaac ctcaagtggg tgatgctgaa 18900 tggcatgaca tcactggtac tgatgaaaag tatggaggca gagctcttaa gcctgatacc 18960 aaaatgaagc cttgttatgg ttcttttgcc aagcctacta ataaagaagg aggtcaggca 19020 aatgtgaaaa caggaacagg cactactaaa gaatatgaca tagacatggc tttctttgac 19080 aacagaagtg cggctgctgc tggcctagct ccagaaattg ttttgtatac tgaaaatgtg 19140 gatttggaaa ctgcagatac ccatattgta tacaaagcag gcacagatga cagcagctct 19200 tctattaatt tgggtcagca agccatgccc aacagaccta actacattgg tttcagagac 19260 aactttatcg ggctcatgta ctacaacagc actggcaata tgggggtgct ggccggtcag 19320 gcttctcagc tgaatgctgt ggttgacttg caagacagaa acaccgagct gtcctaccag 19380 ctcttgcttg actctctggg tgacagaacc cggtatttca gtatgtggaa tcaggcggtg 19440 gacagctatg atcctgatgt gcgcattatt gaaaatcatg gtgtggagga tgaacttccc 19500 aactattgtt tccctctgga tgctgttggc agaacagata cttatcaggg aattaaggct 19560 aatggaactg atcaaaccac atggaccaaa gatgacagtg tcaatgatgc taatgagata 19620 ggcaagggta atccattcgc catggaaatc aacatccaag ccaacctgtg gaggaacttc 19680 ctctacgcca acgtggccct gtacctgccc gactcttaca agtacacgcc ggccaatgtt 19740 accctgccca ccaacaccaa cacctacgat tacatgaacg gccgggtggt ggcgccctcg 19800 ctggtggact chtacatcaa catcggggcg cgctggtcgc tggatcccat ggacaacgtg 19860 aaccccttca accaccaccg caatggggcg ctgcgctacc gctccatgct cctgggcaac 19920 gggcgctacg tgcccttcca catccaggtg ccccagaaat ttttcgccat caagagcctc 19980 ctgctcctgc ccgggtccta cacctacgag tggaacttcc gcaaggacgt caacatgatc 20040 ctgcagagct ccctcggcaa cgacctgcgc acggacgggg cctccatctc cttcaccagc 20100 atcaacctct acgccacctt cttccccatg gcgcacaaca cggcctccac gctcgaggcc 20160 atgctgcgca acgacaccaa cgaccagtcc ttcaacgact acctctcggc ggccaacatg 20220 ctctacccca tcccggccaa cgccaccaac gtgcccatct ccatcccctc gcgcaactgg 20280 gccgccttcc gcggctggtc cttcacgcgt ctcaagacca aggagacgcc ctcgctgggc 20340 tccgggttcg acccctactt cgtctactcg ggctccatcc cctacctcga cggcaccttc 20400 tacctcaacc acaccttcaa gaaggtctcc atcaccttcg actcctccgt cagctggccc 20460 ggcaacgacc ggctcctgac gcccaacgag ttcgaaatca agcgcaccgt cgacggcgag 20520 ggctacaacg tggcccagtg caacatgacc aaggactggt tcctggtcca gatgctggcc 20580 cactacaaca tcggctacca gggcttctac gtgcccgagg gctacaagga ccgcatgtac 20640 tccttcttcc gcaacttcca gcccatgagc cgccaggtgg tggacgaggt caactacaag 20700 gactaccagg ccgtcaccct ggcctaccag cacaacaact cgggcttcgt cggctacctc 20760 gcgcccacca tgcgccaggg ccagccmtac cccgccaamt acccmtcccc gctcatcggc 20820 aagagcgccg tcaccagcgt cacccagaaa aagttcctct gcgacagggt catgtggcgc 20880 atccccttct ccagcaactt catgtccatg ggcgcgctca ccgacctcgg ccagaacatg 20940 ctctatgcca actccgccca cgcgctagac atgaatttcg aagtcgaccc catggatgag 21000 tccacccttc tctatgttgt cttcgaagtc ttcgacgtcg tccgagtgca ccagccccac 21060 cgcggcgtca tcgaggccgt ctacmtgcgc acccccttct cggccggtaa cgccaccacc 21120 taagctcttg cttcttgcaa gccatggccg cgggctccgg cgagcaggag ctcagggcca 21180 tcatccgcga cctgggctgc gggccmtact tcctgggcac sttcgataag cgcttcccgg 21240 gattcatggc cccgcacaag ctggcctgcg ccatcgtcaa cacggccggc cgcgagaccg 21300 ggggcgagca ctggctggcc ttcgcctgaa cccgcgctcg aacacctgct acctcttcga 21360 ccccttcggg ttctcggacg agcgcctcaa gcagatctac cagttcgagt acgagggcct 21420 gctgcgccgc agcgccctgg ccaccgagga ccgctgcgtc accctggaaa agtccaccca 21480 gaccgtgcag ggtccgcgct cggccgcctg cgggctcttc tgctgcatgt tcctgcacgc 21540 cttcgtgcac tggcccgacc gccccatgga caagaacccc accatgaact tgctgaaggg 21600 ggtgcccaac ggcatgctcc agtcgcccca ggtggaaccc accctgcgcc gcaaccagga 21660 ggcgctytac cgcttcctca actcccactc cgcmtacttt cgctcccacc gcgcgcgcat 21720 cgagaaggcc accgccttcg accgcatgaa tcaagacatg taaaccgtgt gtgtatgtta 21780 aatgtcttta ataaacagca ctttcatgtt acacatgcat ctgagatgat ttatttagaa 21840 atcsaaaggg ttcttccggg tctcggcatg gcccgcgggc agggacacgt tgcggaactg 21900 gtacttggcc agccacttga actcggggat cagcagtttg ggcagcgggg tgtcggggaa 21960 ggagtcggtc cacagcttcc gcgtcagttg cagggcgccc agcaggtcgg gcgcggagat 22020 cttgaaatcg cagttgggac ccgcgttctg cgcgcgggag ttgcggtaca cggggttgca 22080 gcactggaac accatcaggg ccgggtgctt cacgctcgcc agcaccgtcg cgtcggtgat 22140 gctctccacg tcgaggtcct cggcgttggC catcccgaag ggggtcatct tgcaggtctg 22200 ccttcccatg gtgggcacgc acccgggctt gtggttgcaa tcgcagtgca gggggatcag 22260 catcatctgg gcctggtcgg cgttcatccc cgggtacatg gccttcatga aagcctccaa 22320 ttgcctgaac gcctgctggg ccttggctcc ctcggtgaag aagaccccgc aggacttgct 22380 agagaactgg ttggtggcgc acccggcgtc gtgcacgcag cagcgcgcgt cgttgttggc 22440 cagctgcacc acgctgcgcc cccagcggtt ctgggtgatc ttggcccggt cggggttctc 22500 cttcagcgcg cgctgcccgt tctcgctcgc cacatccatc tcgatcatgt gctccttctg 22560 gatcatggtg gtcccgtgca ggcaccgcag cttgccctcg gcctcggtgc acccgtgcag 22620 ccacagcgcg cacccggtgc actcccagtt cttgtgggcg atctgggaat gcgcgtgcac 22680 gaagccctgc aggaagcggc ccatcatggt ggtcagggtc ttgttgctag tgaaggtcag 22740 cggaatgccg cggtgctcct cgttgatgta caggtggcag atgcggcggt acacctcgcc 22800 ctgctcgggc atcagctgga agttggcttt caggtcggtc tccacgcggt agcggtccat 22860 cagcatagtc atgatttcca tacccttctc ccaggccgag acgatgggca ggctcatagg 22920 gttcttcacc atcatcttag cgctagcagc cgcggccagg gggtcgctct cgtccagggt 22980 ctcaaagctc cgcttgccgt ccttctcggt gatccgcacc ggggggtagc tgaagcccac 23040 ggccgccagc tcctcctcgg cctgtctttc gtcctcgctg tcctggctga cgtcctgcag 23100 gaccacatgc ttggtcttgc ggggtttctt cttgggcggc agcggcggcg gagatgttgg 23160 agatggcgag ggggagcgcg agttctcgct caccactact atctcttcct cttcttggtc 23220 cgaggccacg cggcggtagg tatgtctctt cgggggcaga ggcggaggcg acgggctctc 23280 gccgccgcga cttggcggat ggctggcaga gccccttccg cgttcggggg tgcgctcccg 23340 gcggcgctct gactgacttc ctccgcggcc ggccattgtg ttctcctagg gaggaacaac 23400 aagcatggag actcagccat cgccaacctc gccatctgcc cccaccgccg acgagaagca 23460 gcagcagcag aatgaaagct taaccgcccc gccgcccagc cccgccacct ccgacgcggc 23520 cgtcccagac atgcaagaga tggaggaatc catcgagatt gacctgggct atgtgacgcc 23580 cgcggagcac gaggaggagc tggcagtgcg cttttcacaa gaagagatac accaagaaca 23640 gccagagcag gaagcagaga atgagcagag tcaggctggg ctcgagcatg acggcgacta 23700 cctccacctg agcggggggg aggacgcgct catcaagcat ctggcccggc aggccaccat 23760 cgtcaaggat gcgctgctcg accgcaccga ggtgcccctc agcgtggagg agctcagccg 23820 cgcctacgag ttgaacctct tctcgccgcg cgtgcccccc aagcgccagc ccaatggcac 23880 ctgcgagccc aacccgcgcc tcaacttcta cccggtcttc gcggtgcccg aggccctggc 23940 cacctaccac atctttttca agaaccaaaa gatccccgtc tcctgccgcg ccaaccgcac 24000 ccgcgccgac gcccttttca acctgggtcc cggcgcccgc ctacctgata tcgcctcctt 24060 ggaagaggtt cccaagatct tcgagggtct gggcagcgac gagactcggg ccgcgaacgc 24120 tctgcaagga gaaggaggag agcatgagca ccacagcgcc ctggtcgagt tggaaggcga 24180 caacgcgcgg ctggcggtgc tcaaacgcac ggtcgagctg acccatttcg cgtacccggc 24240 tctgaacctg ccccccaaag tcatgagcgc ggtcatggac caggtgctca tcaagcgcgc 24300 gtcgcccatc tccgaggacg agggcatgca agactccgag gagggcaagc ccgtggtcag 24360 cgacgagcag ctggcccggt ggctgggtcc taatgctagt ccccagagtt tggaagagcg 24420 gcgcaaactc atgatggccg tggtcctggt gaccgtggag ctggagtgcc tgcgccgctt 24480 cttcgccgac gcggagaccc tgcgcaaggt cgaggagaac ctgcactacc tcttcaggca 24540 cgggttcgtg cgccaggcct gcaagatctc caacgtggag ctgaccaacc tggtctcgta 24600 catgggcatc ttgcacgaga accgcctggg gcagaacgtg ctgcacacca ccctgcgcgg 24660 ggaggcccgg cgcgactaca tccgcgactg cgtctacctc tacctctgcc acacctggca 24720 gacgggcatg ggcgtgtggc agcagtgtct ggaggagcag aacctgaaag agctctgcaa 24780 gctcctgcag aagaactcaa gggtctgtgg accgggttcg acgagcgcac caccgcctcg 24840 gacctggccg acctcatttt ccccgagcgc ctcaggctga cgctgcgcaa cggcctgccc 24900 gactttatga gccaaagcat gttgcaaaac tttcgctctt tcatcctcga acgctccgga 24960 atcctgcccg ccacctgctc cgggctgccc tcggacttcg tgccgctgac cttccgcgag 25020 tgccccccgc cgctgtggag ccactgctac ctgctgcgcc tggccaacta cctggcctac 25080 cactcggacg tgattgagga cgtcagcggc gagggcctgc tcgagtgcca ctgccgctgc 25140 aacctctgca cgCcgcaccg ctccctggcc tgcaaccccc agctgytgag cgagacccag 25200 atcatcggca ccttcgagtt gcaagggccc agcgaaggcg agggttcagc cgccaagggg 25260 ggtctgaaac tcaccccggg gctgtggacc tcggcctact tgcgcaagtt cgtgcccgag 25320 gactaccatc ccttcgagat caggttctac gaggaccaat cccatccgcc caaggccgag 25380 ctgtcggcct gcgtcatcac ccagggggcg atcctggccc aattgcaagc catccagaaa 25440 tcccgccaag aattcttgct gaaaaagggc cgcggggtct acctcgaccc ccagaccggt 25500 gaggagCtCa accccggctt cccccaggat gccccgagga aacaagaagc tgaaagtgga 25560 gctgccgccc gtggaggatt tggaggaaga ctgggagaac agcagtcagg cagaggagga 25620 ggagatggag gaagactggg acagcactca ggcagaggag gacagcctgc aagacagtct 25680 ggaggaagac gaggaggagg cagaggagga ggtggaagaa gcagccgccg ccagaccgtc 25740 gtcctcggcg ggggagaaag caagcagcac ggataccatc tccgctccgg gtcggggtcc 25800 cgctcgacca cacagtagat gggacgagac cggacgattc ccgaacccca ccacccagac 25860 cggtaagaag gagcggcagg gatacaagtc ctggcggggg cacaaaaacg ccatcgtctc 25920 ctgcttgcag gcctgcgggg gcaacatctc cttcacccgg cgctacctgc tcttccaccg 25980 cggggtgaac tttccccgca acatcttgca ttactaccgt cacctccaca gcccctacta 26040 cttccaagaa gaggcagcag cagcagaaaa agaccagcag aaaaccagca gctagaaaat 26100 ccacagcggc ggcagcaggt ggactgagga tcgcggcgaa cgagccggcg caaacccggg 26160 agctgaggaa ccggatcttt cccaccctct atgccatctt ccagcagagt cgggggcagg 26220 agcaggaact gaaagtcaag aaccgttctc tgcgctcgct cacccgcagt tgtctgtatc 26280 acaagagcga agaccaactt cagcgcactc tcgaggacgc cgaggctctc ttcaacaagt 26340 actgcgcgct cactcttaaa gagtagcccg cgcccgccca gtcgcagaaa aaggcgggaa 26400 ttacgtcacc tgtgcccttc gccctagccg cctccaccca tcatcatgag caaagagatt 26460 cccacgcctt acatgtggag ctaccagccc cagatgggcc tggccgccgg tgccgcccag 26520 gactactcca cccgcatgaa ttggctcagc gccgggcccg cgatgatctc acgggtgaat 26580 gacatccgcg cccaccgaaa ccagatactc ctagaacagt cagcgctcac cgccacgccc 26640 cgcaatcacc taaatccgcg taattggccc gccgccctgg tgtaccagga aattccccag 26700 cccacgaccg tactacttcc gcgagacgcc caggccgaag tccagctgac taactcaggt 26760 gtccagctgg cgggcggcgc caccctgtgt cgtcaccgcc ccgctcaggg tataaagcgg 26820 ctggtgatcc ggggcagaag cacacagctc aacgacgaag tggtgagctc ttcgctgggt 26880 ctgcgacctg acggagtctt ccaactcgcc ggatcgggga gatcttcctt cacgcctcgt 26940 caggccgtcc tgactttgga gagttcgtcc tcgcagcccc gctcgggtgg catcggcact 27000 ctccagttcg tggaggagtt cactccctcg gtctacttca accccttctc cggctccccc 27060 ggccactacc cggacgagtt catcccgaac ttcgacgcca tcagcgagtc ggtggacggc 27120 tacgattgaa tgtcccatgg tggcgcagct gacctagctc ggcttcgaca cctggaccac 27180 tgccgccgct tccgctgctt cgctcgggat ctcgccgagt ttgcctactt tgagctgccc 27240 gaggagcacc ctcagggccc ggcccacgga gtgcggatcg tcgtcgaagg gggcctcgac 27300 tcccacctgc ttcggatctt cagccagcgt ccgatcctgg tcgagcgcga gcaaggacag 27360 acccttctga ctctgtactg catctgcaac caccccggcc tgcatgaaag tctttgttgt 27420 ctgctgtgta ctgagtataa taaaagctga gacagcgact actccggact tccgtttgtt 27480 cctgaatcca tcaaccagtc tttgttcttc accgggaacg agaccgagct ccagctccag 27540 tgtaagcccc acaagaagta cctcacctgg ctgttccagg gctccccgat cgccgttgtc 27600 aaccactgcg acaacgacgg agtcctgstg agcggccctg ccaaccwtac tttttccacc 27660 cgcagaagca agctccagct sttccaaccc ttcctccccg ggacctatca gtgcgtctcg 27720 ggaccctgcc atcacacctt ccacctgatc ccgaatacca cagcgtcgct ccccgmtact 27780 aacaaccaaa ctaacctcca ccaacgccac cgtcgcgacc tttctgaatc taatactacc 27840 acccacaccg gaggtgagct ccgaggtcaa ccaacctctg ggatttacta cggcccctgg 27900 gaggtggttg ggttaataac gctaggccta gttgcgggtg ggcttttggt tctctgctac 27960 ctatacctcc cttgctgttc gtacttagtg gtgctgtgtt gctggtttaa gaaatgggga 28020 agatcaccct agtgagctgc ggtgcgctgg tggcggtgtt gctttcgatt gtgggactgg 28080 gcggtgcggc tgtantgaaa gagaaggccg atccctgctt gcatttcaat cccaacaaat 28140 gccagctgag ttttcagccc gatggcaatc ggtgtgcggt actgatcaag tgcggatggg 28200 aatgcgagaa cgtgagaatc gagtacaata acaagactcg gaacaatact ctcgcgtccg 28260 tgtggcagcc cggggacccc gagtggtaca ccgtctctgt ccccggtgct gacggctccc 28320 cgcgcaccgt gaataatact ttcatttttg cgcacatgtg cgacacggtc atgtggatga 28380 gcaagcagta cgatatgtgg ccccccacga aggagaacat cgtggtcttc tccatcgctt 28440 acagcctgtg cacggcgcta atcaccgcta tcgtgtgcct gagcattcac atgctcatcg 28500 ctattcgccc cagaaataat gccgaaaaag aaaaacagcc ataacgtttt ttttcacacc 28560 tttttcagac catggcctct gttaaatttt tgcttttatt tgccagtctc attgccgtca 28620 ttcatggaat gagtaatgag aaaattacta tttacactgg cactaatcac acattgaaag 28680 gtccagaaaa agccacagaa gtttcatggt attgttattt taatgaatca gatgtatcta 28740 ctgaactctg tggaaacaat aacaaaaaaa atgagagcat tactctcatc aagtttcaat 28800 gtggatctga cttaacccta attaacatca ctagagacta tgtaggtatg tattatggaa 28860 ctacagcagg catttcggac atggaatttt atcaagtttc tgtgtctgaa cccaccacgc 28920 ctagaatgac cacaaccaca aaaactacac ctgttaccac tatgcagctc actaccaata 28980 acatttttgc catgcgtcaa atggtcaaca atagcactca acccacccca cccagtgagg 29040 aaattcccaa atccatgatt ggcattattg ttgctgtagt ggtgtgcatg ttgatcatcg 29100 ccttgtgcat ggtgtactat gccttctgct acagaaagca cagactgaac gacaagctgg 29160 aacacttact aagtgttgaa ttttaatttt ttagaaccat gaagatccta ggccttttaa 29220 ttttttctat cattacctct gctctatgca attctgacaa tgaggacgtt actgtcgttg 29280 tcggatcaaa ttatacactg aaaggtccag cgaagggtat gctttcgtgg tattgctatt 29340 ttggatctga cactacagaa actgaattat gcnatcttaa gaatggcaaa attcaaaatt 29400 cttaaaatta acaattatat atgcaatggt actgatctaa tactcctcaa tatcacgaaa 29460 tcatatgstg gcagttacac ctgccctgga gatgatgctg acagtatgat tttttacaaa 29520 gtaactgttg ttgatcccat actccacctc cacccaccac aattactcac accacacaca 29580 cagatcaaac cgcagcagag gaggcagcaa agttagcctt gcaggtccaa gacagttcat 29640 ttgttggcat tacccctaca catgatcagc ggtgtccggg gctgctagtc agcggcattg 29700 tcggtgtgct ttcgggatta gcagtcataa tcatctgcat gttcattttt gcttgctgct 29760 atagaaggct ttaccgacaa aaatcagacc cactgctgaa cctctatgtt taattttttc 29820 cagagtcatg aaggcagtta gcgctctagt tttttgttct wtgattggca ttgttttttg 29880 caatcctatt cctaaagtta gctttattaa agatgtgaat gttactgagg ggggcaatgt 29940 gacactggta ggtgtagagg gtgctgaaaa caccacctgg acaaaatacc acctcaatgg 30000 gtggaaagat atttgcaatt ggagtgtatt agtttataca tgtgagggag ttaatcttac 30060 cattgtcaat gccacctcag ctcaaaatgg tagaattcaa ggacaaagtg tcagtgtatc 30120 taatgggtat tttacccaac atacttttat ctatgacgtt aaagtcatac cactgcwtac 30180 gcttagccca cttagcatta ccacacagac aacccacatt acacagacaa ccacatacag 30240 tacattaaat cagcbtacca ccactacagc agcagaggtt gccagctcgt ctggggtccg 30300 agtggcattt ttgatgtggg ccccatmtag cagtcccact gctagtacca atgagcagac 30360 tactgaattt ttgtccactg tcgagagcca caccacagct acctccagtg ccttctctag 30420 caccgccaat ctctcctcgc tttcctntac accaatcagt cccgytaata ctcctagccc 30480 cgtcctcttc ccactcccct gaagcaaaca gacggcggca tgcaatggca gatcaccctg 30540 ctcattgtga tcgggttggt catcctggcc gtgttgctct actacatctt ctgccgccgc 30600 attcccaacg cgcaccgcaa gccggtatac aagcccatca ttgtcgggca gccggagccg 30660 cttcaggtgg aagggggtct aaggaatctt ctcttctctt ttacagtatg gtgattgaac 30720 tatgattcct agacaattct tgatcactat tcttatctgc ctcctccaag tctgtgccac 30780 cctcgctctg gtggccaacg ccagtccaga ctgtattggg cccttcgcct cctacgtgct 30840 ctttgccttc accacctgca tctgctgctg tagcatagtc tgcctgctta tcaccttctt 30900 ccagttcatt gactggatct ttgtgcgcat cgcmtacctg cgccaccacc cccagtaccg 30960 cgaccagcga gtggcgcggc tgctcaggct cctctgataa gcatgcgggc tgtgntactt 31020 ctcgcgcttc tgctgttagt gctcccccgt cccgtcgacc cccggtcccc cacccagtcc 31080 cccgaggagg tccgcaaatg caaattccaa gaaccctgga aattcctcaa atgctaccgc 31140 caaaaatcag acatgcatcc cagctggatc atgatcattg ggatcgtgaa cattctggcc 31200 tgcaccctca tctcctttgt gatttacccc tgctttgact ttggttggaa ctcgccagag 31260 gcgctctatc tcccgcctga acctgacaca ccaccacagc aacctcaggc acacgcacta 31320 ccaccactac agcctaggcc acaatacatg cccatattag actatgaggc cgagccacag 31380 cgacccatgc tccccgctat tagttacttc aatctaaccg gcggagatga ctgacccact 31440 ggccaacaac aacgtcaacg accttctcct ggacatggac ggccgcgcct cggagcagcg 31500 actcgcccaa cttcgcattc gccagcagca ggagagagcc gtcaaggagc tgcaggatgc 31560 ggtggccatc caccagtgca agagaggcat cttctgcctg gtgaaacagg ccaagatctc 31620 ctacgaggtc actccaaacg accatcgcct ctcctacgag ctcctgcagc agcgccagaa 31680 gttcaccttc ctggtcggag tcaaccccat cgtcatcacc cagcagtctg gcgataccaa 31740 ggggtccatc cactgctcct gcgactcccc cgactgcgtc cacactctga tcaagaccct 31800 stgcggcctc cgcgacctcc tccccatgaa ctaatcaccc ccttatccag tgaaataaag 31860 atcatattga tgatgatttt acagaaataa aaaataatca tttgatttga aataaagata 31920 caatcatatt gatgatttga gtttaacaaa aaaataaaga atcacttact tgaaatctga 31980 taccaggtct ctgtccatgt tttctgccaa caccacttca ctcccctctt cccagctctg 32040 gtactgcagg ccccggcggg ctgcaaactt cctccacacg ctgaagggga tgtcaaattc 32100 ctcctgtccc tcaatcttca ttttatcttc tatcagatgt ccaaaaagcg cgtccgggtg 32160 gatgatgact tcgaccccgt ctacccctac gatgcagaca acgcaccgac cgtgcccttc 32220 atcaaccccc ccttcgtctc ttcagatgga ttccaagaga agcccctggg ggtgttgtcc 32280 ctgcgactgg ccgaccccgt caccaccaag aacggggaaa taaccctcaa gctgggagag 32340 ggggtggacc tcgattcctc gggaaaactc atctccaaca cggccaccaa ggccgccgcc 32400 cctctcagtt tttccaacaa caccatttcc cttaacatgg atcacccctt ttacactaaa 32460 gatggaaaat tatccttaca agtttctcca ccattaaata tactgagaac aagcattcta 32520 aacacactag ctttaggttt tggatcaggt ttaggactcc gtggctctgc cttggcagta 32580 cagttagtct ctccacttac atttgatact gatggaaaca taaagcttac cttagacaga 32640 ggtttgcatg ttacaacagg agatgcaatt gaaagcaaca taagctgggc taaaggttta 32700 aaatttgaag atggagccat agcaaccaac attggaaatg ggttagagtt tggaagcagt 32760 agtacagaaa caggtgttga tgatgcttac ccaatccaag ttaaacttgg atctggcctt 32820 agctttgaca gtacaggagc cataatggct ggtaacaaag aagacgataa actcactttg 32880 tggacaacac ctgatccatc accaaactgt caaatactcg cagaaaatga tgcaaaacta 32940 acactttgct tgactaaatg tggtagtcaa atactggcca ctgtgtcagt cttagttgta 33000 ggaagtggaa acctaaaccc cattactggc accgtaagca gtgctcaggt gtttctacgt 33060 tttgatgcaa acggtgttct tttaacagaa cattctacac taaaaaaata ctgggggtat 33120 aggcagggag atagcataga tggcactcca tataccaatg ctgtaggatt catgcccaat 33180 ttaaaagctt atccaaagtc acaaagttct actactaaaa ataatatagt agggcaagta 33240 tacatgaatg gagatgtttc aaaacctatg cttctcacta taaccctcaa tggtactgat 33300 gacagcaaca gtacatattc aatgtcattt tcatacacct ggactaatgg aagctatgtt 33360 ggagcaacat ttggggctaa ctcttatacc ttctcataca tcgcccaaga atgaacactg 33420 tatcccaccc tgcatgccaa cccttcccac cccactctgt ggaacaaact ctgaaacaca 33480 aaataaaata aagttcaagt gttttattga ttcaacagtt ttacaggatt cgagcagtta 33540 tttttcctcc accctcccag gacatggaat acaccaccct ctccccccgc acagccttga 33600 acatctgaat gccattggtg atggacatgc ttttggtctc cacgttccac acagtttcag 33660 atggagccag tctcgggtcg gtcagggaga tgaaaccctc cgggcactcc cgcatctgca 33720 cctcacaggt caacagctga ggattgtcct cggtggtcgg gatcacggtt atctggaaga 33780 agcagaagag cggcggtggg aatcatagtc cgcgaacggg atcggccggt ggtgtcgcat 33840 caggccccgc agcagtcgct gccgccgccg ctccgtcaag ctgctgctca gggggtccgg 33900 gtccagggac tccctcagca tgatgcccac ggccctcagc atcagtcgtc tggtgcggcg 33960 ggcgcagcag cgcatgcgga tctcgctcag gtcgctgcag tacgtgcaac acagaaccac 34020 caggttgttc aacagtccat agttcaacac gctCcagccg aaactcatcg cgggaaggat 34080 gctacccacg tggccgtcgt accagatcct caggtaaatc aagtggtgcc ccctccagaa 34140 cacgctgccc acgtacatga tctccttggg catgtggcgg ttcaccacct cccggtacca 34200 catcaccctc tggttgaaca tgcagccccg gatgatcctg cggaaccaca gggccagcac 34260 cgccccgccc gccatgcagc gaagagaccc cgggtcccgg caatggcaat ggaggaccca 34320 ccgctcgtac ccgtggatca tctgggagct gaacaagtct atgttggcac agcacaggca 34380 tatgctcatg catctcttca gcactctcaa ctcctcgggg gtcaaaacca tatcccaggg 34440 cacggggaac tcttgcagga cagcgaaccc cgcagaacag ggcaatcctc gcacagaact 34500 tacattgtgc atggacaggg tatcgcaatc aggcagcacc gggtgatcct ccaccagaga 34560 agcgcgggtc tcgttctcct cacagcgtgg taagggggcc ggccgatacg ggtgatggcg 34620 ggacgcggct gatcgtgttc gcgaccgtgt catgatgcag ttgctttcgg acattttcgt 34680 acttgctgta gcagaacctg gtccgggcgC tgcacaccga tcgacggcgg cggtctcggc 34740 gcttggaacg ctcggtgttg aaattgtaaa acagccactc tctcagaccg tgcagcagat 34800 ctagggcctc aggagtgatg aagatcccat catgcctgat ggctctgatc acatcgacca 34860 ccgtggaatg ggccagaccc agccagatga tgcaattttg ttgggtttcg gtgacggcgg 34920 gggagggaag aacaggaaga accatgatta acttttaatc caaacggtct cggagtactt 34980 caaaatgaag atcgcggaga tggcacctct cgcccccgct gtgttggtgg aaaataacag 35040 ccaggtcaaa ggtgatacgg ttctcgagat gttccacggt ggctttcagc aaagcctcca 35100 cgcgcacatc cagaaacaag acaatagcga aagcgggagg gttctctaat tcctcaatca 35160 tcatgttaca ctcstgcacc atccccagat aattttcatt tttccagcct tgaatgattc 35220 gaactagttc gtgaggtaaa tccaagccag ccatgataaa gagctcgcgc agagcgccct 35280 ccaccggcat tcttaagcac accctcataa ttccaagata ttctgctcct ggttcacctg 35340 cagcagattg acaagcggaa tatcaaaatc tctgccgcga tccctgagct cctccctcag 35400 caataactgt aagtactctt tcatatcctc tccgaaattt ttagccatag gaccaccagg 35460 aataagatta gggcaagcca cagtacagat aaaccgaagt cctccccagt gagcattgcc 35520 aaatgcaaga ctgctataag catgctggct agacccggtg atatcttcca gataactgga 35580 cagaaaatcg cccaggcaat ttttaagaaa atcaacaaaa gaaaaatcct ccaggtggac 35640 gtttagagcc tcgggaacaa cgatgaagta aatgcaagcg gtgcgttcca gcatggttag 35700 ttagctgatc tgtagaaaaa acaaaaatga acattaaacc atgctagcct ggcgaacagg 35760 tgggtaaatc gttctctcca gcaccaggca ggccacgggg tctccggcgc gaccctcgta 35820 aaaattgtcg ctatgattga aaaccatcac agagagacgt tcccggtggc cggcgtgaat 35880 gattcgacaa gatgaataca cccccggaac attggcgtcc gcgagtgaaa aaaagcgccc 35940 gaggaagcaa taaggcacta caatgctcag tctcaagtcc agcaaagcga tgccatgcgg 36000 atgaagcaca aaattctcag gtgcgtacaa aatgtaatta ctcccctcct gcacaggcag 36060 caaagccccc gatccctcca ggtacacata caaagcctca gcgtccatag cttaccgagc 36120 agcagcacac aacaggcgca agagtcagag aaaggctgag ctctaacctg tccacccgct 36180 ctctgctcaa tatatagccc agatctacac tgacgtaaag gccaaagtct aaaaataccc 36240 gccaaataat cacacacgcc cagcacacgc ccagaaaccg gtgacacact caaaaaaata 36300 cgcgcacttc ctcaaacgcc caaaactgcc gtcatttccg ggttcccacg ctacgtcatc 36360 aaaacacgac tttcaaattc cgtcgaccgt taaaaacgtc acccgccccg cccctaacgg 36420 tcgcccgtct ctcagccaat cagcgccccg catccccaaa ttcaaacacc tcatttgcat 36480 attaacgcgc acaaaaagtt tgaggtatat tattgatgat g 36521 <210> 34 <211> 314 <212> PRT
<213> Human adenovirus type 4 <400> 34 Asn Thr Cys Gln Trp Lys Asp Ser Asp Ser Lys Met His Thr Phe Gly Ala Ala Ala Met Pro Gly Val Thr Gly Lys Lys Ile Glu Ala Asp Gly Leu Pro Ile Arg Ile Asp Ser Thr Ser Gly Thr Asp Thr Val Ile Tyr Ala Asp Lys Thr Phe Gln Pro Glu Pro Gln Val Gly Asn Asp Ser Trp Val Asp Thr Asn Gly Ala Glu Glu Lys Tyr Gly Gly Arg Ala Leu Lys Asp Thr Thr Lys Met Asn Pro Cys Tyr Gly Ser Phe Ala Lys Pro Thr Asn Lys Glu Gly Gly Gln Ala Asn Leu Lys Asp Ser Glu Pro Ala Ala Thr Thr Pro Asn Tyr Asp Ile Asp Leu Ala Phe Phe Asp Ser Lys Thr Ile Val Ala Asn Tyr Asp Pro Asp Ile Val Met Tyr Thr Glu Asn Val Asp Leu Gln Thr Pro Asp Thr His Ile Val Tyr Lys Pro Gly Thr Glu Asp Thr Ser Ser Glu Ser Asn Leu Gly Gln Gln Ala Met Pro Asn Arg Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser Gln Leu Asn Ala Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser Tyr Gln Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asn Gly Val Gly Leu Thr Asp Thr Tyr Gln Gly Val Lys Val Lys Thr Asp Ala Gly Ser Glu Lys Trp Asp Lys Asp Asp Thr Thr Val Ser Asn Ala Asn Glu Ile His Val Gly Asn Pro Phe Ala Met <210> 35 <211> 318 <212> PRT
<213> Human adenovirus type 16 <400> 35 Asn Thr Cys Gln Trp Lys Asp Ser Asp Ser Lys Met His Thr Phe Gly Val Ala Ala Met Pro Gly Val Thr Gly Lys Lys Ile Glu Ala Asp Gly Leu Pro Ile Gly Ile Asp Ser Thr Ser Gly Thr Asp Thr Val Ile Tyr Ala Asp Lys Thr Phe Gln Pro Glu Pro Gln Val Gly Asn Ala Ser Trp Val Asp Ala Asn Gly Thr Glu Glu Lys Tyr Gly Gly Arg Ala Leu Lys Asp Thr Thr Lys Met Lys Pro Cys Tyr Gly Ser Phe Ala Lys Pro Thr Asn Lys Glu Gly Gly Gln Ala Asn Leu Lys Asp Ser Glu Thr Ala Ala Thr Thr Pro Asn Tyr Asp Ile Asp Leu Ala Phe Phe Asp Asn Lys Asn Ile Ala Ala Asn Tyr Asp Pro Asp Ile Val Met Tyr Thr Glu Asn Val Asp Leu Gln Thr Pro Asp Thr His Ile Val Tyr Lys Pro Gly Thr Glu Asp Thr Ser Ser Glu Ser Asn Leu Gly Gln Gln Ala Met Pro Asn Arg Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser Gln Leu Asn Ala Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser Tyr Gln Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asn Gly Val Gly Phe Thr Asp Thr Tyr Gln Gly Val Lys Val Lys Thr Asp Ala Val Ala Gly Thr Ser Gly Thr Gln Trp Asp Lys Asp Asp Thr Thr Val Ser Thr Ala Asn Glu Ile His Gly Gly Asn Pro Phe Ala Met <210> 36 <211> 323 <212> PRT
<213> Human adenovirus type 3 <400> 36 Asn Thr Ser Gln Trp Ile Val Thr Thr Asn Gly Asp Asn Ala Val Thr Thr Thr Thr Asn Thr Phe Gly Ile Ala Ser Met Lys Gly Gly Asn Ile Thr Lys Glu Gly Leu Gln Ile Gly Lys Asp Ile Thr Thr Thr Glu Gly Glu Glu Lys Pro Ile Tyr Ala Asp Lys Thr Tyr Gln Pro Glu Pro Gln Val Gly Glu Glu Ser Trp Thr Asp Thr Asp Gly Thr Asn Glu Lys Phe Gly Gly Arg Ala Leu Lys Pro Ala Thr Asn Met Lys Pro Cys Tyr Gly Ser Phe Ala Arg Pro Thr Asn Ile Lys Gly Gly Gln Ala Lys Asn Arg Lys Val Lys Pro Thr Thr Glu Gly Gly Val Glu Thr Glu Glu Pro Asp Ile Asp Met Glu Phe Phe Asp Gly Arg Asp Ala Val Ala Gly Ala Leu Ala Pro Glu Ile Val Leu Tyr Thr Glu Asn Val Asn Leu Glu Thr Pro Asp Ser His Val Val Tyr Lys Pro Glu Thr Ser Asn Asn Ser His Ala Asn Leu Gly Gln Gln Ala Met Pro Asn Arg Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe Val Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser Gln Leu Asn Ala Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser Tyr Gln Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn His Gly Ile Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asn Gly Ile Gly Pro Gly His Thr Tyr Gln Gly Ile Lys Lys Val Lys Thr Asp Asp Thr Asn Gly Trp Glu Lys Asp Ala Asn Val Ala Pro Ala Asn Glu Ile Thr Ile Gly Asn Asn Leu Ala Met <210> 37 <211> 315 <212> PRT
<213> Human adenovirus type 7 <400> 37 Asn Thr Ser Gln Trp Ile Val Thr Ala Gly Glu Glu Arg Ala Val Thr Thr Thr Thr Asn Thr Phe Gly Ile Ala Ser Met Lys Gly Asp Asn Ile Thr Lys Glu Gly Leu Glu Ile Gly Lys Asp Ile Thr Ala Asp Asn Lys Pro Ile Tyr Ala Asp Lys Thr Tyr Gln Pro Glu Pro Gln Val Gly Glu Glu Ser Trp Thr Asp Thr Asp Gly Thr Asn Glu Lys Phe Gly Gly Arg Ala Leu Lys Pro Ala Thr Lys Met Lys Pro Cys Tyr Gly Ser Phe Ala Arg Pro Thr Asn Ile Lys Gly Gly Gln Ala Lys Asn Arg Lys Val Lys Pro Thr Glu Gly Asp Val Glu Thr Glu Glu Pro Asp Ile Asp Met Glu Phe Phe Asp Gly Arg Glu Ala Ala Asp Ala Phe Ser Pro Glu Ile Val Leu Tyr Thr Glu Asn Val Asn Leu Glu Thr Pro Asp Ser His Val Val Tyr Lys Pro Gly Thr Ser Asp Asp Asn Ser His Ala Asn Leu Gly Gln Gln Ala Met Pro Asn Arg Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe Val Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Giy Val Leu Ala Gly Gln Ala Ser Gln Leu Asn Ala Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser Tyr Gln Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn His Gly Ile Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asp Gly Ile Gly Pro Ala Lys Thr Tyr Gln Gly Ile Lys Ser Lys Asp Asn Giy Trp Glu Lys Asp Asp Asn Val Ser Lys Ser Asn Glu Ile Ala Ile Gly Asn Asn Gln Ala Met <210> 38 <211> 345 <212> PRT
<213> Human adenovirus type 2 <400> 38 Asn Ser Cys Glu Trp Glu Gln Thr Glu Asp Ser Gly Arg Ala Val Ala Glu Asp Glu Glu Glu Glu Asp Glu Asp Glu Glu Glu Glu Glu Glu Glu Gln Asn Ala Arg Asp Gln Ala Thr Lys Lys Thr His Val Tyr Ala Gln Ala Pro Leu Ser Gly Glu Thr Leu Thr Lys Ser Gly Leu Gln Ile Gly Ser Lys Asn Ala Glu Thr Gln Ala Lys Pro Val Tyr Ala Asp Pro Ser Tyr Gln Pro Glu Pro Gln Ile Gly Glu Ser Gln Trp Asn Giu Ala Asp Ala Asn Ala Ala Gly Gly Arg Val Leu Lys Lys Thr Thr Pro Met Lys Pro Tyr Gly Ser Tyr Ala Arg Pro Thr Asn Pro Phe Gly Gly Gln Ser Val Leu Val Pro Asp Glu Lys Gly Val Pro Leu Pro Lys Val Asp Leu Gln Phe Phe Ser Asn Thr Thr Ser Leu Asn Asp Arg Gln Gly Asn Ala Thr Lys Pro Lys Val Val Leu Tyr Ser Glu Asp Val Asn Met Glu Thr Pro Asp Thr His Leu Ser Tyr Lys Pro Gly Lys Gly Asp Glu Asn Ser Lys Ala Met Leu Gly Gln Gln Ser Met Pro Asn Arg Pro Asn Tyr Ile Ala Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser Gln Leu Asn Ala Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser Tyr Gln Leu Leu Leu Asp Ser Ile Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn His Gly Thr Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Gly Gly Ile Gly Val Thr Asp Thr Tyr Gln Ala Ile Lys Ala Asn Gly Asn Gly Ser Gly Asp Asn Gly Asp Thr Thr Trp Thr Lys Asp Glu Thr Phe Ala Thr Arg Asn Glu Ile Gly Val Gly Asn Asn Phe Ala Met <210> 39 <211> 183 <212> PRT
<213> human adenovirus protein <400> 39 Thr Leu Trp Thr Thr Pro Asp Pro Ser Pro Asn Cys Arg Ile His Ser Asp Asn Asp Cys Lys Phe Thr Leu Val Leu Thr Lys Cys Gly Ser Gln Val Leu Ala Thr Val Ala Ala Leu Ala Val Ser Gly Asp Leu Ser Ser Met Thr Gly Thr Val Ala Ser Val Ser Ile Phe Leu Arg Phe Asp Gln Asn Gly Val Leu Met Glu Asn Ser Ser Leu Lys Lys His Tyr Trp Asn Phe Arg Asn Gly Asn Ser Thr Asn Ala Asn Pro Tyr Thr Asn Ala Val Gly Phe Met Pro Asn Leu Leu Ala Tyr Pro Lys Thr Gln Ser Gln Thr Ala Lys Asn Asn Ile Val Ser Gln Val Tyr Leu His Gly Asp Lys Thr Lys Pro Met Ile Leu Thr Ile Thr Leu Asn Gly Thr Ser Glu Ser Thr Glu Thr Ser Glu Val Ser Thr Tyr Ser Met Ser Phe Thr Trp Ser Trp Glu Ser Gly Lys Tyr Thr Thr Glu Thr Phe Ala Thr Asn Ser Tyr Thr Phe Ser Tyr Ile Ala Gln Glu <210> 40 <211> 182 <212> PRT
<213> human adenovirus protein <400> 40 Thr Leu Trp Thr Thr Pro Ala Pro Ser Pro Asn Cys Arg Leu Asn Ala Glu Lys Asp Ala Lys Leu Thr Leu Val Leu Thr Lys Cys Gly Ser Gln Ile Leu Ala Thr Val Ser Val Leu Ala Val Lys Gly Ser Leu Ala Pro Ile Ser Gly Thr Val Gln Ser Ala His Leu Ile Ile Arg Phe Asp Glu Asn Gly Val Leu Leu Asn Asn Ser Phe Leu Asp Pro Glu Tyr Trp Asn Phe Arg Asn Gly Asp Leu Thr Glu Gly Thr Ala Tyr Thr Asn Ala Val Gly Phe Met Pro Asn Leu Ser Ala Tyr Pro Lys Ser His Gly Lys Thr Ala Lys Ser Asn Ile Val Ser Gln Val Tyr Leu Asn Gly Asp Lys Thr Lys Pro Val Thr Leu Thr Ile Thr Leu Asn Gly Thr Gln Glu Thr Gly Asp Thr Thr Pro Ser Ala Tyr Ser Met Ser Phe Ser Trp Asp Trp Ser Gly His Asn Tyr Ile Asn Glu Ile Phe Ala Thr Ser Ser Tyr Thr Phe Ser Tyr Ile Ala Gln Glu <210> 41 <211> 338 <212> PRT
<213> human adenovirus protein <400> 41 Ala Pro Lys Gly Ala Pro Asn Pro Cys Glu Trp Asp Glu Ala Ala Thr Ala Leu Glu Ile Asn Leu Glu Glu Glu Asp Asp Asp Asn Glu Asp Glu Val Asp Glu Gln Ala Glu Gln Gln Lys Thr His Val Phe Gly Gln Ala Pro Tyr Ser Gly Ile Asn Ile Thr Lys Glu Gly Ile Gln Ile Gly Val Glu Gly Gln Thr Pro Lys Tyr Ala Asp Lys Thr Phe Gln Pro Glu Pro Gln Ile Gly Glu Ser Gin Trp Tyr Glu Thr Glu Ile Asn His Ala Ala Gly Arg Val Leu Lys Lys Thr Thr Pro Met Lys Pro Cys Tyr Gly Ser Tyr Ala Lys Pro Thr Asn Glu Asn Gly Gly Gln Gly Ile Leu Val Lys Gln Gln Asn Gly Lys Leu Glu Ser Gln Val Glu Met Gln Phe Phe Ser Thr Thr Glu Ala Thr Ala Gly Asn Gly Asp Asn Leu Thr Pro Lys Val Val Leu Tyr Ser Glu Asp Val Asp Ile Glu Thr Pro Asp Thr His Ile Ser Tyr Met Pro Thr Ile Lys Glu Gly Asn Ser Arg Glu Leu Met Gly Gln Gln Ser Met Pro Asn Arg Pro Asn Tyr Ile Ala Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser Gln Leu Asn Ala Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser Tyr Gln Leu Leu Leu Asp Ser Ile Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn His Gly Thr Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Gly Gly Val Ile Asn Thr Glu Thr Leu Thr Lys Val Lys Pro Lys Thr Gly Gln Glu Asn Gly Trp Glu Lys Asp Ala Thr Glu Phe Ser Asp Lys Asn Glu Ile Arg Val Gly Asn Asn Phe Ala Met Glu Ile
With ref to SEQ ID NO:33.
E3 11.6 kDa 27130... 27450 20 16 kDa (27404 ... 27477,27666.. 21 28032) 19.3 kDa 28014.. 28544 22 22.3 28572.. 29186 23 9.9 kDa 30722.. 30997 24 15.6 kDa 31003.. 31434 25 14.7 kDa 31427.. 31834 26 L5 Fiber - p1V 32137.. 33414 27 E4 ORF7-like protein Complement (33521.. 28 >33772) Orf 6 - 33 kDa Complement (33769..34674) 29 Orf4 - 13.2 kDa Complement (34580.. 34945) 30 Orf 3 - 12. 8 kDa Complement (34955.. 35308) 31 Orf 2 - 14.2 kDa Complement (35305.. 35694 32 Thus, the invention provides unique C68 proteins, peptides and fragments thereof, which are produced recombinantly or by other methods. Suitably, such fragments are at least 8 amino acids in length. However, fragments of other desired lengths are readily utilized. In addition, the invention encompasses such modifications as may be introduced to enhance yield and/or expression of a C68 protein or fragment, construction of a fusion molecule in which all or a fragment of the C68 protein or fragment is fused (either directly or via a linker) with a fusion partner to enhance. Other suitable modifications include, without limitation, truncation of a coding region (e.g., a protein or enzyme) to eliminate a pre-or pro-protein ordinarily cleaved to produce the mature protein or enzyme and/or mutation of a coding region to provide a secretable gene product. Still other modifications will be readily apparent to one of skill in the art. The invention further encompasses proteins having at least about 95% to 99% identity to the C68 proteins provided herein.
The term "substantial homology" or "substantial similarity," when referring to a protein or fragment thereof, indicates that, when optimally aligned with appropriate amino acid insertions or deletions with another protein, there is nucleotide sequence identity in at least about 95 to 99% of the aligned sequences .
The term "percent sequence identity" or "identical" in the context of proteins or fragments thereof refers to the amino acids in the two sequences that are the same when aligned for maximum correspondence. The length of sequence identity comparison may be over the full length of a protein, enzyme, polypeptide, peptide, or other fragment of at least about 200 to 500 amino acids, is desired. However, identity among smaller fragments, e.g. of at least about 8 amino acids, usually at least about 20 to 24 amino acids, at least about 28 to 32 amino acids, at least about 50 or more amino acids, may also be desired.
Identity is readily determined by one of skill in the art by resort to algorithms and computer programs known by those of skill in the art. As described herein, alignments are performed using any of a variety of publicly or commercially available Multiple Sequence Alignment Programs, such as "Clustal W", accessible through Web Servers on the internet.
Alternatively, Vector NTI utilities are also used. There are also a number of algorithms known in the art that can be used to measure amino acid sequence identity, including those contained in the programs described above. Generally, these programs are used at default settings, although one of skill in the art can alter these settings as needed.
Alternatively, one of skill in the art can utilize another algorithm or computer program that provides at least the level of identity or alignment as that provided by the referenced algorithms and programs.
As described herein, the C68-derived capsid proteins of the invention are particularly well suited for use in applications in which the neutralizing antibodies diminish the effectiveness of other Ad serotype based targeting proteins and vectors, as well as other viral vectors. The C68-derived constructs of the invention are particularly advantageous in readministration for repeat gene therapy or for boosting immune response (vaccine titers).
Also provided by the present invention are artificial adenoviral capsid proteins, which involve modifications and chimeric capsids constructed using the C68 adenoviral capsid proteins of the invention. Such artificial capsid proteins can be constructed using the amino acid sequences of the chimp C68 Ad hexon of the invention. Because the hexon protein is the determinant for serotype of an adenovirus, such artificial hexon proteins would result in adenoviruses having artificial serotypes. Other artificial capsid proteins can also be constructed using the chimp Ad penton sequences and/or fiber sequences of the invention and/or fragments thereof.
In one embodiment, a chimeric C68 capsid is constructed using C68 hexon and fiber and a penton from another adenovirus. Alternatively, a chimeric C68 capsid comprises a C68 hexon and a fiber and penton from one or more different adenoviruses.
Another chimeric adenovirus capsid comprises the C68 fiber and a penton and a hexon from one or more different different adenovirus serotypes. Yet another chimeric adenovirus capsid comprises the C68 penton and a fiber and hexon from one or more different adenovirus serotypes. Suitably, for such chimeric and artificial capsids constructed from C68 proteins, the non-C68 adenovirus components may be readily selected from other adenovirus serotypes.
Under certain circumstances, it may be desirable to use one or more of the C68-derived capsid proteins or a fragment thereof to generate an antibody. The term "an antibody," as used herein, refers to an immunoglobulin molecule which is able to specifically bind to an epitope. The antibodies in the present invention exist in a variety of forms including, for example, high affinity polyclonal antibodies, monoclonal antibodies, synthetic antibodies, chimeric antibodies, recombinant antibodies and humanized antibodies. Such antibodies originate from immunoglobulin classes IgG, IgM, IgA, IgD and IgE.
Such antibodies may be generated using any of a number of methods know in the art.
Suitable antibodies may be generated by well-known conventional techniques, e.g. Kohler and Milstein and the many known modifications thereof. Similarly desirable high titer antibodies are generated by applying known recombinant techniques to the monoclonal or polyclonal antibodies developed to these antigens [see, e.g., PCT Patent Application No.
PCT/GB85/00392; British Patent Application Publication No. GB2188638A; Amit et al., 1986 Science, 233:747-753; Queen et al., 1989 Proc. Nat'l. Acad. Sci. USA, 86:10029-10033;
PCT Patent Application No. PCT/W09007861; and Riechmann et al., Nature, 332:323-327 (1988); Huse et al, 1988a Science, 246:1275-1281]. Alternatively, antibodies can be produced by manipulating the complementarity determining regions of animal or human antibodies to the antigen of this invention. See, e.g., E. Mark and Padlin, "Humanization of Monoclonal Antibodies", Chapter 4, The Handbook of Experimental Pharmacology, Vol.
113, The Pharmacology of Monoclonal Antibodies, Springer-Verlag (June, 1994);
Harlow et al., 1999, Using Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press, NY; Harlow et al., 1989, Antibodies: A Laboratory Manual, Cold Spring Harbor, New York;
Houston et al., 1988, Proc. Natl. Acad. Sci. USA 85:5879-5883; and Bird et al., 1988, Science 242:423-426.
Alternatively, one or more of the C68 capsid proteins of the invention are assembled as multi-antigenic complexes [see, e.g., European Patent Application 0339695, published November 2, 1989] and employed to elicit high titer antibodies. Further provided by the present invention are anti-idiotype antibodies (Ab2) and anti-anti-idiotype antibodies (Ab3).
See, e.g., M. Wettendorff et al., "Modulation of anti-tumor immunity by anti-idiotypic antibodies." In Idiotypic Network and Diseases, ed. by J. Cerny and J.
Hiernaux, 1990 J. Am.
Soc. Microbiol., Washington DC: pp. 203-229]. These anti-idiotype and anti-anti-idiotype antibodies are produced using techniques well known to those of skill in the art. These antibodies may be used for a variety of purposes, including diagnostic and clinical methods and kits.
Under certain circumstances, it may be desirable to introduce a detectable label or a tag onto a C68 antibody or other construct of the invention. As used herein, a detectable label is a molecule which is capable, alone or upon interaction with another molecule, of providing a detectable signal. Most desirably, the label is detectable visually, e.g. by fluorescence, for ready use in immunohistochemical analyses or immunofluorescent microscopy. For example, suitable labels include fluorescein isothiocyanate (FITC), phycoerythrin (PE), allophycocyanin (APC), coriphosphine-O (CPO) or tandem dyes, PE-cyanin-5 (PC5), and PE-Texas Red (ECD). All of these fluorescent dyes are commercially available, and their uses known to the art. Other useful labels include a colloidal gold label. Still other useful labels include radioactive compounds or elements. Additionally, labels include a variety of enzyme systems that operate to reveal a colorimetric signal in an assay, e.g., glucose oxidase (which uses glucose as a substrate) releases peroxide as a product which in the presence of peroxidase and a hydrogen donor such as tetramethyl benzidine (TMB) produces an oxidized TMB that is seen as a blue color. Other examples include horseradish peroxidase (HRP) or alkaline phosphatase (AP), and hexokinase in conjunction with glucose-6-phosphate dehydrogenase which reacts with ATP, glucose, and NAD+ to yield, among other products, NADH that is detected as increased absorbance at 340 nm wavelength. Other label systems that are utilized in the methods of this invention are detectable by other means, e.g., colored latex microparticles [Bangs Laboratories, Indiana] in which a dye is embedded are used in place of enzymes to form conjugates with the target sequences provide a visual signal indicative of the presence of the resulting complex in applicable assays.
Methods for coupling or associating the label with a desired molecule are similarly conventional and known to those of skill in the art. Known methods of label attachment are described [see, for example, Handbook of Fluorescent probes and Research Chemicals, 6th Ed., R. P. M. Haugland, Molecular Probes, Inc., Eugene, OR, 1996; Pierce Catalog and Handbook, Life Science and Analytical Research Products, Pierce Chemical Company, Rockford, IL, 1994/1995]. Thus, selection of the label and coupling methods do not limit this invention.
The C68-derived proteins, peptides, and fragments described herein can be produced by any suitable means, including chemical synthesis, or other synthetic means, or by recombinant production and conventional genetic engineering methodologies. For example, peptides can be synthesized by the well known solid phase peptide synthesis methods (Merrifield, J. Am. Chem. Soc., 85:2149 (1962); Stewart and Young, Solid Phase Peptide Synthesis (Freeman, San Francisco, 1969) pp. 27-62). These and other suitable production methods are within the knowledge of those of skill in the art and are not a limitation of the present invention.
Alternatively, suitable methods for recombinant production can be used.
Selection of suitable expression systems, including expression vectors and host cells for protein expression and/or viral packaging is within the ability of one of skill in the art and is not a limitation of the present invention. See, e.g., Sambrook et al, Molecular Cloning: A
Laboratory Manual, Cold Spring Harbor Press (Cold Spring Harbor, NY).
Nucleic acid sequences for the C68 genome, which is 36521 bp in length, may be obtained using information available in US Patent 6,083,716 and from the American Type Culture Collection, 10801 University Boulevard, Manassas, Virginia 20110-2209 (Pan-9).
This sequences is also available from GenBank. Other chimpanzee adenovirus sequences are available from the American Type Culture Collection, 10801 University Boulevard, Manassas, Virginia 20110-2209, and other sources. Desirable chimpanzee strains Pan 5 [ATCC VR-591], Pan 6 [ATCC VR-592], and Pan 7 [ATCC VR-593]. Another particularly desirable chimpanzee adenovirus strain is chimpanzee adenovirus strain Bertha or Cl [ATCC
Accession No. VR-20]. The sequence of the Cl serotype, and the location of the adenovirus genes Ela, Elb, E2a, E2b, E3, E4, L1, L2, L3, L4 and L5 are provided in US
Patent 6,083,716. Optionally, non-chimpanzee simian adenoviral sequences may be used.
Such non-chimpanzee adenovirus include those obtained from baboon adenovirus strains [e.g., ATCC VR-275], adenovirus strains isolated from rhesus monkeys [e.g., ATCC VR-209, ATCC VR-275, ATCC VR-353, ATCC VR-355], and adenovirus strains isolated from African green monkeys [e.g., ATCC VR-541; ATCC VR-941; ATCC VR-942; ATCC VR-943]. Alternatively, one may readily select from among the at least 51 different human serotypes, including, without limitation, human adenovirus serotypes 1, 2, 3, 4, 5, 12, 35, 37, and 40, and other, non-human primate adenovirus serotypes. Further, the sequences of these and other suitable serotypes are available from a variety of databases including, e.g., PubMed and GenBank [see, for example, US Patent No. 5,240,846]. Selection of an appropriate adenovirus is not a limitation of the present invention.
The invention further provides molecules useful for production of the C68 and derived proteins of the invention, including such molecules which carry polynucleotides including DNA sequences. Thus, the invention further encompasses the nucleic acid sequences encoding the C68-derived constructs of the invention, and molecules and host cells useful in expression thereof, including suitable DNA molecules and vectors, which can be any suitable genetic element as defined herein. Preferably, these vectors are DNA-based (e.g., plasmids) or viral vectors.
In one embodiment, the C68-derived capsid proteins and other C68 adenovirus proteins described herein are used for non-viral, protein-based delivery of genes, proteins, and other desirable diagnostic, therapeutic and immunogenic molecules. A desired molecule for delivery to a target call may be associated with a C68-derived capsid protein or other protein by any suitable means, including, e.g., covalent or non-covalent binding. For example, the C68 penton protein may be readily utilized for such a purpose by production of a fusion protein using the C68 penton sequences of SEQ ID NO:12 in a manner analogous to that described in Medina-Kauwe LK, et al, Gene Ther. 2001 May; 8(10):795-803 and Medina-Kauwe LK, et al, Gene Ther. 2001 Dec; 8(23): 1753-1761. Alternatively, the amino acid sequences of C68 protein IX may be utilized for targeting vectors by associating the protein IX with a ligand that binds to a cell surface receptor, as described in US
Patent Appln 20010047081. Suitable ligands include a CD40 antigen, an RGD-containing or polylysine-containing sequence, and the like. Still other C68 proteins may be used for used for these and similar purposes.
Further, the C68 adenovirus proteins of the invention are particularly well suited for use in producing viral vectors in C68-derived capsids. Suitably, these adenoviruses are pseudotyped such that a nucleic acid molecule carrying adenovirus ITRs from a non-C68 serotype and a minigene are packaged in a C68-derived adenoviral capsid of the invention.
Alternatively, adenoviruses may be generated which contain at least the 5' ITRs or the 3' ITRs from C68, in a C68-derived capsid protein. The adenoviral vectors described herein may contain adenoviral sequences derived from one, more than one adenoviral strain. In yet another alternative, other C68 elements described herein may be utilized in production of recombinant vectors, or other desirable constructs.
The C68 proteins of the invention are useful for a variety of purposes, including construction of recombinant viruses. The C68-derived capsid proteins of the invention are useful in producing hybrid vectors, including, hybrid C68-adeno-associated viruses, Epstein-Barr virus, and retroviruses [Caplen et al, Gene Ther. 6: 454-459 (1999); Tan et al, J Virol., 73:7582-7589 (1999)]. Such viruses include C68-derived capsids which encapsidated vectors with adeno-associated virus (AAV) ITRs [Lieber et al, J Virol, 73:9314-9324 (1999), Recchia et al, Proc Natl Acad Sci USA, 96:2615-2620 (1999); or lentivirus ITRs (Zheng et al, Nat Biotech, 18:176-180 (2000), using Maloney leukemia virus long terminal repeats).
In a particularly desirable embodiment, the C68-derived capsid proteins, and optionally, the other C68 sequences described herein, are used to produce recombinant adenoviruses and pseudotyped adenoviruses. However, it will be readily understood that the C68-derived capsid proteins and other novel C68 sequences can be utilized for a variety of purposes, including production of other types of viral vectors (such as, e.g., hybrid vectors) carrying the therapeutic and immunogenic transgenes described below.
Additionally, it will be readily understood that viral vectors carrying the unique C68 proteins and other sequences of the invention can be utilized for targeting and/or delivery of other types of molecules, including proteins, chemical molecules and other moieties useful for diagnostic, therapeutic and/or immunization purposes.
II. Recombinant Adenoviral Vectors The compositions of this invention include vectors that deliver a heterologous molecule to cells, either for therapeutic or vaccine purposes. As used herein, a vector may include any genetic element including, without limitation, a cosmid, episome, plasmid, or a virus. In a particularly preferred embodiment, these vectors are viral vectors having capsid proteins derived from the C68 proteins of the invention. Alternatively, these vectors may contain other C68 sequences of the invention. These viral vectors suitably contain a minigene. By "minigene" is meant the combination of a selected heterologous gene and the other regulatory elements necessary to drive translation, transcription and/or expression of the gene product in a host cell.
Typically, an adenoviral vector is designed such that the minigene is flanked on its 5' end and/or its 3' end by adenoviral sequences which include, at a minimum, the cis-elements necessary for replication and virion encapsidation. Thus, in one embodiment, the vector contains adenoviral sequences encompassing at least the 5' end of the adenoviral genome, i.e., the 5' inverted terminal repeat sequences (which functions as origins of replication) and the native 5' packaging enhancer domains (that contain sequences necessary for packaging linear Ad genomes and enhancer elements for the El promoter). The vector is also provided with the cis-acting 3' ITRs. Suitably, the minigene is located between the 5' adenoviral elements and the 3' adenoviral elements. An adenoviral vector of the invention may also contain additional adenoviral sequences. For example, the minigene may be located in the site of such as the site of a functional El deletion or functional E3 deletion, among others that may be selected. Alternatively, the minigene may be inserted into an existing gene region to disrupt the function of that region, if desired.
The term "functionally deleted" or "functional deletion" means that a sufficient amount of the gene region is removed or otherwise damaged, e.g., by mutation or modification, so that the gene region is no longer capable of producing functional products of gene expression. If desired, the entire gene region may be removed.
Suitably, these adenoviral vectors of the invention contain one or more adenoviral elements derived from C68. In one embodiment, the vectors contain adenoviral ITRs from an adenoviral serotype which differs from C68. Alternatively, C68 ITRs may be utilized in a viral vector of the invention in which the capsid is not naturally occurring, but contains one or more C68 proteins, or fragments thereof. The selection of the serotype of the ITRs and the serotype of any other adenoviral sequences present in vector is not a limitation of the present invention. A variety of adenovirus strains are described herein.
The viral sequences, helper viruses, if needed, and recombinant viral particles, and other vector components and sequences employed in the construction of the vectors described herein are obtained as described above. See, e.g., US Patent No. 5,240,846.
The DNA
sequences of the adenovirus sequences are employed to construct vectors and cell lines useful in the preparation of such vectors. See, e.g., US Patent No. 6,083,716.
Modifications of the nucleic acid sequences forming the vectors of this invention, including sequence deletions, insertions, and other mutations may be generated using standard molecular biological techniques and are within the scope of this invention.
A. The "Minigene"
The methods employed for the selection of the transgene, the cloning and construction of the "minigene" and its insertion into the viral vector are within the skill in the art given the teachings provided herein.
1. The transgene The transgene is a nucleic acid sequence, heterologous to the vector sequences flanking the transgene, which encodes a polypeptide, protein, or other product, of interest. The nucleic acid coding sequence is operatively linked to regulatory components in a manner which permits transgene transcription, translation, and/or expression in a host cell.
The composition of the transgene sequence will depend upon the use to which the resulting vector will be put. For example, one type of transgene sequence includes a reporter sequence, which upon expression produces a detectable signal. Such reporter sequences include, without limitation, DNA sequences encoding (3-lactamase, J3-galactosidase (LacZ), alkaline phosphatase, thymidine kinase, green fluorescent protein (GFP), chloramphenicol acetyltransferase (CAT), luciferase, membrane bound proteins including, for example, CD2, CD4, CD8, the influenza hemagglutinin protein, and others well known in the art, to which high affinity antibodies directed thereto exist or can be produced by conventional means, and fusion proteins comprising a membrane bound protein appropriately fused to an antigen tag domain from, among others, hemagglutinin or Myc. These coding sequences, when associated with regulatory elements which drive their expression, provide signals detectable by conventional means, including enzymatic, radiographic, colorimetric, fluorescence or other spectrographic assays, fluorescent activating cell sorting assays and immunological assays, including enzyme linked immunosorbent assay (ELISA), radioimmunoassay (RIA) and immunohistochemistry. For example, where the marker sequence is the LacZ gene, the presence of the vector carrying the signal is detected by assays for beta-galactosidase activity. Where the transgene is GFP or luciferase, the vector carrying the signal may be measured visually by color or light production in a luminometer.
However, desirably, the transgene is a non-marker sequence encoding a product which is useful in biology and medicine, such as proteins, peptides, RNA, enzymes, or catalytic RNAs. Desirable RNA molecules include tRNA, dsRNA, ribosomal RNA, catalytic RNAs, and antisense RNAs. One example of a useful RNA sequence is a sequence which extinguishes expression of a targeted nucleic acid sequence in the treated animal.
The transgene may be used for treatment, e.g., of genetic deficiencies, as a cancer therapeutic or vaccine, for induction of an immune response, and/or for prophylactic vaccine purposes. As used herein, induction of an immune response refers to the ability of a molecule (e.g., a gene product) to induce a T cell and/or a humoral immune response to the molecule. The invention further includes using multiple transgenes, e.g., to correct or ameliorate a condition caused by a multi-subunit protein. In certain situations, a different transgene may be used to encode each subunit of a protein, or to encode different peptides or proteins. This is desirable when the size of the DNA encoding the protein subunit is large, e.g., for an immunoglobulin, the platelet-derived growth factor, or a dystrophin protein. In order for the cell to produce the multi-subunit protein, a cell is infected with the recombinant virus containing each of the different subunits. Alternatively, different subunits of a protein may be encoded by the same transgene. In this case, a single transgene includes the DNA
encoding each of the subunits, with the DNA for each subunit separated by an internal ribozyme entry site (IRES). This is desirable when the size of the DNA
encoding each of the subunits is small, e.g., the total size of the DNA encoding the subunits and the IRES is less than five kilobases. As an alternative to an IRES, the DNA may be separated by sequences encoding a 2A peptide, which self-cleaves in a post-translational event. See, e.g., M.L.
Donnelly, et al, J. Gen. Virol., 78(Pt 1):13-21 (Jan 1997); Furler, S., et al, Gene Ther., 8(11):864-873 (June 2001); Klump H., et al., Gene Ther., 8(10):811-817 (May 2001). This 2A peptide is significantly smaller than an IRES, making it well suited for use when space is a limiting factor. However, the selected transgene may encode any biologically active product or other product, e.g., a product desirable for study.
Suitable transgenes may be readily selected by one of skill in the art. The selection of the transgene is not considered to be a limitation of this invention.
2. Regulatory Elements In addition to the major elements identified above for the minigene, the vector also includes conventional control elements necessary which are operably linked to the transgene in a manner that permits its transcription, translation and/or expression in a cell transfected with the plasmid vector or infected with the virus produced by the invention. As used herein, "operably linked" sequences include both expression control sequences that are contiguous with the gene of interest and expression control sequences that act in trans or at a distance to control the gene of interest.
Expression control sequences include appropriate transcription initiation, termination, promoter and enhancer sequences; efficient RNA
processing signals such as splicing and polyadenylation (polyA) signals; sequences that stabilize cytoplasmic mRNA; sequences that enhance translation efficiency (i.e., Kozak consensus sequence);
sequences that enhance protein stability; and when desired, sequences that enhance secretion of the encoded product. A great number of expression control sequences, including promoters which are native, constitutive, inducible and/or tissue-specific, are known in the art and may be utilized.
Examples of constitutive promoters include, without limitation, the retroviral Rous sarcoma virus (RSV) LTR promoter (optionally with the RSV
enhancer), the cytomegalovirus (CMV) promoter (optionally with the CMV enhancer) [see, e.g., Boshart et al, Cell, 41:521-530 (1985)], the SV40 promoter, the dihydrofolate reductase promoter, the J3-actin promoter, the phosphoglycerol kinase (PGK) promoter, and the EF1a promoter [Invitrogen].
Inducible promoters allow regulation of gene expression and can be regulated by exogenously supplied compounds, environmental factors such as temperature, or the presence of a specific physiological state, e.g., acute phase, a particular differentiation state of the cell, or in replicating cells only. Inducible promoters and inducible systems are available from a variety of commercial sources, including, without limitation, Invitrogen, Clontech and Ariad. Many other systems have been described and can be readily selected by one of skill in the art. For example, inducible promoters include the zinc-inducible sheep metallothionine (MT) promoter and the dexamethasone (Dex)-inducible mouse mammary tumor virus (MMTV) promoter. Other inducible systems include the T7 polymerase promoter system [WO 98/10088]; the ecdysone insect promoter [No et al, Proc.
Natl. Acad.
Sci. USA, 93:3346-3351 (1996)], the tetracycline-repressible system [Gossen et al, Proc. Natl.
Acad. Sci. USA, 89:5547-5551 (1992)], the tetracycline-inducible system [Gossen et al, Science, 268:1766-1769 (1995), see also Harvey et al, Curr. Opin. Chem. Biol., 2:512-518 (1998)]. Other systems include the FK506 dimer, VP16 or p65 using castradiol, diphenol murislerone, the RU486-inducible system [Wang et al, Nat. Biotech., 15:239-243 (1997) and Wang et al, Gene Ther., 4:432-441 (1997)] and the rapamycin-inducible system [Magari et al, J. Clin. Invest., 100:2865-2872 (1997)]. The effectiveness of some inducible promoters increases over time. In such cases one can enhance the effectiveness of such systems by inserting multiple repressors in tandem, e.g., TetR linked to a TetR by an IRES.
Alternatively, one can wait at least 3 days before screening for the desired function. Once can enhance expression of desired proteins by known means to enhance the effectiveness of this system. For example, using the Woodchuck Hepatitis Virus Posttranscriptional Regulatory Element (WPRE).
In another embodiment, the native promoter for the transgene will be used. The native promoter may be preferred when it is desired that expression of the transgene should mimic the native expression. The native promoter may be used when expression of the transgene must be regulated temporally or developmentally, or in a tissue-specific manner, or in response to specific transcriptional stimuli. In a further embodiment, other native expression control elements, such as enhancer elements, polyadenylation sites or Kozak consensus sequences may also be used to mimic the native expression.
Another embodiment of the transgene includes a transgene operably linked to a tissue-specific promoter. For instance, if expression in skeletal muscle is desired, a promoter active in muscle should be used. These include the promoters from genes encoding skeletal (3-actin, myosin light chain 2A, dystrophin, muscle creatine kinase, as well as synthetic muscle promoters with activities higher than naturally occurring promoters (see Li et al., Nat. Biotech., 17:241-245 (1999)). Examples of promoters that are tissue-specific are known for liver (albumin, Miyatake et al., J. Virol., 71:5124-32 (1997);
hepatitis B virus core promoter, Sandig et al., Gene Ther., 3:1002-9 (1996); alpha-fetoprotein (AFP), Arbuthnot et al., Hum. Gene Ther., 7:1503-14 (1996)), bone osteocalcin (Stein et al., Mol.
Biol. Rep., 24:185-96 (1997)); bone sialoprotein (Chen et al., J. Bone Miner.
Res., 11:654-64 (1996)), lymphocytes (CD2, Hansal et al., J. Immunol., 161:1063-8 (1998);
immunoglobulin heavy chain; T cell receptor chain), neuronal such as neuron-specific enolase (NSE) promoter (Andersen el al., Cell. Mol. Neurobiol., 13:503-15 (1993)), neurofilament light-chain gene (Piccioli et al., Proc. Natl. Acad. Sci. USA, 88:5611-5 (1991)), and the neuron-specific vgf gene (Piccioli et al., Neuron, 15:373-84 (1995)), among others.
Optionally, vectors carrying transgenes encoding therapeutically useful or immunogenic products may also include selectable markers or reporter genes may include sequences encoding geneticin, hygromicin or purimycin resistance, among others. Such selectable reporters or marker genes (preferably located outside the viral genome to be packaged into a viral particle) can be used to signal the presence of the plasmids in bacterial cells, such as ampicillin resistance. Other components of the vector may include an origin of replication. Selection of these and other promoters and vector elements are conventional and many such sequences are available [see, e.g., Sambrook et al.].
These vectors are generated using the techniques and sequences provided herein, in conjunction with techniques known to those of skill in the art.
Such techniques include conventional cloning techniques of cDNA such as those described in texts [Sambrook et al, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Press, Cold Spring Harbor, NY], use of overlapping oligonucleotide sequences of the adenovirus genomes, polymerase chain reaction, and any suitable method which provides the desired nucleotide sequence.
III. Production of the Recombinant Viral Particle In one embodiment, the chimpanzee adenoviral plasmids (or other vectors) are used to produce recombinant adenoviral particles. In one embodiment, the recombinant adenoviruses are functionally deleted in the E 1 a or E 1 b genes, and optionally bearing other mutations, e.g., temperature-sensitive mutations or deletions in other genes. In other embodiments, it is desirable to retain an intact El a and/or El b region in the recombinant adenoviruses. Such an intact El region may be located in its native location in the adenoviral genome or placed in the site of a deletion in the native adenoviral genome (e.g., in the E3 region).
In the construction of useful chimpanzee adenovirus vectors for delivery of a gene to the human (or other mammalian) cell, a range of adenovirus nucleic acid sequences can be employed in the vectors. For example, all or a portion of the adenovirus delayed early gene E3 may be eliminated from the C68 adenovirus sequence which forms a part of the recombinant virus. The function of adenovirus E3 is believed to be irrelevant to the function and production of the recombinant virus particle. Adenovirus vectors may also be constructed having a deletion of at least the ORF6 region of the E4 gene, and more desirably because of the redundancy in the function of this region, the entire E4 region. Still another vector of this invention contains a deletion in the delayed early gene E2a.
Deletions may also be made in any of the late genes L1 through L5 of the chimpanzee adenovirus genome.
Similarly, deletions in the intermediate genes IX and IVa2 may be useful for some purposes.
Other deletions may be made in the other structural or non-structural adenovirus genes. The above discussed deletions may be used individually, i.e., an adenovirus sequence for use in the present invention may contain deletions in only a single region.
Alternatively, deletions of entire genes or portions thereof effective to destroy their biological activity may be used in any combination. For example, in one exemplary vector, the adenovirus sequence may have deletions of the E 1 genes and the E4 gene, or of the E 1, E2a and E3 genes, or of the E 1 and E3 genes, or of E1, E2a and E4 genes, with or without deletion of E3, and so on. As discussed above, such deletions may be used in combination with other mutations, such as temperature-sensitive mutations, to achieve a desired result.
An adenoviral vector lacking any essential adenoviral sequences (e.g., El a, Elb, E2a, E2b, E4 ORF6, L1, L2, L3, L4 and L5) may be cultured in the presence of the missing adenoviral gene products which are required for viral infectivity and propagation of an adenoviral particle. These helper functions may be provided by culturing the adenoviral vector in the presence of one or more helper constructs (e.g., a plasmid or virus) or a packaging host cell. See, for example, the techniques described for preparation of a "minimal" human Ad vector in International Patent Application W096/13597, published May 9, 1996.
1. Helper Viruses Thus, depending upon the chimpanzee adenovirus gene content of the viral vectors employed to carry the minigene, a helper adenovirus or non-replicating virus fragment may be necessary to provide sufficient chimpanzee adenovirus gene sequences necessary to produce an infective recombinant viral particle containing the minigene. Useful helper viruses contain selected adenovirus gene sequences not present in the adenovirus vector construct and/or not expressed by the packaging cell line in which the vector is transfected. In one embodiment, the helper virus is replication-defective and contains a variety of adenovirus genes in addition to the sequences described above. Such a helper virus is desirably used in combination with an E l -expressing cell line.
Helper viruses may also be formed into poly-cation conjugates as described in Wu et a!, J. Biol. Chem., 264:16985-16987 (1989); K. J. Fisher and J. M.
Wilson, Biochem. J., 299:49 (April 1, 1994). Helper virus may optionally contain a second reporter minigene. A number of such reporter genes are known to the art. The presence of a reporter gene on the helper virus which is different from the transgene on the adenovirus vector allows both the Ad vector and the helper virus to be independently monitored. This second reporter is used to enable separation between the resulting recombinant virus and the helper virus upon purification.
2. Complementation Cell Lines To generate recombinant chimpanzee adenoviruses (Ad) deleted in any of the genes described above, the function of the deleted gene region, if essential to the replication and infectivity of the virus, must be supplied to the recombinant virus by a helper virus or cell line, i.e., a complementation or packaging cell line. In many circumstances, a cell line expressing the human E1 can be used to transcomplement the chimp Ad vector. This is particularly advantageous because, due to the diversity between the chimp Ad sequences of the invention and the human AdEI sequences found in currently available packaging cells, the use of the current human E 1-containing cells prevents the generation of replication-competent adenoviruses during the replication and production process. However, in certain circumstances, it will be desirable to utilize a cell line which expresses the E1 gene products can be utilized for production of an El-deleted chimpanzee adenovirus. Such cell lines have been described. See, e.g., US Patent 6,083,716.
If desired, one may utilize the sequences provided herein to generate a packaging cell or cell line that expresses, at a minimum, the adenovirus El gene under the transcriptional control of a promoter for expression in a selected parent cell line. Inducible or constitutive promoters may be employed for this purpose. Examples of such promoters are described in detail elsewhere in this specification. A parent cell is selected for the generation of a novel cell line expressing any desired Ad gene. Without limitation, such a parent cell line may be HeLa [ATCC Accession No. CCL 2], A549 [ATCC Accession No. CCL
185], KB [CCL 17], Detroit [e.g., Detroit 510, CCL 72] and WI-38 [CCL 75] cells, among others.
These cell lines are all available from the American Type Culture Collection, University Boulevard, Manassas, Virginia 20110-2209. Other suitable parent cell lines may be obtained from other sources.
Such E 1-expressing cell lines are useful in the generation of recombinant chimpanzee adenovirus El deleted vectors. Additionally, or alternatively, the invention provides cell lines that express one or more chimpanzee adenoviral gene products, e.g., Ela, Elb, E2a, and/or E4 ORF6, can be constructed using essentially the same procedures for use in the generation of recombinant chimpanzee viral vectors.
Such cell lines can be utilized to transcomplement adenovirus vectors deleted in the essential genes that encode those products, or to provide helper functions necessary for packaging of a helper-dependent virus (e.g., adeno-associated virus). The preparation of a host cell according to this invention involves techniques such as assembly of selected DNA sequences.
This assembly may be accomplished utilizing conventional techniques. Such techniques include cDNA and genomic cloning, which are well known and are described in Sambrook et al., cited above, use of overlapping oligonucleotide sequences of the adenovirus genomes, combined with polymerase chain reaction, synthetic methods, and any other suitable methods which provide the desired nucleotide sequence.
In still another alternative, the essential adenoviral gene products are provided in trans by the adenoviral vector and/or helper virus. In such an instance, a suitable host cell can be selected from any biological organism, including prokaryotic (e.g., bacterial) cells, and eukaryotic cells, including, insect cells, yeast cells and mammalian cells.
Particularly desirable host cells are selected from among any mammalian species, including, without limitation, cells such as A549, WEHI, 3T3, IOTI/2, 293 cells (which express functional adenoviral El), Saos, C2C12, L cells, HT1080, HepG2 and primary fibroblast, hepatocyte and myoblast cells derived from mammals including human, monkey, mouse, rat, rabbit, and hamster. The selection of the mammalian species providing the cells is not a limitation of this invention; nor is the type of mammalian cell, i.e., fibroblast, hepatocyte, tumor cell, etc.
3. Assembly of Viral Particle and Transfection of a Cell Line Generally, when delivering the vector comprising the minigene by transfection, the vector is delivered in an amount from about 5 g to about 100 .tg DNA, and preferably about 10 to about 50 g DNA to about 1 x 104 cells to about 1 x 1013 cells, and preferably about 105 cells. However, the relative amounts of vector DNA to host cells may be adjusted, taking into consideration such factors as the selected vector, the delivery method and the host cells selected.
The vector may be any vector known in the art or disclosed above, including naked DNA, a plasmid, phage, transposon, cosmids, viruses, etc.
Introduction into the host cell of the vector may be achieved by any means known in the art or as disclosed above, including transfection, and infection. One or more of the adenoviral genes may be stably integrated into the genome of the host cell, stably expressed as episomes, or expressed transiently. The gene products may all be expressed transiently, on an episome or stably integrated, or some of the gene products may be expressed stably while others are expressed transiently. Furthermore, the promoters for each of the adenoviral genes may be selected independently from a constitutive promoter, an inducible promoter or a native adenoviral promoter. The promoters may be regulated by a specific physiological state of the organism or cell (i.e., by the differentiation state or in replicating or quiescent cells) or by exogenously-added factors, for example.
Introduction of the molecules (as plasmids or viruses) into the host cell may also be accomplished using techniques known to the skilled artisan and as discussed throughout the specification. In preferred embodiment, standard transfection techniques are used, e.g., CaPO4 transfection or electroporation.
Assembly of the selected DNA sequences of the adenovirus (as well as the transgene and other vector elements into various intermediate plasmids, and the use of the plasmids and vectors to produce a recombinant viral particle are all achieved using conventional techniques. Such techniques include conventional cloning techniques of cDNA
such as those described in texts [Sambrook et al, cited above], use of overlapping oligonucleotide sequences of the adenovirus genomes, polymerase chain reaction, and any suitable method which provides the desired nucleotide sequence. Standard transfection and co-transfection techniques are employed, e.g., CaPO4 precipitation techniques.
Other conventional methods employed include homologous recombination of the viral genomes, plaquing of viruses in agar overlay, methods of measuring signal generation, and the like.
For example, following the construction and assembly of the desired minigene-containing viral vector, the vector is transfected in vitro in the presence of a helper virus into the packaging cell line. Homologous recombination occurs between the helper and the vector sequences, which permits the adenovirus-transgene sequences in the vector to be replicated and packaged into virion capsids, resulting in the recombinant viral vector particles. The current method for producing such virus particles is transfection-based.
However, the invention is not limited to such methods.
The resulting recombinant chimpanzee adenoviruses are useful in transferring a selected transgene to a selected cell. In in vivo experiments with the recombinant virus grown in the packaging cell lines, the E1-deleted recombinant chimpanzee adenoviral vectors of the invention demonstrate utility in transferring a transgene to a non-chimpanzee, preferably a human, cell.
IV. Use of Non-Viral C68 Proteins and C68-derived Adenoviruses The recombinant adenovirus vectors of the invention are useful for gene transfer to a human or non-chimpanzee veterinary patient in vitro, ex vivo, and in vivo. In addition, a variety of C68 proteins described herein are useful in non-viral targeting of transgenes, proteins, chemical molecules, and other moieties or molecules to cells.
Suitable methods of delivery and dosing regimens are readily determined based upon the targeted molecule and targeting protein. Examples of suitable genes and sources of proteins for protein-mediated delivery are provided in the sections below relating to viral delivery of therapeutic and immunogenic molecules. While the discussion below focuses on viral vectors, it will be appreciated that the C68-derived proteins of the invention may be formulated as described herein for the C68-derived viral vectors and the same routes of administration and regimens may be utilized.
The recombinant adenovirus vectors described herein can be used as expression vectors for the production of the products encoded by the heterologous genes in vitro. For example, the recombinant adenoviruses containing a gene inserted into the location of an El deletion may be transfected into an E1-expressing cell line as described above. Alternatively, replication-competent adenoviruses may be used in another selected cell line.
The transfected cells are then cultured in the conventional manner, allowing the recombinant adenovirus to express the gene product from the promoter. The gene product may then be recovered from the culture medium by known conventional methods of protein isolation and recovery from culture.
A C68-derived vector or C68-derived protein of the invention provides an efficient gene transfer vehicle that can deliver a selected transgene or other molecule to a selected host cell in vivo or ex vivo even where the organism has neutralizing antibodies to one or more AAV serotypes. In one embodiment, the rAAV and the cells are mixed ex vivo;
the infected cells are cultured using conventional methodologies; and the transduced cells are re-infused into the patient. These compositions are particularly well suited to gene delivery for therapeutic purposes and for immunization, including inducing protective immunity.
More commonly, the C68-derived vectors and C68-derived proteins of the invention will be utilized for delivery of therapeutic or immunogenic molecules, as described below. It will be readily understood for both applications, that the C68-derived constructs of the invention are useful for use in regimens involving single administrations, as well as in .regimens involving repeat delivery of adenoviral vectors or non-viral targeted delivery, or repeat delivery of the transgene or other molecule to the cells.
Such regimens typically involve delivery of a series of viral vectors in which the viral capsids are alternated. The viral capsids may be changed for each subsequent administration, or after a pre-selected number of administrations of a particular serotype capsid (e.g., one, two, three, four or more). For example, a regimen may involve delivery of a rAd with a C68-derived capsid and delivery with a rAd with another human or non-human primate adenovirus serotype. Optionally, these regimens may involve administration of rAd with capsids of other non-human primate adenoviruses, human adenoviruses, or artificial serotypes such as are described herein. Alternativley, the regimens involve administration of C68-derived proteins for non-viral targeting with repeat administrations of C68-derived proteins, or with other protein-based delivery systems. Each phase of these regimens can involve administration of a series of injections (or other delivery routes) with a single C68-derived construct followed by a series with another Ad serotype construct.
Alternatively, the C68-derived vectors and proteins of the invention may be utilized in regimens involving other non-adenoviral-mediated delivery systems, including other viral systems, non-viral delivery systems, protein, peptides, and other biologically active molecules.
The following sections will focus on exemplary molecules which may be delivered via the adenoviral vectors of the invention.
A. Ad-Mediated Delivery of Therapeutic Molecules In one embodiment, the above-described C68-derived constructs are administered to humans according to published methods for gene therapy. A C68-derived construct bearing a transgene can be administered to a patient, preferably suspended in a biologically compatible solution or pharmaceutically acceptable delivery vehicle. A suitable vehicle includes sterile saline. Other aqueous and non-aqueous isotonic sterile injection solutions and aqueous and non-aqueous sterile suspensions known to be pharmaceutically acceptable carriers and well known to those of skill in the art may be employed for this purpose.
The C68-derived adenoviral vectors are administered in sufficient amounts to transduce the target cells and to provide sufficient levels of gene transfer and expression to provide a therapeutic benefit without undue adverse or with medically acceptable physiological effects, which can be determined by those skilled in the medical arts.
Conventional and pharmaceutically acceptable routes of administration include, but are not limited to, direct delivery to the retina and other intraocular delivery methods, direct delivery to the liver, intranasal, intravenous, intramuscular, intratracheal, subcutaneous, intradermal, rectal, oral and other parenteral routes of administration. Routes of administration may be combined, if desired, or adjusted depending upon the transgene or the condition. The route of administration primarily will depend on the nature of the condition being treated.
Dosages of the viral vector will depend primarily on factors such as the condition being treated, the age, weight and health of the patient, and may thus vary among patients. For example, a therapeutically effective adult human or veterinary dosage of the viral vector is generally in the range of from about 100 L to about 100 mL of a carrier containing concentrations of from about 1 x 106 to about 1 x 1015 particles, about 1 x 1011 to 1 x 1013 particles, or about 1 x 109 to IX 1012 particles virus. Dosages will range depending upon the size of the animal and the route of administration. For example, a suitable human or veterinary dosage (for about an 80 kg animal) for intramuscular injection is in the range of about 1 x 109 to about 5 x 1012 particles per mL, for a single site.
Optionally, multiple sites of administration may be delivered. In another example, a suitable human or veterinary dosage may be in the range of about 1 x 1011 to about 1 x 1015 particles for an oral formulation.
When C68 proteins of the invention are utilized for targeted delivery, suitable dosage ranges, a therapeutically effective adult human or veterinary dosage of the construct is generally in the range of from about 100 L to about 100 mL of a carrier containing concentrations of from about 0.01 g to about 100 mg protein, about 0.1 g to about 10 mg, about I pg to about I mg protein. Dosages will range depending upon the size of the animal and the route of administration. Routes of administration may be readily selected from any suitable route including, without limitation, the routes described above.
One of skill in the art may adjust these doses, depending the route of administration, and the therapeutic or vaccinal application for which the C68-derived construct is employed. The levels of expression of the transgene, or for an immunogen, the level of circulating antibody, can be monitored to determine the frequency of dosage administration. Yet other methods for determining the timing of frequency of administration will be readily apparent to one of skill in the art.
An optional method step involves the co-administration to the patient, either concurrently with, or before or after administration of the C68-derived construct, of a suitable amount of a short acting immune modulator. The selected immune modulator is defined herein as an agent capable of inhibiting the formation of neutralizing antibodies directed against the recombinant vector of this invention or capable of inhibiting cytolytic T
lymphocyte (CTL) elimination of the vector. The immune modulator may interfere with the interactions between the T helper subsets (TH} or TH2) and B cells to inhibit neutralizing antibody formation. Alternatively, the immune modulator may inhibit the interaction between THI cells and CTLs to reduce the occurrence of CTL elimination of the vector. A
variety of useful immune modulators and dosages for use of same are disclosed, for example, in Yang et al., J. Virol., 70(9) (Sept., 1996); International Patent Application No.
W096/12406, published May 2, 1996; and International Patent Application No. PCT/US96/03035.
1. Therapeutic Transgenes Useful therapeutic products encoded by the transgene include hormones and growth and differentiation factors including, without limitation, insulin, glucagon, growth hormone (GH), parathyroid hormone (PTH), growth hormone releasing factor (GRF), follicle stimulating hormone (FSH), luteinizing hormone (LH), human chorionic gonadotropin (hCG), vascular endothelial growth factor (VEGF), angiopoietins, angiostatin, granulocyte colony stimulating factor (GCSF), erythropoietin (EPO), connective tissue growth factor (CTGF), basic fibroblast growth factor (bFGF), acidic fibroblast growth factor (aFGF), epidermal growth factor (EGF), transforming growth factor (TGF), platelet-derived growth factor (PDGF), insulin growth factors I and II (IGF-I and IGF-II), any one of the transforming growth factor superfamily, including TGF, activins, inhibins, or any of the bone morphogenic proteins (BMP) BMPs 1-15, any one of the heregluin/neuregulin/ARIA/neu differentiation factor (NDF) family of growth factors, nerve growth factor (NGF), brain-derived neurotrophic factor (BDNF), neurotrophins NT-3 and NT-4/5, ciliary neurotrophic factor (CNTF), glial cell line derived neurotrophic factor (GDNF), neurturin, agrin, any one of the family of semaphorins/collapsins, netrin-1 and netrin-2, hepatocyte growth factor (HGF), ephrins, noggin, sonic hedgehog and tyrosine hydroxylase.
Other useful transgene products include proteins that regulate the immune system including, without limitation, cytokines and lymphokines such as thrombopoietin (TPO), interleukins (IL) IL-1 through IL-18, monocyte chemoattractant protein, leukemia inhibitory factor, granulocyte-macrophage colony stimulating factor, Fas ligand, tumor necrosis factors and, interferons, and, stem cell factor, flk-2/flt3 ligand. Gene products produced by the immune system are also useful in the invention. These include, without limitations, immunoglobulins IgG, IgM, IgA, IgD and IgE, chimeric immunoglobulins, humanized antibodies, single chain antibodies, T cell receptors, chimeric T
cell receptors, single chain T cell receptors, class I and class II MHC
molecules, as well as engineered immunoglobulins and MHC molecules. Useful gene products also include complement regulatory proteins such as complement regulatory proteins, membrane cofactor protein (MCP), decay accelerating factor (DAF), CR1, CF2 and CD59.
Still other useful gene products include any one of the receptors for the hormones, growth factors, cytokines, lymphokines, regulatory proteins and immune system proteins. The invention encompasses receptors for cholesterol regulation, including the low density lipoprotein (LDL) receptor, high density lipoprotein (HDL) receptor, the very low density lipoprotein (VLDL) receptor, and the scavenger receptor. The invention also encompasses gene products such as members of the steroid hormone receptor superfamily including glucocorticoid receptors and estrogen receptors, Vitamin D receptors and other nuclear receptors. In addition, useful gene products include transcription factors such as jun, fos, max, mad, serum response factor (SRF), AP-1, AP2, myb, MyoD and myogenin, ETS-box containing proteins, TFE3, E2F, ATF1, ATF2, ATF3, ATF4, ZF5, NFAT, CREB, HNF-4, C/EBP, SP 1, CCAAT-box binding proteins, interferon regulation factor (IRF-1), Wilms tumor protein, ETS-binding protein, STAT, GATA-box binding proteins, e.g., GATA-3, and the forkhead family of winged helix proteins.
Other useful gene products include, carbamoyl synthetase I, ornithine transcarbamylase, arginosuccinate synthetase, arginosuccinate lyase, arginase, fumarylacetacetate hydrolase, phenylalanine hydroxylase, alpha-1 antitrypsin, glucose-6-phosphatase, porphobilinogen deaminase, factor VIII, factor IX, cystathione beta-synthase, branched chain ketoacid decarboxylase, albumin, isovaleryl-coA dehydrogenase, propionyl CoA carboxylase, methyl malonyl CoA mutase, glutaryl CoA dehydrogenase, insulin, beta-glucosidase, pyruvate carboxylate, hepatic phosphorylase, phosphorylase kinase, glycine decarboxylase, H-protein, T-protein, a cystic fibrosis transmembrane regulator (CFTR) sequence, and a dystrophin cDNA sequence.
Other useful gene products include non-naturally occurring polypeptides, such as chimeric or hybrid polypeptides having a non-naturally occurring amino acid sequence containing insertions, deletions or amino acid substitutions.
For example, single-chain engineered immunoglobulins could be useful in certain immunocompromised patients. Other types of non-naturally occurring gene sequences include antisense molecules and catalytic nucleic acids, such as ribozymes, which could be used to reduce overexpression of a target.
Reduction and/or modulation of expression of a gene are particularly desirable for treatment of hyperproliferative conditions characterized by hyperproliferating cells, as are cancers and psoriasis. Target polypeptides include those polypeptides which are produced exclusively or at higher levels in hyperproliferative cells as compared to normal cells. Target antigens include polypeptides encoded by oncogenes such as myb, myc, fyn, and the translocation gene bcr/abl, ras, src, P53, neu, trk and EGRF. In addition to oncogene products as target antigens, target polypeptides for anti-cancer treatments and protective regimens include variable regions of antibodies made by B cell lymphomas and variable regions of T cell receptors of T cell lymphomas which, in some embodiments, are also used as target antigens for autoimmune disease. Other tumor-associated polypeptides can be used as target polypeptides such as polypeptides which are found at higher levels in tumor cells including the polypeptide recognized by monoclonal antibody 17-1A and folate binding polypeptides. Such target polypeptides and their ligands are also useful in forming fusion partners with a C68 protein of the invention.
Other suitable therapeutic polypeptides and proteins include those which may be useful for treating individuals suffering from autoimmune diseases and disorders by conferring a broad based protective immune response against targets that are associated with autoimmunity including cell receptors and cells which produce self-directed antibodies. T cell mediated autoimmune diseases include Rheumatoid arthritis (RA), multiple sclerosis (MS), Sjogren's syndrome, sarcoidosis, insulin dependent diabetes mellitus (IDDM), autoimmune thyroiditis, reactive arthritis, ankylosing spondylitis, scleroderma, polymyositis, dermatomyositis, psoriasis, vasculitis, Wegener's granulomatosis, Crohn's disease and ulcerative colitis. Each of these diseases is characterized by T
cell receptors (TCRs) that bind to endogenous antigens and initiate the inflammatory cascade associated with autoimmune diseases.
The C68-derived constructs of the invention are particularly well suited for therapeutic regimens in which multiple deliveries of transgenes is desired, e.g., in regimens involving redelivery of the same transgene or in combination regimens involving delivery of other transgenes. Such regimens may involve administration of a C68-derived construct, followed by re-administration with a vector from the same serotype adenovirus.
Particularly desirable regimens involve administration of a C68-derived construct of the invention, in which the serotype of the viral vector delivered in the first administration differs from the serotype of the viral vector utilized in one or more of the subsequent administrations. For example, a therapeutic regimen involves administration of a C68-derived vector and repeat administration with one or more adenoviral vectors of the same or different serotypes. In another example, a therapeutic regimen involves administration of an adenoviral vector followed by repeat administration with a C68-derived vector of the invention which differs from the serotype of the first delivered adenoviral vector, and optionally further administration with another vector which is the same or, preferably, differs from the serotype of the vector in the prior administration steps. These regimens are not limited to delivery of adenoviral vectors constructed using the C68-derived capsids of the invention. Rather, these regimens can readily utilize constructs, including non-viral targeting proteins and viral vectors, from other adenoviral serotypes, including, without limitation, other chimpanzee adenoviral serotypes (e.g., Cl, etc), other non-human primate adenoviral serotypes, or human adenoviral serotypes, in combination with one or more of the C68-derived constructs of the invention. Examples of such chimpanzee, other non-human primate and human adenoviral serotypes are discussed elsewhere in this document.
Further, these therapeutic regimens may involve either simultaneous or sequential delivery of C68-derived constructs of the invention in combination with non-adenoviral vectors, non-viral vectors, and/or a variety of other therapeutically useful compounds or molecules. The present invention is not limited to these therapeutic regimens, a variety of which will be readily apparent to one of skill in the art.
B. Ad-Mediated Delivery of Immunogenic Transgenes The C68-derived constructs of the invention, including viral vectors and proteins, may also be employed as immunogenic compositions. As used herein, an immunogenic composition is a composition to which a humoral (e.g., antibody) or cellular (e.g., a cytotoxic T cell) response is mounted to a transgene product delivered by the immunogenic composition following delivery to a mammal, and preferably a primate. The present invention provides a recombinant C68-derived Ad that can contain in any of its adenovirus sequence deletions a gene encoding a desired immunogen, or a C68 protein capable of targeting an immunogenic molecule. The C68-derived adenovirus is well suited for use as a live recombinant virus vaccine in different animal species compared to an adenovirus of human origin, but is not limited to such a use. The recombinant adenoviruses and C68 proteins can be used as prophylactic or therapeutic vaccines against any pathogen for which the antigen(s) crucial for induction of an immune response and able to limit the spread of the pathogen has been identified and for which the cDNA is available.
Such vaccinal (or other immunogenic) compositions are formulated in a suitable delivery vehicle, as described above. Generally, doses for the immunogenic compositions are in the range defined above for therapeutic compositions. The levels of immunity of the selected gene can be monitored to determine the need, if any, for boosters.
Following an assessment of antibody titers in the serum, optional booster immunizations may be desired.
Optionally, a vaccinal composition of the invention may be formulated to contain other components, including, e.g. adjuvants, stabilizers, pH
adjusters, preservatives and the like. Such components are well known to those of skill in the vaccine art. Examples of suitable adjuvants include, without limitation, liposomes, alum, monophosphoryl lipid A, and any biologically active factor, such as cytokine, an interleukin, a chemokine, a ligands, and optimally combinations thereof. Certain of these biologically active factors can be expressed in vivo, e.g., via a polynucleotide, plasmid or viral vector. For example, such an adjuvant can be administered with a priming DNA vaccine encoding an antigen to enhance the antigen-specific immune response compared with the immune response generated upon priming with a DNA vaccine encoding the antigen only.
The recombinant adenoviruses are administered in a "an immunogenic amount", that is, an amount of recombinant adenovirus that is effective in a route of administration to transfect the desired cells and provide sufficient levels of expression of the selected gene to induce an immune response. Where protective immunity is provided, the recombinant adenoviruses are considered to be vaccine compositions useful in preventing infection and/or recurrent disease.
Alternatively, or in addition, the vectors of the invention may contain, or capsid or other protein can be utilized to target a transgene encoding a peptide, polypeptide or protein which induces an immune response to a selected immunogen. The C68-derived viruses of this invention are expected to be highly efficacious at inducing cytolytic T cells and antibodies to the inserted heterologous antigenic protein expressed by the vector.
1. Immunogenic Transgenes For example, immunogens may be selected from a variety of viral families. Example of desirable viral families against which an immune response would be desirable include, the picornavirus family, which includes the genera rhinoviruses, which are responsible for about 50% of cases of the common cold; the genera enteroviruses, which include polioviruses, coxsackieviruses, echoviruses, and human enteroviruses such as hepatitis A virus; and the genera apthoviruses, which are responsible for foot and mouth diseases, primarily in non-human animals. Within the picornavirus family of viruses, target antigens include the VP1, VP2, VP3, VP4, and VPG. Another viral family includes the calcivirus family, which encompasses the Norwalk group of viruses, which are an important causative agent of epidemic gastroenteritis. Still another viral family desirable for use in targeting antigens for inducing immune responses in humans and non-human animals is the togavirus family, which includes the genera alphavirus, which include Sindbis viruses, RossRiver virus, and Venezuelan, Eastern & Western Equine encephalitis, and rubivirus, including Rubella virus. The flaviviridae family includes dengue, yellow fever, Japanese encephalitis, St. Louis encephalitis and tick borne encephalitis viruses.
Other target antigens may be generated from the Hepatitis C or the coronavirus family, which includes a number of non-human viruses such as infectious bronchitis virus (poultry), porcine transmissible gastroenteric virus (pig), porcine hemagglutinating encephalomyelitis virus (pig), feline infectious peritonitis virus (cats), feline enteric coronavirus (cat), canine coronavirus (dog), and human respiratory coronaviruses, which may cause the common cold and/or non-A, B or C hepatitis. Within the coronavirus family, target antigens include the El (also called M or matrix protein), E2 (also called S or Spike protein), E3 (also called HE or hemagglutin-elterose) glycoprotein (not present in all coronaviruses), or N
(nucleocapsid).
Still other antigens may be targeted against the rhabdovirus family, which includes the genera vesiculovirus (e.g., Vesicular Stomatitis Virus), and the general lyssavirus (e.g., rabies).
Within the rhabdovirus family, suitable antigens may be derived from the G
protein or the N
protein. The family filoviridae, which includes hemorrhagic fever viruses such as Marburg and Ebola virus may be a suitable source of antigens. The paramyxovirus family includes parainfluenza Virus Type 1, parainfluenza Virus Type 3, bovine parainfluenza Virus Type 3, rubulavirus (mumps virus), parainfluenza Virus Type 2, parainfluenza virus Type 4, Newcastle disease virus (chickens), rinderpest, morbillivirus, which includes measles and canine distemper, and pneumovirus, which includes respiratory syncytial virus.
The influenza virus is classified within the family orthomyxovirus and is a suitable source of antigen (e.g., the HA protein, the N 1 protein). The bunyavirus family includes the genera bunyavirus (California encephalitis, La Crosse), phlebovirus (Rift Valley Fever), hantavirus (puremala is a hemahagin fever virus), nairovirus (Nairobi sheep disease) and various unassigned bungaviruses. The arenavirus family provides a source of antigens against LCM
and Lassa fever virus. The reovirus family includes the genera reovirus, rotavirus (which causes acute gastroenteritis in children), orbiviruses, and cultivirus (Colorado Tick fever, Lebombo (humans), equine encephalosis, blue tongue).
The retrovirus family includes the sub-family oncorivirinal which encompasses such human and veterinary diseases as feline leukemia virus, HTLVI
and HTLVII, lentivirinal (which includes human immunodeficiency virus (HIV), simian immunodeficiency virus (SIV), feline immunodeficiency virus (FIV), equine infectious anemia virus, and spumavirinal). Among the lentiviruses, many suitable antigens have been described and can readily be selected. Examples of suitable HIV and SIV
antigens include, without limitation the gag, pol, Vif, Vpx, VPR, Env, Tat, Nef, and Rev proteins, as well as various fragments thereof. For example, suitable fragments of the Env protein may include any of its subunits such as the gp120, gp160, gp41, or smaller fragments thereof, e.g., of at least about 8 amino acids in length. Similarly, fragments of the tat protein may be selected.
[See, US Patent 5,891,994 and US Patent 6,193,981.] See, also, the HIV and SIV
proteins described in D.H. Barouch et al, J. Virol., 75(5):2462-2467 (March 2001), and R.R. Amara, et al, Science, 292:69-74 (6 April 2001). In another example, the HIV and/or SIV
immunogenic proteins or peptides may be used to form fusion proteins or other immunogenic molecules.
See, e.g., the HIV-1 Tat and/or Nef fusion proteins and immunization regimens described in WO 01/54719, published August 2, 2001, and WO 99/16884, published April 8, 1999. The invention is not limited to the HIV and/or SIV immunogenic proteins or peptides described herein. In addition, a variety of modifications to these proteins have been described or could 3o readily be made by one of skill in the art. See, e.g., the modified gag protein that is described in US Patent 5,972,596. Further, any desired HIV and/or SIV immunogens may be delivered alone or in combination. Such combinations may include expression from a single vector or from multiple vectors. Optionally, another combination may involve delivery of one or more expressed immunogens with delivery of one or more of the immunogens in protein form.
Such combinations are discussed in more detail below.
The papovavirus family includes the sub-family polyomaviruses (BKU
and JCU viruses) and the sub-family papillomavirus (associated with cancers or malignant progression of papilloma). The adenovirus family includes viruses (EX, AD7, ARD, O.B.) which cause respiratory disease and/or enteritis. The parvovirus family feline parvovirus (feline enteritis), feline panleucopeniavirus, canine parvovirus, and porcine parvovirus. The herpesvirus family includes the sub-family alphaherpesvirinae, which encompasses the genera simplexvirus (HSVI, HSVII), varicellovirus (pseudorabies, varicella zoster) and the sub-family betaherpesvirinae, which includes the genera cytomegalovirus (HCMV, muromegalovirus) and the sub-family gammaherpesvirinae, which includes the genera lymphocryptovirus, EBV (Burkitts lymphoma), infectious rhinotracheitis, Marek's disease virus, and rhadinovirus. The poxvirus family includes the sub-family chordopoxvirinae, which encompasses the genera orthopoxvirus (Variola (Smallpox) and Vaccinia (Cowpox)), parapoxvirus, avipoxvirus, capripoxvirus, leporipoxvirus, suipoxvirus, and the sub-family entomopoxvirinae. The hepadnavirus family includes the Hepatitis B virus. One unclassified virus which may be suitable source of antigens is the Hepatitis delta virus.
Still other viral sources may include avian infectious bursal disease virus and porcine respiratory and reproductive syndrome virus. The alphavirus family includes equine arteritis virus and various Encephalitis viruses.
The present invention may also encompass immunogens which are useful to immunize a human or non-human animal against other pathogens including bacteria, fungi, parasitic microorganisms or multicellular parasites which infect human and non-human vertebrates, or from a cancer cell or tumor cell. Examples of bacterial pathogens include pathogenic gram-positive cocci include pneumococci; staphylococci; and streptococci.
Pathogenic gram-negative cocci include meningococcus; gonococcus. Pathogenic enteric gram-negative bacilli include enterobacteriaceae; pseudomonas, acinetobacteria and eikenella; melioidosis; salmonella; shigella; haemophilus; moraxella; H.
ducreyi (which causes chancroid); brucella; Franisella tularensis (which causes tularemia);
yersinia (pasteurella); streptobacillus moniliformis and spirillum; Gram-positive bacilli include listeria monocytogenes; erysipelothrix rhusiopathiae; Corynebacterium diphtheria (diphtheria);
cholera; B. anthracis (anthrax); donovanosis (granuloma inguinale); and bartonellosis.
Diseases caused by pathogenic anaerobic bacteria include tetanus; botulism;
other clostridia;
tuberculosis; leprosy; and other mycobacteria. Pathogenic spirochetal diseases include syphilis; treponematoses: yaws, pinta and endemic syphilis; and leptospirosis.
Other infections caused by higher pathogen bacteria and pathogenic fungi include actinomycosis;
nocardiosis; cryptococcosis, blastomycosis, histoplasmosis and coccidioidomycosis;
candidiasis, aspergillosis, and mucormycosis; sporotrichosis;
paracoccidiodomycosis, petriellidiosis, torulopsosis, mycetoma and chromomycosis; and dermatophytosis. Rickettsial infections include Typhus fever, Rocky Mountain spotted fever, Q fever, and Rickettsialpox.
Examples of mycoplasma and chlamydial infections include: mycoplasma pneumoniae;
lymphogranuloma venereum; psittacosis; and perinatal chlamydial infections.
Pathogenic eukaryotes encompass pathogenic protozoans and helminths and infections produced thereby include: amebiasis; malaria; leishmaniasis; trypanosomiasis; toxoplasmosis;
Pneumocystis carinii; Trichans; Toxoplasma gondii; babesiosis; giardiasis; trichinosis;
filariasis;
schistosomiasis; nematodes; trematodes or flukes; and cestode (tapeworm) infections.
Many of these organisms and/or toxins produced thereby have been identified by the Centers for Disease Control [(CDC), Department of Heath and Human Services, USA], as agents which have potential for use in biological attacks.
For example, some of these biological agents, include, Bacillus anthracis (anthrax), Clostridium botulinum and its toxin (botulism), Yersiniapestis (plague), variola major (smallpox), Francisella tularensis (tularemia), and viral hemorrhagic fever, all of which are currently classified as Category A agents; Coxiella burnetti (Q fever); Brucella species (brucellosis), Burkholderia mallei (glanders), Ricinus communis and its toxin (ricin toxin), Clostridium perfringens and its toxin (epsilon toxin), Staphylococcus species and their toxins (enterotoxin B), all of which are currently classified as Category B agents; and Nipan virus, multidrug-resistant tuberculosis, yellow fever, tickborne hemorrhagic fever viruses, tickborne encephalitis viruses, and hantaviruses, which are currently classified as Category C
agents. In addition, other organisms, which are so classified or differently classified, may be identified and/or used for such a purpose in the future. It will be readily understood that the viral vectors and other constructs described herein are useful to deliver antigens from these organisms, viruses, their toxins or other by-products, which will prevent and/or treat infection or other adverse reactions with these biological agents.
Administration of the vectors and proteins of the invention to deliver immunogens against the variable region of the T cells elicit an immune response including CTLs to eliminate those T cells. In RA, several specific variable regions of TCRs which are involved in the disease have been characterized. These TCRs include V-3, V-14, V-17 and Va-17. Thus, delivery of a nucleic acid sequence that encodes at least one of these polypeptides will elicit an immune response that will target T cells involved in RA. In MS, several specific variable regions of TCRs which are involved in the disease have been characterized. These TCRs include V-7 and Va-10. Thus, delivery of a nucleic acid sequence that encodes at least one of these polypeptides will elicit an immune response that will target T cells involved in MS. In scleroderma, several specific variable regions of TCRs which are involved in the disease have been characterized. These TCRs include V-6, V-8, V-14 and Va-16, Va-3C, Va-7, Va-14, Va-15, Va-16, Va-28 and Va-12. Thus, delivery of a recombinant chimpanzee adenovirus that encodes at least one of these polypeptides will elicit an immune response that will target T cells involved in scleroderma.
C. Ad-Mediated Delivery Methods The therapeutic levels, or levels of immunity, of the selected gene can be monitored to determine the need, if any, for boosters. Following an assessment of CD8+ T
cell response, or optionally, antibody titers, in the serum, optional booster immunizations may be desired. Optionally, the C68-derived constructs of the invention may be delivered in a single administration or in various combination regimens, e.g., in combination with a regimen or course of treatment involving other active ingredients or in a prime-boost regimen. A
variety of such regimens have been described in the art and may be readily selected.
For example, prime-boost regimens may involve the administration of a DNA
(e.g., plasmid) based vector to prime the immune system to second, booster, administration with a traditional antigen, such as a protein or a recombinant virus carrying the sequences encoding such an antigen. See, e.g., WO 00/11140, published March 2, 2000.
Alternatively, an immunization regimen may involve the administration of a recombinant chimpanzee adenoviral vector of the invention to boost the immune response to a vector (either viral or DNA-based) carrying an antigen, or a protein. In still another alternative, an immunization regimen involves administration of a protein followed by booster with a vector encoding the antigen.
In one embodiment, the invention provides a method of priming and boosting an immune response to a selected antigen by delivering a plasmid DNA vector carrying said antigen, followed by boosting with a recombinant chimpanzee adenoviral vector of the invention. In one embodiment, the prime-boost regimen involves the expression of multiproteins from the prime and/or the boost vehicle. See, e.g., R.R. Amara, Science, 292:69-74 (6 April 2001) which describes a multiprotein regimen for expression of protein subunits useful for generating an immune response against HIV and SIV. For example, a DNA prime may deliver the Gag, Pol, Vif, VPX and Vpr and Env, Tat, and Rev from a single transcript. Alternatively, the SIV Gag, Pol and HIV-l Env is delivered in a recombinant adenovirus construct of the invention. Still other regimens are described in and WO 01/54719.
However, the prime-boost regimens are not limited to immunization for HIV
or to delivery of these antigens. For example, priming may involve delivering with a first chimp vector of the invention followed by boosting with a second chimp vector, or with a composition containing the antigen itself in protein form. In one example, the prime-boost regimen can provide a protective immune response to the virus, bacteria or other organism from which the antigen is derived. In another desired embodiment, the prime-boost regimen provides a therapeutic effect that can be measured using convention assays for detection of the presence of the condition for which therapy is being administered.
The priming composition may be administered at various sites in the body in a dose dependent manner, which depends on the antigen to which the desired immune response is being targeted. The invention is not limited to the amount or situs of injection(s) or to the pharmaceutical carrier. Rather, the regimen may involve a priming and/or boosting step, each of which may include a single dose or dosage that is administered hourly, daily, weekly or monthly, or yearly. As an example, the mammals may receive one or two doses containing between about 10 g to about 50 g of plasmid in carrier. A desirable amount of a DNA
composition ranges between about 1 .tg to about 10,000 .tg of the DNA vector.
Dosages may vary from about 1 g to 1000 g DNA per kg of subject body weight. The amount or site of delivery is desirably selected based upon the identity and condition of the mammal.
The dosage unit of the vector suitable for delivery of the antigen to the mammal is described herein. The vector is prepared for administration by being suspended or dissolved in a pharmaceutically or physiologically acceptable carrier such as isotonic saline;
isotonic salts solution or other formulations that will be apparent to those skilled in such administration. The appropriate carrier will be evident to those skilled in the art and will depend in large part upon the route of administration. The compositions of the invention may be administered to a mammal according to the routes described above, in a sustained release formulation using a biodegradable biocompatible polymer, or by on-site delivery using micelles, gels and liposomes. Optionally, the priming step of this invention also includes administering with the priming composition, a suitable amount of an adjuvant, such as are defined herein.
Preferably, a boosting composition is administered about 2 to about 27 weeks after administering the priming composition to the mammalian subject. The administration of the boosting composition is accomplished using an effective amount of a boosting composition containing or capable of delivering the same antigen as administered by the priming DNA vaccine. The boosting composition may be composed of a recombinant viral vector derived from the same viral source (e.g., adenoviral sequences of the invention) or from another source. Alternatively, the "boosting composition" can be a composition containing the same antigen as encoded in the priming DNA vaccine, but in the form of a protein or peptide, which composition induces an immune response in the host.
In another embodiment, the boosting composition contains a DNA sequence encoding the antigen under the control of a regulatory sequence directing its expression in a mammalian cell, e.g., vectors such as well-known bacterial or viral vectors. The primary requirements of the boosting composition are that the antigen of the composition is the same antigen, or a cross-reactive antigen, as that encoded by the priming composition.
In another embodiment, the chimpanzee adenoviral vectors and C68 targeting proteins of the invention are also well suited for use in a variety of other immunization and therapeutic regimens. Such regimens may involve delivery of C68 constructs of the invention simultaneously or sequentially with Ad constructs of different serotype capsids, regimens in which C68-derived constructs of the invention are delivered simultaneously or sequentially with non-Ad vectors, regimens in which the adenoviral vectors of the invention are delivered simultaneously or sequentially with proteins, peptides, and/or other biologically useful therapeutic or immunogenic compounds. Such uses will be readily apparent to one of skill in the art.
V. Method for Rapid Screening of Bacterial Transformants An elegant selection method is provided by the present invention, which permits the rapid screening of constructs produced by homologous recombination or direct cloning methods. As used herein, these constructs are preferably viruses, but may include other types of vectors, such as a cosmid, episome, plasmid, or other genetic element that delivers a heterologous molecule to cells.
In one desired embodiment, the method utilizes the gene encoding green fluorescent protein (GFP), to provide a green-white selection method in which the presence of a recombinant is detected by the absence of GFP expression (i.e., the recombinants are observed as white in a green background). Alternatively, the method may utilize another suitable marker genes, including, without limitation, other fluorescent proteins and luciferase.
In one example, the method is used for production of a recombinant construct from homologous recombination of co-transected vectors into a selected host cell.
As used herein, a host cell may be readily selected from an biological organism, including prokaryotic and eukaryotic cells, such as those discussed in the section related to production of a recombinant viral particle. Selection of the host cell is not a limitation of the present invention.
Suitably, each of the vectors contains the marker gene (e.g., GFP) under the control of a promoter that directs expression thereof in a host cell. Alternatively, each of the parental vectors may contain a different marker gene that allows them to be distinguished not only from the recombinant construct produced, but also from each other. Preferably, where prokaryotic GFP is utilized, it is under the control of a prokaryotic promoter such as the promoter from lacZ. However, other suitable prokaryotic or non-prokaryotic promoters may be readily selected from among the promoters described herein and known to those of skill in the art. Advantageously, the GFP protein is placed in the portion of the vectors that are eliminated during homologous recombination and thus, the GFP protein is absent from the recombinant vector produced. In this manner, the presence of unrecombined parental vectors are readily detected under a phase contrast fluorescent microscope (or other suitable detection means) as expressing the marker gene and the recombinant constructs lack expression of the marker. In the methods in which both parent vectors utilize GFP, the recombinant appears as white in a background of green.
In another example, the method is used for production of a recombinant construct involving homologous recombination, in which the host cell stably contains at least one of the parental constructs to be utilized for production of the recombinant construct. In this embodiment, the host cell can be subjected to a single transfection. In still other embodiments, the method of the invention may be utilized for triple transfections. As with the double transfection described above, the parental constructs may contain the same marker gene or may contain different marker genes.
In another example, the method of the invention is used from production of a recombinant construct by direct cloning. Suitably, in this embodiment, the marker gene is present is that portion of the parent construct which is deleted during the cloning process. For example, the marker gene expression cassette (i.e., the gene, promoter, and any other necessary regulatory sequences) is engineered into the E1- or E3-region of an adenoviral vector, into which a transgene or minigene cassette will be cloned. The success of direct cloning into the target region can be readily detected by the absence of marker gene expression.
Optionally, the method of the invention can be readily assembled in the form of a kit which is available in a commercially useful format for production of recombinant constructs, 3o e.g., recombinant adenoviruses. Typically, such kits will include plasmid backbones containing a desired viral genome containing a marker gene inserted at a point upstream or downstream of the recombination site, as appropriate, or a plasmid backbone containing the marker gene inserted at the splice site for direct cloning of a heterologous gene. Such a kit can further include appropriate culture media, host cells, a test control, instructions, and other suitable materials.
In the examples below, this method is used in production of adenoviruses.
However, it will be readily understood that this method may be readily adapted for use in generating other types of adenoviral, or non-adenoviral viral vectors.
The following examples are provided to illustrate the invention and do not limit the scope thereof One skilled in the art will appreciate that although specific reagents and conditions are outlined in the following examples, modifications can be made which are meant to be encompassed by the spirit and scope of the invention.
Example 1 - Creation of an El deleted vector based on Chimpanzee Adenovirus C68 Using Green-white Selection Of Recombinants A replication defective version of C68 was isolated for use in gene transfer.
The classic strategy of creating a recombinant with El deleted, by homologous recombination in an E 1 expressing cell line was pursued. The first step was creation of a plasmid containing m.u. 0 through 1.3 followed by addition of a minigene expressing enhanced green fluorescent protein (GFP) from a CMV promoter and C68 sequence spanning 9-16.7 m.u. This linearized plasmid was cotransfected into an E 1 expressing cell line with Ssp I-digested C68 plasmid (SspI cuts at 3.6 m.u. leaving 4644 bp for homologous recombination).
Experiments were initially conducted with 293 cells which harbor E 1 from human Ad5 with the hope that this would suffice for transcomplementation. Indeed, plaques formed which represented the desired recombinant. The resulting vector was called C68-CMV-GFP.
The strategy for generating recombinants was modified to enable efficient and rapid isolation of recombinants. First, the alkaline phosphatase DNA in the initial shuttle vector was replaced with a prokaryotic GFP gene driven by the prokaryotic promoter from lacZ.
This allowed efficient screening of bacterial transformations when attempting to incorporate a desired eukaryotic RNA po1 II transcriptional unit into the shuttle vector.
The resulting transformation can be screened for expression of GFP; white colonies are recombinants while green colonies are residual parental plasmid.
A green-white selection has been used to screen the products of cotransfection for the isolation of human Ad5 recombinants (A.R. Davis et al, Gene Thera., 5:1148-1152 (1998)).
In the present system, and in contrast to Davis, the initial shuttle vector was revised to include extended 3' sequences from 9 to 26 MU. This vector was cotransfected with viral DNA from the original C68-CMV-GFP isolate that had been restricted with Xba I, which cuts at MU
16.5 allowing for 9.5 Kb of overlap for homologous recombination. The resulting plaques were screened under a phase contrast fluorescent microscope for non-fluorescing isolates that represent the desired recombinants. This greatly simplified screening in comparison to the standard methods based on structure or transgene expression. Thus, this method may be readily adapted for use in generating other types of adenoviral, or non-adenoviral viral vectors.
A. Shuttle Plasmid To construct a plasmid shuttle vector for creation of recombinant C68 virus, the plasmid pSP72 (Promega, Madison, WI) was modified by digestion with Bgl II
followed by filling-in of the ends with Klenow enzyme (Boehringer Mannheim, Indianapolis, IN) and ligation with a synthetic 12 bp Pac I linker (New England Biolabs, Beverly, MA) to yield pSP72-Pac. A 456 bp Pac I/SnaB I fragment spanning map unit (m.u. or MU) 0-1.3 of the C68 genome was isolated from the pNEB-BamE plasmid containing BamHI E fragment of the C68 genome and cloned into Pac I and EcoR V treated pSP72-Pac to yield pSP-0-1.3. A minigene cassette consisting of the cytomegalovirus early promoter driving lacZ with a SV40 poly A signal was separated from pCMV(3 (Clontech, Palo Alto, CA) as a 4.5 kb EcoRI/SaII fragment and ligated to pSP-C68-MU 0-1.3 restricted with the same set of enzymes, resulting in pSP-C68-MU 0-1.3-CMVLacZ.
For the initial step in the isolation of the 9-16.7 MU region of C68, both pGEM-3Z (Promega, Madison, MI) and pBS-C68-BamF were double-digested with BamHI
and Sph I enzymes. Then the 293 bp fragment from pBS-C68-BamF was ligated with pGEM-3Z backbone to form pGEM-C68-MU 9-9.8. A 2.4 kb fragment including the C68 MU
9.8-16.7 was obtained from the pBS-C68 BamHB clone after Xbal digestion, filling in reaction and subsequent BamHI treatment and cloned into BamHI/Smal double digested pGEM-MU 9-9.8 to generate pGEM-C68-MU 9-16.7. The C68 9-16.7 m.u. region was isolated from pGEM-C68-MU 9-16.7 by digestion with EcoRl, filling in of the ends with Klenow enzyme (Boehringer Mannheim, Indianapolis, IN), ligation of a synthetic 12 bp HindI1l linker (NEB) and then digestion with HindIII. This 2.7 kb fragment spanning the C68 MU 9-16.7 was cloned into the HindIII site of pSP-C68-MU 0- 1.3-CMVIacZ to form the final shuttle plasmid pC68-CMV-LacZ. In addition, an 820 bp alkaline phosphatase (AP) cDNA fragment was isolated from pAdCMVALP (K. J. Fisher, et al., J. Virol., 70:520-532 (1996)) and exchanged for lacZ at Not I sites of pC68-CMV-lacZ, resulting in pC68-CMV-AP.
B. Construction of Recombinant Virus To create the E1-deleted recombinant C68-CMVEGFP vector, a pC68-CMV-EGFP shuttle plasmid was first constructed by replacing the lacZ transgene in pC68-CMV-lacZ with the enhanced green fluorescent protein (EGFP) gene. The replacement cloning process was carried out as the follows. An additional Notl restriction site was introduced into the 5' end of the EGFP coding sequence in the pEGFP-1 (Clontech, Palo Alto, CA) by BamHI digestion, filling in reaction and ligation of a 8 bp synthetic NotI
linker (NEB). After NotI restriction of both constructs, the EGFP sequence was isolated from the modified pEGFP-l and used to replace the lacZ gene in the pC68-CMV-LacZ. The pC68-CMVEGFP
construct (3 pg) was co-transfected with Ssp I-digested C68 genomic DNA (1 jig) into 293 cells for homologous recombination as previously described (G. Gao, et al, J.
Virol, 70:8934-8943 (1996)). Green plaques visualized by fluorescent microscopy were isolated for 2 rounds of plaque purification, expansion and purification by CsCI gradient sedimentation (G. Gao, et al, cited above).
The invention provides a uniquely modified version of the green/white selection process (A. R. Davis, et al., Gene Thera., 5:1148-1152 (1998)). The present example illustrates use of this method for construction of recombinant C68 vectors. A 7.2 kb fragment spanning 9 to 36 MU was isolated from the pBSC68-BamB plasmid by treatment with Agel and Bsiwl restriction endonucleases and cloned into Asp718 and Agel sites of pC68-CMV-AP shuttle plasmid, resulting in a new plasmid called pC68CMV-AP-MU36. A
further modification was made to remove 26 to 36 m.u. from pC68CMV-AP-MU36 by Eco47111 and Nrul digestions. The new shuttle plasmid called pC68CMV-AP-MU26 has a shorter region for homologous recombination (i.e., 16.7-26 MU) 3' to the minigene. To make a recombinant C68 vector, alkaline phosphatase (AP) is replaced with the gene of interest. The resulting pC68CMV-Nugene-MU26 construct is co-transfected with Xba I (16.5 MU) restricted C68-CMVGFP viral DNA into 293 cells, followed by top agar overlay. The recombinant virus plaques (white) are generated through the homologous recombination in the region of 16.7-26 MU which is shared between pC68CMV-Nugene construct and viral backbone; the recombinants which form white plaques are selected from green plaques of uncut C68-CMVGFP virus.
The green/white selection mechanism was also introduced to the process of cloning of the gene of interest into the pC68 shuttle plasmid. The AP gene in both pC68CMV-AP-MU36 and pC68CMV-AP-MU26 was replaced with a cassette of prokaryotic GFP gene driven by the lacZ promoter isolated from pGFPMU31 (Clontech, Palo Alto, CA).
Thus, white colonies of bacterial transformants will contain the recombinant plasmid. This green/white selection process for bacterial colonies circumvented the need for making and characterizing large numbers of minipreped DNAs and so further enhanced the efficiency in creating recombinant C68 vectors.
Example 2 - Chimpanzee C68 Virus Stock and Replication Examples 3 through 5 which follow provide additional characterization of the chimpanzee C68. It will be appreciated by one of skill in the art that this information can be readily used in the construction of novel recombinant chimpanzee adenoviral constructs.
The C68 virus stock was obtained from ATCC (Rockville, MD) and propagated in 293 cells (ATCC) cultured in DMEM (Sigma, St. Louis, MO) supplemented with 10%
fetal calf serum (FCS; Sigma or Hyclone, Logan, UT) and 1% Penicillin-Streptomycin (Sigma).
Infection of 293 cells was carried out in DMEM supplemented with 2% FCS for the first 24 hours, after which FCS was added to bring the final concentration to 10%.
Infected cells were harvested when 100% of the cells exhibited virus-induced cytopathic effect (CPE), collected, and concentrated by centrifugation. Cell pellets were resuspended in 10mM Tris (pH 8.0), and lysed by 3 cycles of freezing and thawing. Virus preparations were obtained following 2 ultra centrifuge steps on cesium chloride density gradients and stocks of virus were diluted to 1 x 1012 particles/ml in 10mM Tris/I OOmM NaC1/50% glycerol and stored at -70 C.
Example 3 - Cloning and sequencing of viral genomic DNA
Genomic DNA was isolated from the purified virus preparation following standard methods and digested with a panel of 16 restriction enzymes following the manufacturer's recommendations. Except as noted, all restriction and modifying enzymes were obtained from Boehringer Mannheim, Indianapolis, IN. Genomic DNA was digested with BamHI, PstI, Sall, HindIII or XbaI and the fragments were subcloned into plasmids (K.
L. Berkner and P.A. Sharp, Nuci. Acids Res., 11:6003-20 (1983)). After deproteination, synthetic 10bp PacI linkers (New England Biolabs, Beverly, MA) were double digested with PacI
and BamHI, or Pstl.
The PstI, BamHI and HindIII clones generated from C68 are illustrated in Figure 1, parts C, D and E, respectively. The fragments indicated by the shaded boxes were not cloned, but the sequence of the entire genome has been determined through sequencing overlapping clones and viral DNA directly (unshaded boxes). The cloned fragments and insert sizes are described in Table 1. In the following table, pBS = pBluescript SK+ clone;
pNEB = pNEB
193 clone; pBR = pBR322 clone; No prefix = fragment not cloned Table 1. C68 plasmid clones and insert sizes Construct Name Insert Size Fragment Fragment 5' End 3' End (base 5' End 3' End Map Unit Map Unit pairs) Pst-I Fragments C68-Pst-A 6768 24784 31551 67.9% 86.4%
pBS:C68-Pst-B 6713 4838 11550 13.2% 31.6%
pBS:C68-Pst-C 5228 14811 20038 40.6% 54.9%
pBS:C68-Pst-D 2739 12072 14810 33.1% 40.6%
pBS:C68-Pst-E 2647 20039 22685 54.9% 32.1%
pBS:C68-Pst-F 1951 32046 33996 87.8% 93.1%
pNEB:C68-Pst-G 1874 1 1874 0.0% 5.1%
pBS:C68-Pst-H 1690 23094 24783 63.2% 67.9%
pBS:C68-Pst-I 1343 33997 35339 93.1% 96.8%
pNEB:C68-Pst-J 1180 35340 36519 96.8% 100.0%
pBS:C68-Pst-K 1111 2763 3873 7.6% 10.6%
pBS:C68-Pst-L 964 3874 4837 10.6% 13.2%
pBS:C68-Pst-M 888 1875 2762 5.1% 7.6%
pBS:C68-Pst-N 408 22686 23093 62.1% 63.2%
C68-Pst-O 380 31666 32045 86.7% 87.7%
pBS:C68-Pst-P 285 11551 11835 31.6% 32.4%
C68-Pst-Q 236 11836 12071 32.4% 33.1%
pBS:C68-Pst-R 114 31552 31665 86.4% 86.7%
BamHI Fragments C68-Bam-A 16684 19836 36519 54.3% 100.0%
pBS:C68-Bam-B 8858 3582 12439 9.8% 34.1%
pBS:C68-Bam-C 4410 12440 16849 34.1% 46.1%
pBS:C68-Bam-D 2986 16850 19835 46.1% 54.3%
pNEB:C68-Bam-E 2041 1 2041 0.0% 5.6%
pBS:C68-Bam-F 1540 2042 3581 5.6% 9.8%
Hindlll Fragments pBR:C68-Hind-B 9150 23471 32620 64.3% 89.3%
Chimpanzee adenovirus, C68, was obtained from ATCC and propagated in human 293 cells. Viral genomic DNA was isolated from purified virions using established procedures (A. R. Davis, et al., Gene Thera., 5:1148-1152 (1998)) and digested with a panel of restriction enzymes; the data were consistent with previous studies (data not shown) (G. R.
Kitchingman, Gene, 20:205-210 (1982); Q. Li and G. Wadell, Arch Virol. 101:65-77 (1998);
R. Wigand, et al., Intervirology. 30:1-9 (1989)). Restriction fragments spanning the entire genome of C68 were subcloned into plasmids. A schematic drawing of the C68 genome is shown in Figure IA, and the Pst-I, BamHI and HindIll fragments that were cloned into plasmid vectors are indicated by the unshaded boxes, in Figs. 1 B, 1 C, and 1 D, respectively.
The cloned fragments, fragment sizes and genomic position are also listed in Table 1. Both plasmid clones and genomic DNA were used as templates for sequencing. The genome was sequenced by primer walking in both directions and each base was included in an average of approximately four reactions.
The C68 genome is 36521 bp in length [see, US Patent 6,083,716]. Preliminary comparison with GenBank sequences indicated varying degrees of similarity with other human and animal adenoviruses along the entire length of the viral genome.
Regions with homology to all of the previously described adenoviral genetic units, early regions 1-4 and the major late genes, were found in the C68 genome (Fig. IA). DNA homology between and the human adenoviruses that have been completely sequenced, Ad2 (NC001405), Ad5 (N0001405), Ad12 (N0001460), Ad17 (N0002067) and Ad40 (NCO1464), was used to order the clones. The open reading frames (ORF) were determined and the genes were identified based on homology to other human adenoviruses. All of the major adenoviral early and late genes are present in C68. The inverted terminal repeats (ITR=s) are 130 bp in length.
Example 4 - Analysis of C68 sequence The complete nucleotide sequence of every member of the Mastadenovirus genus accessible from GenBank, including isolates from different species, were screened for identity to C68. The Ad4 minigenome was assembled from the following GenBank sequences: Left-hand ITR (JO 1964); E 1 A region (M 14918); DNA pol and pTP
(X74508, 74672); VA RNA-I, II (U10682); 52, 55K (U52535); pVII (U70921); hexon (X84646);
endoprotease (M16692); DNA-binding protein (M12407); fiber (X76547); right-hand ITR
(JO1965). The Adz composite genome was created from the following sequence data: Mu 3-21 (X03000); VA RNA-1, II, pTP & 52, 55K (U52574); penton (AD001675); pVI, hexon and endoprotease (AF065065); DNA-binding protein (K02530); E3 and fiber region (AF104384);
right-hand ITR (V00037).
The amino acid sequence alignment was generated with Clustal X, edited with Jalview and analyzed with Boxshade. Publicly available hexon protein sequences from all human adenovirus serotypes were initially aligned to identify the set showing the highest homology to C68.
The nucleotide sequence and predicted amino acid sequences of all significant open reading frames in the C68 genome were compared to known DNA and protein sequences.
The nucleotide sequence of C68 was compared to sequences of Ad 2, 4, 5, 7, 12, 17 and 40.
In agreement with previous restriction analysis (Kitchingman, cited above) C68 is most similar to human Ad4 (subgroup E).
The EIA region of C68 extends from the TATA box at nt 480 to the poly A
addition site at 1521. The consensus splice donor and acceptor sites are in the analogous position of the human Ad counterparts, and the 28.2K and 24.8K proteins are similar in size to the human Ad proteins. The ORF for the smallest EIA protein of C68 is predicted to encode 101 residues as opposed to approximately 60 amino acids for other adenoviruses.
There is a TTA
codon at residue 60 for C68 where other adenoviruses often have a TGA stop codon. The first 60 residues of C68 ElA I00R protein have 85% identity to the Ad4 homolog.
The C68 genome encodes genes for the four E I B proteins, 20.5K, 54.7K, 10.1 K
and 18.5K as well as pIX. All five C68 encoded proteins are similar in size to that of other Ad E I B and pIX proteins. The Ad4 homolog of the E 1 B 21 K protein has only 142 amino acids, where C68 has 186 residues and other human adenoviruses have 163-178 residues.
The C68 and Ad4 proteins share 95% identity over the first 134 aa, then the similarity ends and the Ad4 protein terminates at 142 amino acids.
The C68 genome encodes homologs of the E2A 55K DNA binding protein and the Iva2 maturation protein, as well as the E2B terminal protein and the DNA
polymerase. All of the E2 region proteins are similar in size to their human Ad counterparts, and the E2B
proteins are particularly well conserved. The C68 E2B 123.6K DNA polymerase is predicted to be 1124 residues, while Ad4 is predicted to have 1193 although the other human adenoviruses have smaller polymerases. Residues 1-71 of the Ad4 polymerase have no similarity to any other Ad polymerase, and it is possible that this protein actually initiates at an internal ATG codon. From amino acids 72-1193, Ad4 and C68 polymerases have 96%
amino acid identity.
The E3 regions of human adenoviruses sequenced so far exhibit considerable sequence and coding capacity variability. Ad40 has five E3 region genes, Ad12 has six, C68 and Ad5 have seven, Ad38 has eight and Ad3 as well as Adz (subgroup B human adenoviruses) have nine putative E3 region genes. The Ad4 E3 region has not yet been sequenced. In comparison with the E3 region of Ad35, all 7 E3 gene homologs were identified in the C68 genome (C. F. Basler and M.S. Horwitz, Virology, 215:
(1996)).
The C68 E4 region has 6 ORFs and each is homologous to proteins in the human Ad5, 12 and 40 E4 region. The E4 nomenclature is confusing because the ORF2 homologs of C68, Ad12 and Ad40 are approximately 130 residues, while in Ad5 there are two ORFs encoding proteins of 64 and 67 residues with homology, respectively, to the amino and carboxy terminal ends of the larger ORF2 proteins. ORF5 has been omitted in our nomenclature because the 5th ORF in the E4 region is homologous to the widely studied ORF6 protein of human Ad5.
The major late promoter and the tri-partite leader sequences of the C68 genome were located. ORFs with the potential to encode the 15 major late proteins were located. All of the C68 late proteins are similar in size to their human Ad counterparts. The percent amino acid identity between chimpanzee and human Ad late proteins varies considerably. The C68 fiber protein is predicted to have 90% amino acid identity with the Ad4 protein, but much less similarity to the other human Ad fiber proteins. The CAR binding site in the fiber knob is present in C68.
Example 5 - Virus neutralizing antibody assays Several studies were performed to determine if there is cross-reactivity between type specific antisera of C68 and human adenovirus. The neutralizing activity of sera was tested as follows. Panels of sera from normal human subjects (N=50), rhesus monkeys (N=52) and chimpanzees (N=20) were evaluated for neutralizing antibodies against Ad5 and C68 based vectors using 293 cells as an indicator cell line. Sera collected from individual humans, rhesus monkeys, or chimpanzees were inactivated at 56 C for 30 minutes. A
serial dilution of each sample (1:10, 1:20, 1:40, 1:80, 1:160, 1:320 in 100p1 of DMEM
containing 10%
FCS) was added to equal amounts of H5.010CMVEGFP (1000 PFU/well) or C68CMVEGFP
virus and incubated at 4 C for two hrs. One hundred and fifty microliters of the mixture were transferred onto 2 x 10 293 cells in 96 well flat bottom plates. Control wells were infected with equal amounts of virus (without addition of serum). Samples were incubated at 37 C in 5% CO2 for 48 hrs and examined under a fluorescent microscope. Sample dilutions that showed >50% reduction of green-fluorescent foci as compared to infected controls were scored positive for neutralizing antibodies.
As expected, approximately 35% of normal human subjects demonstrated neutralizing antibody against Ad5, a frequency much higher than observed in sera of rhesus monkeys and chimpanzee. Neutralizing antibody to C68 was observed in 80% of chimpanzee and only 2%
of normal human subjects or rhesus monkeys. Titers of neutralizing antibodies in the non-target species were generally low.
To further evaluate cross-reactivity of C68 with human adenovirus vectors, mice were immunized with 2 x 107 plaque forming units (pfu) of Ad 2, 4, 5, 7 and 12 as well as C68.
Sera were harvested 2 weeks later and tested for antibodies that neutralized either Ad5 or C68 vectors. Neutralizing antibody to Ad5 vector was only detected in animals immunized with Ad5. Importantly, the only animals with neutralizing antibody to C68 vector were those immunized with C68 vector; none of the human serotypes tested, including Ad4, generated antibodies in mice that neutralized C68 in vitro.
Important to the utility of C68 vector in human trials is the absence of neutralizing antibody in the human population. In our study, a screen of 50 normal human subjects failed to detect any significant neutralizing antibodies (>1:10) using the same assay that showed neutralizing antibodies in >50% of chimpanzees. Furthermore, sera of mice immunized with multiple human Ad serotypes including Ad4, did not neutralize infection with C68.
Example 6 - Structural analysis of hexon proteins The absence of neutralizing antibodies between C68 and human serotypes compelled us to more carefully evaluate structural differences in the regions of hexon presumed to harbor type specific epitopes. Previous studies have suggested that these epitopes are located within the 7 hypervariable regions of hexon determined by Crawford-Miksza and Schnurr (J
Virol, 70:1836-1844 (1996)). A comparison of the amino acid sequences of hexon proteins between C68 and several human adenoviruses is shown in Figure 3. Indeed, C68 is substantially dissimilar in significant regions of these hypervariable sequences.
Example 7 - Construction of C68-Derived Capsid Containing a Human Fiber Gene To generate a C68-derived vector with an altered tropism, the chimeric fiber gene construct containing the Ad5 fiber knob fused to the C68 tail and shaft is incorporated into a plasmid carrying the C68 genome. For the precise replacement of the wild-type C68 fiber gene, a plasmid carrying the green fluorescent protein driven by a CMV
promoter is used for modification of C68 fiber. The resulting transfer vector contains a CMV
promoter driven green fluorescent protein (GFP) expression cassette inserted into the E3 region, the chimeric C68/Ad5 fiber gene, and E4. This transfer vector was used for incorporation of GFP cassette and modified fiber gene into the backbone of an E3 deleted C68 infectious plasmid via homologous recombination in E. coli. The viral genome was released from the plasmid by PacI digestion and used to transfect 293 cells. The chimeric C68-derived virus is produced about 3 weeks following transfection using techniques described herein.
Similar techniques can be readily utilized to construct other C68-derived capsids.
While the invention has been described with reference to a particularly preferred embodiment, it will be appreciated that modifications can be made without departing from the spirit of the invention. Such modifications are intended to fall within the scope of the appended claims.
SEQUENCE LISTING
<110> The Trustees of the University of Pennsylvania <120> Method for Rapid Screening of Bacterial Transformants and Novel Simian Adenovirus Proteins <130> 08899274CA
<140>
<141> 2002-06-20 <150> US 60/300,501 <151> 2001-06-22 <150> US 60/385,632 <151> 2002-06-04 <160> 41 <170> Patentln version 3.1 <210> 1 <211> 101 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 1 Met Arg His Leu Arg Asp Leu Pro Asp Glu Lys Ile Ile Ile Ala Ser Gly Asn Glu Ile Leu Glu Leu Val Val Pro Ser Leu Thr Gln Met Met Arg Pro Pro Leu Gln Ser Pro Leu Arg His Pro Gln Lys Leu Ala His Leu His Leu Arg Ile Leu Leu Asp Gln Phe Leu Leu Glu Pro Leu Gly Gly Glu Gln Leu Trp Asn Val Trp Met Thr Cys Tyr Arg Val Gly Leu Asn Leu Trp Thr Cys Val Pro Giy Asn Ala Pro Gly Thr Lys Cys His Thr Cys Val Phe Thr <210> 2 <211> 257 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 2 Met Arg His Leu Arg Asp Leu Pro Asp Glu Lys Ile Ile Ile Ala Ser Gly Asn Glu Ile Leu Glu Leu Val Val Asn Ala Met Met Gly Asp Asp Pro Pro Glu Pro Pro Thr Pro Phe Glu Thr Pro Ser Leu His Asp Leu Tyr Asp Leu Glu Val Asp Val Pro Glu Asp Asp Pro Asn Glu Glu Ala Val Asn Asp Phe Phe Ser Asp Ala Ala Leu Leu Ala Ala Glu Glu Ala Ser Ser Ser Ser Ser Asp Ser Asp Ser Ser Leu His Thr Pro Arg Pro Gly Arg Gly Glu Lys Lys Ile Pro Glu Leu Lys Gly Glu Glu Met Asp Leu Arg Cys Tyr Glu Glu Cys Leu Pro Pro Ser Asp Asp Glu Asp Glu Gln Ala Ile Gln Asn Ala Ala Ser Gln Gly Val Gln Ala Ala Ser Glu Ser Phe Ala Leu Asp Cys Pro Pro Leu Pro Gly His Gly Cys Lys Ser Cys Glu Phe His Arg Met Asn Thr Gly Asp Lys Ala Val Leu Cys Ala Leu Cys Tyr Met Arg Ala Tyr Asn His Cys Val Tyr Ser Pro Val Ser Asp Ala Asp Asp Glu Thr Pro Thr Thr Lys Ser Thr Ser Ser Pro Pro Glu Ile Gly Thr Ser Pro Pro Glu Asn Ile Val Arg Pro Val Pro Val Arg Ala Thr Gly Arg Arg Ala Ala Val Glu Cys Leu Asp Asp Leu Leu Gln Gly Gly Val Glu Pro Leu Asp Leu Cys Thr Arg Lys Arg Pro Arg His <210> 3 <211> 226 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 3 Met Arg His Leu Arg Asp Leu Pro Asp Glu Lys Ile Ile Ile Ala Ser Gly Asn Glu Ile Leu Glu Leu Val Val Asn Ala Met Met Gly Asp Asp Pro Pro Glu Pro Pro Thr Pro Phe Glu Thr Pro Ser Leu His Asp Leu Tyr Asp Leu Glu Val Asp Val Pro Glu Asp Asp Pro Asn Glu Glu Ala Val Asn Asp Phe Phe Ser Asp Ala Ala Leu Leu Ala Ala Glu Glu Ala Ser Ser Ser Ser Ser Asp Ser Asp Ser Ser Leu His Thr Pro Arg Pro Gly Arg Gly Glu Lys Lys Ile Pro Glu Leu Lys Gly Glu Glu Met Asp Leu Arg Cys Tyr Glu Glu Cys Leu Pro Pro Ser Asp Asp Giu Asp Glu Gln Ala Ile Gln Asn Ala Ala Ser Gln Gly Val Gln Ala Ala Ser Glu Ser Phe Ala Leu Asp Cys Pro Pro Leu Pro Gly His Gly Cys Pro Val Ser Asp Ala Asp Asp Glu Thr Pro Thr Thr Lys Ser Thr Ser Ser Pro Pro Glu Ile Gly Thr Ser Pro Pro Glu Asn Ile Val Arg Pro Val Pro Val Arg Ala Thr Gly Arg Arg Ala Ala Val Glu Cys Leu Asp Asp Leu Leu Gln Gly Gly Val Glu Pro Leu Asp Leu Cys Thr Arg Lys Arg Pro Arg His <210> 4 <211> 186 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 4 Met Glu Ile Trp Thr Val Leu Glu Asp Phe His Lys Thr Arg Gln Leu Leu Glu Asn Ala Ser Asn Gly Val Ser Tyr Leu Trp Arg Phe Cys Phe Gly Gly Asp Leu Ala Arg Leu Val Tyr Arg Ala Lys Gin Asp Tyr Ser Glu Gln Phe Glu Val Ile Leu Arg Glu Cys Ser Gly Leu Phe Asp Ala Leu Asn Leu Gly His Gln Ser His Phe Asn Gln Arg Ile Ser Arg Ala Leu Asp Phe Thr Thr Pro Gly Arg Thr Thr Ala Ala Val Ala Phe Phe Ala Phe Ile Leu Asp Lys Trp Ser Gln Glu Thr His Phe Ser Arg Asp Tyr Gln Leu Asp Phe Leu Ala Val Ala Leu Trp Arg Thr Trp Lys Cys Gln Arg Leu Asn Ala Ile Ser Gly Tyr Leu Pro Val Gln Pro Leu Asp Thr Leu Arg Ile Leu Asn Leu Gln Glu Ser Pro Arg Ala Arg Gln Arg Arg Gln Gln Gln Gln Gln Glu Glu Asp Gln Glu Glu Asn Pro Arg Ala Gly Leu Asp Pro Pro Ala Glu Glu Glu Glu <210> 5 <211> 498 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 5 Met Glu Ser Arg Asn Pro Phe Gln Gln Gly Leu Pro Ala Gly Phe Leu Ser Ser Ser Phe Val Glu Asn Met Glu Val Pro Ala Pro Glu Cys Asn Leu Arg Leu Leu Ala Gly Thr Ala Ala Arg His Ser Glu Asp Pro Glu Ser Pro Gly Glu Ser Gln Gly Thr Pro Thr Ser Pro Ala Ala Ala Ala Gly Gly Gly Ser Arg Arg Glu Pro Glu Ser Arg Pro Gly Pro Ser Gly Gly Gly Gly Gly Val Ala Asp Leu Phe Pro Glu Leu Arg Arg Val Leu Thr Arg Ser Ser Ser Gly Arg Glu Arg Gly Ile Lys Arg Glu Arg His Asp Glu Thr Asn His Arg Thr Glu Leu Thr Val Gly Leu Met Ser Arg Lys Arg Pro Glu Thr Val Trp Trp His Glu Val Gln Ser Thr Gly Thr Asp Glu Val Ser Val Met His Glu Arg Phe Ser Leu Glu Gin Val Lys Thr Cys Trp Leu Glu Pro Glu Asp Asp Trp Glu Val Ala Ile Arg Asn Tyr Ala Lys Leu Ala Leu Arg Pro Asp Lys Lys Tyr Lys Ile Thr Lys Leu Ile Asn Ile Arg Asn Ala Cys Tyr Ile Ser Gly Asn Gly Ala Glu Val Glu Ile Cys Leu Gln Glu Arg Val Ala Phe Arg Cys Cys Met Met Asn Met Tyr Pro Gly Val Val Gly Met Asp Gly Val Thr Phe Met Asn Met Arg Phe Arg Gly Asp Gly Tyr Asn Gly Thr Val Phe Met Ala Asn Thr Lys Leu Thr Val His Gly Cys Ser Phe Phe Gly Phe Asn Asn Thr Cys Ile Glu Ala Trp Gly Gln Val Gly Val Arg Gly Cys Ser Phe Ser Ala Asn Trp Met Gly Val Val Gly Arg Thr Lys Ser Met Leu Ser Val Lys Lys Cys Leu Phe Glu Arg Cys His Leu Gly Val Met Ser Glu Gly Glu Ala Arg Ile Arg His Cys Ala Ser Thr Glu Thr Gly Cys Phe Val Leu Cys Lys Giy Asn Ala Lys Ile Lys His Asn Met Ile Cys Gly Ala Ser Asp Glu Arg Gly Tyr Gln Met Leu Thr Cys Ala Gly Gly Asn Ser His Met Leu Ala Thr Val His Val Ala Ser His Ala Arg Lys Pro Trp Pro Glu Phe Glu His Asn Val Met Thr Arg Cys Asn Met His Leu Gly Ser Arg Arg Gly Met Phe Met Pro Tyr Gln Cys Asn Leu Asn Tyr Val Lys Val Leu Leu Glu Pro Asp Ala Met Ser Arg Val Ser Leu Thr Gly Val Phe Asp Met Asn Val Glu Val Trp Lys Ile Leu Arg Tyr Asp Glu Ser Lys Thr Arg Cys Arg Ala Cys Glu Cys Gly Gly Lys His Ala Arg Phe Gln Pro Val Cys Val Asp Val Thr Glu Asp Leu Arg Pro Asp His Leu Val Leu Pro Cys Thr Gly Thr Glu Phe Gly Ser Ser Gly Glu Glu Ser Asp <210> 6 <211> 169 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 6 Met Glu Ser Arg Asn Pro Phe Gin Gln Gly Leu Pro Ala Gly Phe Leu Ser Ser Ser Phe Val Glu Asn Met Glu Val Pro Ala Pro Glu Cys Asn Leu Arg Leu Leu Ala Gly Thr Ala Ala Arg His Ser Glu Asp Pro Glu Ser Pro Gly Glu Ser Gln Gly Thr Pro Thr Ser Pro Ala Ala Ala Ala Gly Gly Gly Ser Arg Arg Glu Pro Glu Ser Arg Pro Gly Pro Ser Gly Gly Gly Gly Gly Val Ala Asp Leu Phe Pro Glu Leu Arg Arg Val Leu Thr Arg Val Ser Leu Thr Gly Val Phe Asp Met Asn Val Glu Val Trp Lys Ile Leu Arg Tyr Asp Glu Ser Lys Thr Arg Cys Arg Ala Cys Glu Cys Gly Gly Lys His Ala Arg Phe Gln Pro Val Cys Val Asp Val Thr Glu Asp Leu Arg Pro Asp His Leu Val Leu Pro Cys Thr Gly Thr Glu Phe Gly Ser Ser Gly Glu Glu Ser Asp <210> 7 <211> 93 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 7 Met Glu Ser Arg Asn Pro Phe Gln Gln Gly Leu Pro Ala Gly Phe Leu Ser Ser Ser Phe Val Glu Asn Met Glu Val Pro Ala Pro Glu Cys Asn Leu Arg Leu Leu Ala Gly Thr Ala Ala Arg His Ser Glu Asp Pro Glu Ser Pro Gly Glu Ser Gln Gly Thr Pro Thr Ser Pro Ala Ala Ala Ala Gly Gly Gly Ser Arg Arg Glu Pro Glu Ser Arg Pro Gly Pro Ser Gly Gly Gly Gly Gly Val Ala Asp Leu Pro Cys Val Trp Met <210> 8 <211> 142 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 8 Met Ser Gly Ser Gly Ser Phe Glu Gly Gly Val Phe Ser Pro Tyr Leu Thr Gly Arg Leu Pro Ser Trp Ala Gly Val Arg Gln Asn Val Met Gly Ser Thr Val Asp Gly Arg Pro Val Gln Pro Ala Asn Ser Ser Thr Leu Thr Tyr Ala Thr Leu Ser Ser Ser Ser Leu Asp Ala Ala Ala Ala Ala Ala Ala Ala Ser Ala Ala Ser Ala Val Arg Gly Met Ala Met Gly Ala Gly Tyr Tyr Gly Thr Leu Val Ala Asn Ser Ser Ser Thr Asn Asn Pro Ala Ser Leu Asn Glu Glu Lys Leu Leu Leu Leu Met Ala Gln Leu Glu Ala Leu Thr Gln Arg Leu Gly Glu Leu Thr Gln Gln Val Ala Gln Leu Gln Glu Gln Thr Arg Ala Ala Val Ala Thr Val Lys Ser Lys <210> 9 <211> 448 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 9 Met Glu Thr Lys Gly Arg Arg Ser Gly Ala Val Phe Asp Gln Pro Asp Glu Pro Glu Ala His Pro Arg Lys Arg Pro Ala Arg Arg Ala Pro Leu His Arg Asp Gly Asp His Pro Asp Ala Asp Ala Ala Thr Leu Glu Gly Pro Asp Pro Gly Cys Ala Gly Arg Pro Ser Ser Gly Ala Ile Leu Pro Gln Pro Ser Gln Pro Ala Lys Arg Gly Gly Leu Leu Asp Arg Asp Ala Val Glu His Ile Thr Glu Leu Trp Asp Arg Leu Glu Leu Leu Gln Gln Thr Leu Ser Lys Met Pro Met Ala Asp Gly Leu Lys Pro Leu Lys Asn Phe Ala Ser Leu Gln Glu Leu Leu Ser Leu Gly Gly Glu Arg Leu Leu Ala Glu Leu Val Arg Glu Asn Met His Val Arg Glu Met Met Asn Glu Val Ala Pro Leu Leu Arg Glu Asp Gly Ser Cys Leu Ser Leu Asn Tyr His Leu Gln Pro Val Ile Gly Val Ile Tyr Gly Pro Thr Gly Cys Gly Lys Ser Gln Leu Leu Arg Asn Leu Leu Ser Ala Gln Leu Ile Ser Pro Ala Pro Glu Thr Val Phe Phe Ile Ala Pro Gln Val Asp Met Ile Pro Pro Ser Glu Leu Lys Ala Trp Glu Met Gln Ile Cys Glu Gly Asn Tyr Ala Pro Gly Ile Glu Gly Thr Phe Val Pro Gln Ser Gly Thr Leu Arg Pro Lys Phe Ile Lys Met Ala Tyr Asp Asp Leu Thr Gln Asp His Asn Tyr Asp Val Ser Asp Pro Arg Asn Val Phe Ala Gln Ala Ala Ala His Gly Pro Ile Ala Ile Ile Met Asp Glu Cys Met Glu Asn Leu Gly Gly His Lys Gly Val Ala Lys Phe Phe His Ala Phe Pro Ser Lys Leu His Asp Lys Phe Pro Lys Cys Thr Gly Tyr Thr Val Leu Val Val Leu His Asn Met Asn Pro Arg Arg Asp Leu Gly Gly Asn Ile Ala Asn Leu Lys Ile Gln Ala Lys Met His Leu Ile Ser Pro Arg Met His Pro Ser Gln Leu Asn Arg Phe Val Asn Thr Tyr Thr Lys Gly Leu Pro Val Ala Ile Ser Leu Leu Leu Lys Asp Ile Val Gln His His Ala Leu Arg Pro Cys Tyr Asp Trp Val Ile Tyr Asn Thr Thr Pro Glu His Glu Ala Leu Gln Trp Ser Tyr Leu His Pro Arg Asp Gly Leu Met Pro Met Tyr Leu Asn Ile Gln Ala His Leu Tyr Arg Val Leu Glu Lys Ile His Arg Val Leu Asn Asp Arg Asp Arg Trp Ser Arg Ala Tyr Arg Ala Arg Lys Ile Lys <210> 10 <211> 200 <212> PRT
<213> chimpanzee C68 adenovirus protein <220>
<221> MISC FEATURE
<222> (137)..(137) <223> xaa can be any amino acid <220>
<221> MISC FEATURE
<222> (155)..(155) <223> xaa can be any amino acid <400> 10 Met Arg Ala Asp Gly Glu Glu Leu Asp Leu Leu Pro Pro Ile Gly Gly Met Ala Val Asp Val Met Glu Val Glu Met Pro Thr Ala Arg Arg Thr Leu Val Leu Val Phe Ile Gln Ala Ala Thr Val Leu Ala Thr Leu His Gly Met His Val Leu His Glu Leu Tyr Leu Ser Ser Phe Asp Glu Glu Phe Gln Trp Glu Val Glu Ser Trp Arg Leu His Leu Val Leu Tyr Tyr Val Val Val Val Gly Leu Ala Leu Phe Cys Leu Asp Gly Gly His Ala Asp Glu Pro Ala Arg Glu Ala Gly Pro Asp Leu Gly Ala Ser Gly Ser Glu Ser Glu Asp Glu Gly Ala Gln Ala Gly Ala Val Gln Gly Pro Glu Thr Leu Arg Ser Gin Val Ser Gly Xaa Arg Arg Arg Ala Val Asp Leu Gln Glu Phe Phe Gln Gly Ala Arg Glu Val Xaa Met Val Leu Asp Leu His Arg Ala Ile Gly Gly Glu Leu His Gly Leu Gln Gly Pro Val Pro Leu Gly Cys Asp His Arg Pro Pro Phe Leu Leu Gly Arg Leu Gly Arg Arg Gly Arg Cys Leu Phe His Gly <210> 11 <211> 391 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 11 Met His Pro Val Leu Arg Gln Met Arg Pro His His Pro Pro Pro Gln Gln Gln Pro Pro Pro Gln Pro Ala Leu Leu Pro Pro Pro Gln Gln Gln Leu Pro Ala Thr Thr Ala Ala Ala Ala Val Ser Gly Ala Gly Gin Ser Tyr Asp His Gln Leu Ala Leu Glu Glu Gly Glu Gly Leu Ala Arg Leu Gly Ala Ser Ser Pro Glu Arg His Pro Arg Val Gln Met Lys Arg Asp Ala Arg Glu Ala Tyr Val Pro Lys Gln Asn Leu Phe Arg Asp Arg Ser Gly Glu Glu Pro Glu Glu Met Arg Ala Ala Arg Phe His Ala Gly Arg Glu Leu Arg Arg Gly Leu Asp Arg Lys Arg Val Leu Arg Asp Glu Asp Phe Glu Ala Asp Glu Leu Thr Gly Ile Ser Pro Ala Arg Ala His Val Ala Ala Ala Asn Leu Val Thr Ala Tyr Glu Gln Thr Val Lys Glu Glu Ser Asn Phe Gln Lys Ser Phe Asn Asn His Val Arg Thr Leu Ile Ala Arg Glu Glu Val Thr Leu Gly Leu Met His Leu Trp Asp Leu Leu Glu Ala Ile Val Gln Asn Pro Thr Ser Lys Pro Leu Thr Ala Gln Leu Phe Leu Val Val Gln His Ser Arg Asp Asn Glu Ala Phe Arg Glu Ala Leu Leu Asn Ile Thr Glu Pro Glu Gly Arg Trp Leu Leu Asp Leu Val Asn Ile Leu Gln Ser Ile Val Val Gln Glu Arg Gly Leu Pro Leu Ser Glu Lys Leu Ala Ala Ile Asn Phe Ser Val Leu Ser Leu Gly Lys Tyr Tyr Ala Arg Lys Ile Tyr Lys Thr Pro Tyr Val Pro Ile Asp Lys Glu Val Lys Ile Asp Gly Phe Tyr Met Arg Met Thr Leu Lys Val Leu Thr Leu Ser Asp Asp Leu Gly Val Tyr Arg Asn Asp Arg Met His Arg Ala Val Ser Ala Ser Arg Arg Arg Glu Leu Ser Asp Gln Glu Leu Met His Ser Leu Gln Arg Ala Leu Thr Gly Ala Gly Thr Glu Gly Glu Ser Tyr Phe Asp Met Gly Ala Asp Leu His Trp Gln Pro Ser Arg Arg Ala Leu Glu Ala Ala Ala Gly Pro Tyr Val Glu Glu Val Asp Asp Glu Val Asp Giu Glu Gly Glu Tyr Leu Glu Asp <210> 12 <211> 534 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 12 Met Met Arg Arg Ala Tyr Pro Glu Gly Pro Pro Pro Ser Tyr Glu Ser Val Met Gln Gln Ala Met Ala Ala Ala Ala Met Gln Pro Pro Leu Glu Ala Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr Glu Gly Arg Asn Ser Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Arg Leu Tyr Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Ser Leu Asn Tyr Gln Asn Asp His Ser Asn Phe Leu Thr Thr Val Val Gln Asn Asn Asp Phe Thr Pro Thr Glu Ala Ser Thr Gln Thr Ile Asn Phe Asp Glu Arg Ser Arg Trp Gly Gly Gln Leu Lys Thr Ile Met His Thr Asn Met Pro Asn Val Asn Glu Phe Met Tyr Ser Asn Lys Phe Lys Ala Arg Val Met Val Ser Arg Lys Thr Pro Asn Gly Val Thr Val Thr Glu Asp Tyr Asp Gly Ser Gln Asp Glu Leu Lys Tyr Glu Trp Val Glu Phe Glu Leu Pro Glu Gly Asn Phe Ser Val Thr Met Thr Ile Asp Leu Met Asn Asn Ala Ile Ile Asp Asn Tyr Leu Ala Val Gly Arg Gln Asn Gly Val Leu Glu Ser Asp Ile Gly Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Trp Asp Pro Val Thr Glu Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe His Pro Asp Ile Val Leu Leu Pro Gly Cys Gly Val Asp Phe Thr Glu Ser Arg Leu Ser Asn Leu Leu Gly Ile Arg Lys Arg Gln Pro Phe Gln Glu Gly Phe Gln Ile Met Tyr Glu Asp Leu Glu Gly Gly Asn Ile Pro Ala Leu Leu Asp Val Asp Ala Tyr Glu Lys Ser Lys Glu Asp Ala Ala Ala Glu Ala Thr Ala Ala Val Ala Thr Ala Ser Thr Glu Val Arg Gly Asp Asn Phe Ala Ser Ala Ala Ala Val Ala Ala Ala Glu Ala Ala Glu Thr Glu Ser Lys Ile Val Ile Gln Pro Val Glu Lys Asp Ser Lys Asn Arg Ser Tyr Asn Val Leu Pro Asp Lys Ile Asn Thr Ala Tyr Arg Ser Trp Tyr Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser Trp Thr Leu Leu Thr Thr Ser Asp Val Thr Cys Gly Val Glu Gln Val Tyr Trp Ser Leu Pro Asp Met Met Gln Asp Pro Val Thr Phe Arg Ser Thr Arg Gln Val Ser Asn Tyr Pro Val Val Gly Ala Glu Leu Leu Pro Val Tyr Ser Lys Ser Phe Phe Asn Glu Gln Ala Val Tyr Ser Gln Gln Leu Arg Ala Phe Thr Ser Leu Thr His Val Phe Asn Arg Phe Pro Glu Asn Gln Ile Leu Val Arg Pro Pro Ala Pro Thr Ile Thr Thr Val Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg Ser Ser Ile Arg Gly Val Gln Arg Val Thr Val Thr Asp Ala Arg Arg Arg Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly Ile Val Ala Pro Arg Val Leu Ser Ser Arg Thr Phe <210> 13 <211> 201 <212> PRT
<213> chimpanzee C68 adenovirus protein <220>
<221> MISC FEATURE
<222> (70)_.(70) <223> Xaa can be any amino acid <400> 13 Met Ser Ile Leu Ile Ser Pro Ser Asn Asn Thr Gly Trp Gly Leu Arg Ala Pro Ser Lys Met Tyr Gly Gly Ala Arg Gln Arg Ser Thr Gln His Pro Val Arg Val Arg Gly His Phe Arg Ala Pro Trp Gly Ala Leu Lys Gly Arg Val Arg Ser Arg Thr Thr Val Asp Asp Val Ile Asp Gln Val Val Ala Asp Ala Arg Xaa Tyr Thr Pro Ala Ala Ala Pro Val Ser Thr Val Asp Ala Val Ile Asp Ser Val Val Ala Asp Ala Arg Arg Tyr Ala Arg Ala Lys Ser Arg Arg Arg Arg Ile Ala Arg Arg His Arg Ser Thr Pro Ala Met Arg Ala Ala Arg Ser Leu Val Ala Gln Gly Gln Ala His Gly Thr Gln Gly His Val Gln Gly Gly Gln Thr Arg Gly Phe Arg Arg Gln Arg Arg Gln Asp Pro Glu Thr Arg Gly His Gly Gly Gly Ser Gly His Arg Gln His Val Pro Pro Ala Ala Arg Glu Arg Val Leu Gly Ala Arg Arg Arg His Arg Cys Ala Arg Ala Arg Ala His Pro Pro Pro Ser His Leu Lys Met Phe Thr Ser Arg Cys <210> 14 <211> 356 <212> PRT
<213> chimpanzee C68 adenovirus protein <220>
<221> MISC FEATURE
<222> (111)..(111) <223> Xaa can be any amino acid <220>
<221> MISC FEATURE
<222> (183)..(183) <223> Xaa can be any amino acid <220>
<221> MISC_FEATURE
<222> (212)..(212) <223> Xaa can be any amino acid <220>
<221> MISC_FEATURE
<222> (220)..(220) <223> Xaa can be any amino acid <400> 14 Met Ser Lys Arg Lys Phe Lys Glu Glu Met Leu Gln Val Ile Ala Pro Glu Ile Tyr Gly Pro Ala Val Val Lys Glu Glu Arg Lys Pro Arg Lys Ile Lys Arg Val Lys Lys Asp Lys Lys Glu Glu Glu Ser Asp Val Asp Gly Leu Val Glu Phe Val Arg Glu Phe Ala Pro Arg Arg Arg Val Gln Trp Arg Gly Arg Lys Val Gln Pro Val Leu Arg Pro Gly Thr Thr Val Val Phe Thr Pro Gly Glu Arg Ser Gly Thr Ala Ser Lys Arg Ser Tyr Asp Glu Val Tyr Gly Asp Asp Asp Ile Leu Glu Gln Ala Ala Xaa Arg Leu Gly Glu Phe Ala Tyr Gly Lys Arg Ser Arg Ser Ala Pro Lys Glu Glu Ala Val Ser Ile Pro Leu Asp His Gly Asn Pro Thr Pro Ser Leu Lys Pro Val Thr Leu Gln Gln Val Leu Pro Thr Ala Ala Pro Arg Arg Gly Phe Lys Arg Glu Gly Glu Asp Leu Tyr Pro Thr Met Gln Leu Met Val Pro Lys Arg Gln Lys Xaa Glu Asp Val Leu Glu Thr Met Lys Val Asp Pro Asp Val Gln Pro Glu Val Lys Val Arg Pro Ile Lys Gln Val Ala Pro Gly Xaa Gly Val Gln Thr Val Asp Ile Xaa Ile Pro Thr Glu Pro Met Glu Thr Gln Thr Glu Pro Met Ile Lys Pro Ser Thr Ser Thr Met Glu Val Gln Thr Asp Pro Trp Met Pro Ser Ala Pro Ser Arg Arg Pro Arg Arg Lys Tyr Gly Ala Ala Ser Leu Leu Met Pro Asn Tyr Ala Leu His Pro Ser Ile Ile Pro Thr Pro Gly Tyr Arg Gly Thr Arg Phe Tyr Arg Gly His Thr Ser Ser Arg Arg Arg Lys Thr Thr Thr Arg Arg Ser Pro Ser Pro His Arg Arg Cys Asn His Pro Cys Arg Pro Gly Ala Glu Ser Val Pro Pro Arg Pro Arg Thr Ser Asp Pro Ala Ala Arg Ala Leu Pro Pro Glu His Arg His Leu Asn Phe Arg Gln Leu Cys Arg Ser Met Ala Leu Thr <210> 15 <211> 257 <212> PRT
<213> chimpanzee C68 adenovirus protein <220>
<221> MISC FEATURE
<222> (202)..(202) <223> Xaa can be any amino acid <220>
<221> MISC FEATURE
<222> (210)..(210) <223> Xaa can be any amino acid <220>
<221> MISC_FEATURE
<222> (256)..(256) <223> Xaa can be any amino acid <400> 15 Met Asp Ser Asp Ala Pro Gly Pro Val Met Cys Phe Arg Arg Gln Met Glu Asp Ile Asn Phe Ser Ser Leu Ala Pro Arg His Gly Thr Arg Pro Phe Met Gly Thr Trp Ser Asp Ile Gly Thr Ser Gln Leu Asn Gly Gly Ala Phe Asn Trp Ser Ser Leu Trp Ser Gly Leu Lys Asn Phe Gly Ser Thr Leu Lys Thr Tyr Gly Ser Lys Ala Trp Asn Ser Thr Thr Gly Gln Ala Leu Arg Asp Lys Leu Lys Glu Gln Asn Phe Gln Gln Lys Val Val Asp Gly Leu Ala Ser Gly Ile Asn Gly Val Val Asp Leu Ala Asn Gln Ala Val Gln Arg Gln Ile Asn Ser Arg Leu Asp Pro Val Pro Pro Ala Gly Ser Val Glu Met Pro Gln Val Glu Glu Glu Leu Pro Pro Leu Asp Lys Arg Gly Glu Lys Arg Pro Arg Pro Asp Ala Glu Glu Thr Leu Leu Thr His Thr Asp Glu Pro Pro Pro Tyr Glu Glu Ala Val Lys Leu Gly Leu Pro Thr Thr Arg Pro Ile Ala Pro Leu Ala Thr Gly Val Leu Lys Pro Glu Lys Pro Ala Thr Leu Asp Leu Xaa Pro Pro Gln Pro Ser Arg Pro Xaa Thr Val Ala Lys Pro Leu Pro Pro Val Ala Val Ala Arg Ala Arg Pro Gly Gly Thr Ala Arg Pro His Ala Asn Trp Gln Ser Thr Leu Asn Ser Ile Val Gly Leu Gly Val Gln Ser Val Lys Arg Arg Arg Xaa Tyr <210> 16 <211> 933 <212> PRT
<213> chimpanzee C68 adenovirus protein <220>
<221> MISC FEATURE
<222> (826)..(826) <223> Xaa can be any amino acid <220>
<221> MISC FEATURE
<222> (922)..(922) <223> Xaa can be any amino acid <400> 16 Met Ala Thr Pro Ser Met Leu Pro Gln Trp Ala Tyr Met His Ile Ala Gly Gln Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gln Phe Ala Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gln Arg Leu Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Asn Thr Tyr Ser Tyr Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro Ser Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly Ala Pro Asn Thr Cys Gln Trp Thr Tyr Lys Ala Asp Gly Glu Thr Ala Thr Glu Lys Thr Tyr Thr Tyr Gly Asn Ala Pro Val Gln Gly Ile Asn Ile Thr Lys Asp Gly Ile Gln Leu Gly Thr Asp Thr Asp Asp Gln Pro Ile Tyr Ala Asp Lys Thr Tyr Gln Pro Glu Pro Gln Val Gly Asp Ala Glu Trp His Asp Ile Thr Gly Thr Asp Glu Lys Tyr Gly Gly Arg Ala Leu Lys Pro Asp Thr Lys Met Lys Pro Cys Tyr Gly Ser Phe Ala Lys Pro Thr Asn Lys Glu Gly Gly Gln Ala Asn Val Lys Thr Gly Thr Gly Thr Thr Lys Glu Tyr Asp Ile Asp Met Ala Phe Phe Asp Asn Arg Ser Ala Ala Ala Ala Gly Leu Ala Pro Glu Ile Val Leu Tyr Thr Glu Asn Val Asp Leu Glu Thr Pro Asp Thr His Ile Val Tyr Lys Ala Gly Thr Asp Asp Ser Ser Ser Ser Ile Asn Leu Gly Gin Gln Ala Met Pro Asn Arg Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser Gln Leu Asn Ala Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser Tyr Gln Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asp Ala Val Gly Arg Thr Asp Thr Tyr Gln Gly Ile Lys Ala Asn Gly Thr Asp Gln Thr Thr Trp Thr Lys Asp Asp Ser Val Asn Asp Ala Asn Glu Ile Gly Lys Gly Asn Pro Phe Ala Met Glu Ile Asn Ile Gln Ala Asn Leu Trp Arg Asn Phe Leu Tyr Ala Asn Val Ala Leu Tyr Leu Pro Asp Ser Tyr Lys Tyr Thr Pro Ala Asn Val Thr Leu Pro Thr Asn Thr Asn Thr Tyr Asp Tyr Met Asn Gly Arg Val Val Ala Pro Ser Leu Val Asp Ser Tyr Ile Asn Ile Gly Ala Arg Trp Ser Leu Asp Pro Met Asp Asn Val Asn Pro Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe His Ile Gln Val Pro Gln Lys Phe Phe Ala Ile Lys Ser Leu Leu Leu Leu Pro Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg Lys Asp Val Asn Met Ile Leu Gln Ser Ser Leu Gly Asn Asp Leu Arg Thr Asp Gly Ala Ser Ile Ser Phe Thr Ser Ile Asn Leu Tyr Ala Thr Phe Phe Pro Met Ala His Asn Thr Ala Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn Asp Gln Ser Phe Asn Asp Tyr Leu Ser Ala Ala Asn Met Leu Tyr Pro Ile Pro Ala Asn Ala Thr Asn Val Pro Ile Ser Ile Pro Ser Arg Asn Trp Ala Ala Phe Arg Gly Trp Ser Phe Thr Arg Leu Lys Thr Lys Glu Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr Phe Val Tyr Ser Gly Ser Ile Pro Tyr Leu Asp Gly Thr Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser Ile Thr Phe Asp Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr Pro Asn Glu Phe Glu Ile Lys Arg Thr Val Asp Gly Glu Gly Tyr Asn Val Ala Gln Cys Asn Met Thr Lys Asp Trp Phe Leu Val Gln Met Leu Ala His Tyr Asn Ile Gly Tyr Gln Gly Phe Tyr Val Pro Glu Gly Tyr Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gin Pro Met Ser Arg Gin Val Val Asp Glu Val Asn Tyr Lys Asp Tyr Gln Ala Val Thr Leu Ala Tyr Gln His Asn Asn Ser Gly Phe Val Gly Tyr Leu Ala Pro Thr Met Arg Gln Gly Gln Pro Tyr Pro Ala Xaa Tyr Pro Tyr Pro Leu Ile Gly Lys Ser Ala Val Thr Ser Val Thr Gln Lys Lys Phe Leu Cys Asp Arg Val Met Trp Arg Ile Pro Phe Ser Ser Asn Phe Met Ser Met Gly Ala Leu Thr Asp Leu Gly Gln Asn Met Leu Tyr Ala Asn Ser Ala His Ala Leu Asp Met Asn Phe Glu Val Asp Pro Met Asp Glu Ser Thr Leu Leu Tyr Val Val Phe Glu Val Phe Asp Val Val Arg Val His Gln Pro His Arg Gly Val Ile Glu Ala Val Tyr Xaa Arg Thr Pro Phe Ser Ala Gly Asn Ala Thr Thr <210> 17 <211> 513 <212> PRT
<213> chimpanzee C68 adenovirus protein <220> <221> MISC FEATURE
<222> (511)..(511) <223> Xaa can be any amino acid <400> 17 Met Ala Gly Arg Gly Gly Ser Gln Ser Glu Arg Arg Arg Glu Arg Thr Pro Glu Arg Gly Arg Gly Ser Ala Ser His Pro Pro Ser Arg Gly Gly Glu Ser Pro Ser Pro Pro Pro Leu Pro Pro Lys Arg His Thr Tyr Arg Arg Val Ala Ser Asp Gln Glu Glu Glu Glu Ile Val Val Val Ser Glu Asn Ser Arg Ser Pro Ser Pro Ser Pro Thr Ser Pro Pro Pro Leu Pro Pro Lys Lys Lys Pro Arg Lys Thr Lys His Val Val Leu Gln Asp Val Ser Gln Asp Ser Glu Asp Glu Arg Gln Ala Glu Glu Glu Leu Ala Ala Val Gly Phe Ser Tyr Pro Pro Val Arg Ile Thr Glu Lys Asp Gly Lys Arg Ser Phe Glu Thr Leu Asp Glu Ser Asp Pro Leu Ala Ala Ala Ala Ser Ala Lys Met Met Val Lys Asn Pro Met Ser Leu Pro Ile Val Ser Ala Trp Glu Lys Gly Met Glu Ile Met Thr Met Leu Met Asp Arg Tyr Arg Val Glu Thr Asp Leu Lys Ala Asn Phe Gln Leu Met Pro Glu Gln Gly Glu Val Tyr Arg Arg Ile Cys His Leu Tyr Ile Asn Glu Glu His Arg Gly Ile Pro Leu Thr Phe Thr Ser Asn Lys Thr Leu Thr Thr Met Met Gly Arg Phe Leu Gln Gly Phe Val His Ala His Ser Gln Ile Ala His Lys Asn Trp Glu Cys Thr Gly Cys Ala Leu Trp Leu His Gly Cys Thr Glu Ala Glu Gly Lys Leu Arg Cys Leu His Gly Thr Thr Met Ile Gln Lys Glu His Met Ile Glu Met Asp Val Ala Ser Glu Asn Gly Gln Arg Ala Leu Lys Glu Asn Pro Asp Arg Ala Lys Ile Thr Gln Asn Arg Trp Gly Arg Ser Val Val Gln Leu Ala Asn Asn Asp Ala Arg Cys Cys Val His Asp Ala Gly Cys Ala Thr Asn Gln Phe Ser Ser Lys Ser Cys Gly Val Phe Phe Thr Glu Gly Ala Lys Ala Gin Gln Ala Phe Arg Gln Leu Glu Ala Phe Met Lys Ala Met Tyr Pro Gly Met Asn Ala Asp Gln Ala Gin Met Met Leu Ile Pro Leu His Cys Asp Cys Asn His Lys Pro Gly Cys Val Pro Thr Met Gly Arg Gln Thr Cys Lys Met Thr Pro Phe Gly Met Ala Asn Ala Glu Asp Leu Asp Val Glu Ser Ile Thr Asp Ala Thr Val Leu Ala Ser Val Lys His Pro Ala Leu Met Val Phe Gln Cys Cys Asn Pro Val Tyr Arg Asn Ser Arg Ala Gln Asn Ala Gly Pro Asn Cys Asp Phe Lys Ile Ser Ala Pro Asp Leu Leu Gly Ala Leu Gln Leu Thr Arg Lys Leu Trp Thr Asp Ser Phe Pro Asp Thr Pro Leu Pro Lys Leu Leu Ile Pro Glu Phe Lys Trp Leu Ala Lys Tyr Gln Phe Arg Asn Val Ser Leu Pro Ala Gly His Ala Glu Thr Arg Lys Asn Pro Xaa Asp Phe <210> 18 <211> 222 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 18 Met Pro Arg Gly Asn Lys Lys Leu Lys Val Glu Leu Pro Pro Val Glu Asp Leu Glu Glu Asp Trp Glu Asn Ser Ser Gin Ala Glu Glu Glu Glu Met Glu Glu Asp Trp Asp Ser Thr Gln Ala Glu Glu Asp Ser Leu Gln Asp Ser Leu Glu Glu Asp Glu Glu Glu Ala Glu Glu Glu Val Glu Glu Ala Ala Ala Ala Arg Pro Ser Ser Ser Ala Gly Glu Lys Ala Ser Ser Thr Asp Thr Ile Ser Ala Pro Gly Arg Gly Pro Ala Arg Pro His Ser Arg Trp Asp Glu Thr Gly Arg Phe Pro Asn Pro Thr Thr Gln Thr Ala Pro Thr Thr Ser Lys Lys Arg Gln Gln Gln Gln Lys Lys Thr Ser Arg Lys Pro Ala Ala Arg Lys Ser Thr Ala Ala Ala Ala Gly Gly Leu Arg Ile Ala Ala Asn Glu Pro Ala Gln Thr Arg Glu Leu Arg Asn Arg Ile Phe Pro Thr Leu Tyr Ala Ile Phe Gln Gln Ser Arg Gly Gln Glu Gln Glu Leu Lys Val Lys Asn Arg Ser Leu Arg Ser Leu Thr Arg Ser Cys Leu Tyr His Lys Ser Glu Asp Gln Leu Gln Arg Thr Leu Glu Asp Ala Glu Ala Leu Phe Asn Lys Tyr Cys Ala Leu Thr Leu Lys Glu <210> 19 <211> 227 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 19 Met Ser Lys Glu Ile Pro Thr Pro Tyr Met Trp Ser Tyr Gln Pro Gln Met Gly Leu Ala Ala Gly Ala Ala Gln Asp Tyr Ser Thr Arg Met Asn Trp Leu Ser Ala Gly Pro Ala Met Ile Ser Arg Val Asn Asp Ile Arg Ala His Arg Asn Gln Ile Leu Leu Glu Gln Ser Ala Leu Thr Ala Thr Pro Arg Asn His Leu Asn Pro Arg Asn Trp Pro Ala Ala Leu Val Tyr Gln Glu Ile Pro Gln Pro Thr Thr Val Leu Leu Pro Arg Asp Ala Gln Ala Glu Val Gln Leu Thr Asn Ser Gly Val Gln Leu Ala Gly Gly Ala Thr Leu Cys Arg His Arg Pro Ala Gln Gly Ile Lys Arg Leu Val Ile Arg Gly Arg Ser Thr Gln Leu Asn Asp Glu Val Val Ser Ser Ser Leu Gly Leu Arg Pro Asp Gly Val Phe Gln Leu Ala Gly Ser Gly Arg Ser Ser Phe Thr Pro Arg Gln Ala Val Leu Thr Leu Glu Ser Ser Ser Ser Gln Pro Arg Ser Gly Gly Ile Gly Thr Leu Gln Phe Val Glu Glu Phe Thr Pro Ser Val Tyr Phe Asn Pro Phe Ser Gly Ser Pro Gly His Tyr Pro Asp Glu Phe Ile Pro Asn Phe Asp Ala Ile Ser Glu Ser Val Asp Gly Tyr Asp <210> 20 <211> 106 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 20 Met Ser His Gly Giy Ala Ala Asp Leu Ala Arg Leu Arg His Leu Asp His Cys Arg Arg Phe Arg Cys Phe Ala Arg Asp Leu Ala Glu Phe Ala Tyr Phe Glu Leu Pro Glu Glu His Pro Gln Gly Pro Ala His Gly Val Arg Ile Val Val Glu Gly Gly Leu Asp Ser His Leu Leu Arg Ile Phe Ser Gln Arg Pro Ile Leu Val Glu Arg Glu Gln Gly Gln Thr Leu Leu Thr Leu Tyr Cys Ile Cys Asn His Pro Gly Leu His Glu Ser Leu Cys Cys Leu Leu Cys Thr Glu Tyr Asn Lys Ser <210> 21 <211> 146 <212> PRT
<213> chimpanzee C68 adenovirus protein <220> <221> MISC FEATURE
<222> (62)..(62) <223> Xaa can be any amino acid <400> 21 Met Lys Val Phe Val Val Cys Cys Val Leu Ser Ile Ile Lys Ala Glu Thr Ala Thr Thr Pro Asp Phe Arg Val Ser Lys Leu Gln Leu Phe Gln Pro Phe Leu Pro Gly Thr Tyr Gln Cys Val Ser Gly Pro Cys His His Thr Phe His Leu Ile Pro Asn Thr Thr Ala Ser Leu Pro Xaa Thr Asn Asn Gin Thr Asn Leu His Gln Arg His Arg Arg Asp Leu Ser Glu Ser Asn Thr Thr Thr His Thr Gly Gly Glu Leu Arg Gly Gln Pro Thr Ser Gly Ile Tyr Tyr Gly Pro Trp Glu Val Val Gly Leu Ile Thr Leu Gly Leu Val Ala Gly Gly Leu Leu Val Leu Cys Tyr Leu Tyr Leu Pro Cys Cys Ser Tyr Leu Val Val Leu Cys Cys Trp Phe Lys Lys Trp Gly Arg Ser Pro <210> 22 <211> 176 <212> PRT
<213> chimpanzee C68 adenovirus protein <220>
<221> MISC FEATURE
<222> (28)_.(28) <223> Xaa can be any amino acid <400> 22 Met Gly Lys Ile Thr Leu Val Ser Cys Gly Ala Leu Val Ala Val Leu Leu Ser Ile Val Gly Leu Gly Gly Ala Ala Val Xaa Lys Glu Lys Ala Asp Pro Cys Leu His Phe Asn Pro Asn Lys Cys Gln Leu Ser Phe Gln Pro Asp Gly Asn Arg Cys Ala Val Leu Ile Lys Cys Gly Trp Glu Cys Glu Asn Val Arg Ile Glu Tyr Asn Asn Lys Thr Arg Asn Asn Thr Leu Ala Ser Val Trp Gln Pro Gly Asp Pro Glu Trp Tyr Thr Val Ser Val Pro Gly Ala Asp Gly Ser Pro Arg Thr Val Asn Asn Thr Phe Ile Phe Ala His Met Cys Asp Thr Val Met Trp Met Ser Lys Gln Tyr Asp Met Trp Pro Pro Thr Lys Glu Asn Ile Val Val Phe Ser Ile Ala Tyr Ser Leu Cys Thr Ala Leu Ile Thr Ala Ile Val Cys Leu Ser Ile His Met Leu Ile Ala Ile Arg Pro Arg Asn Asn Ala Glu Lys Glu Lys Gln Pro <210> 23 <211> 204 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 23 Met Ala Ser Val Lys Phe Leu Leu Leu Phe Ala Ser Leu Ile Ala Val Ile His Gly Met Ser Asn Glu Lys Ile Thr Ile Tyr Thr Gly Thr Asn His Thr Leu Lys Gly Pro Glu Lys Ala Thr Glu Val Ser Trp Tyr Cys Tyr Phe Asn Glu Ser Asp Val Ser Thr Glu Leu Cys Gly Asn Asn Asn Lys Lys Asn Glu Ser Ile Thr Leu Ile Lys Phe Gln Cys Gly Ser Asp Leu Thr Leu Ile Asn Ile Thr Arg Asp Tyr Val Gly Met Tyr Tyr Gly Thr Thr Ala Gly Ile Ser Asp Met Glu Phe Tyr Gln Val Ser Val Ser Glu Pro Thr Thr Pro Arg Met Thr Thr Thr Thr Lys Thr Thr Pro Val Thr Thr Met Gln Leu Thr Thr Asn Asn Ile Phe Ala Met Arg Gln Met Val Asn Asn Ser Thr Gln Pro Thr Pro Pro Ser Glu Glu Ile Pro Lys Ser Met Ile Gly Ile Ile Val Ala Val Val Val Cys Met Leu Ile Ile Ala Leu Cys Met Val Tyr Tyr Ala Phe Cys Tyr Arg Lys His Arg Leu Asn Asp Lys Leu Glu His Leu Leu Ser Val Glu Phe <210> 24 <211> 91 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 24 Met Ile Pro Arg Gln Phe Leu Ile Thr Ile Leu Ile Cys Leu Leu Gln Val Cys Ala Thr Leu Ala Leu Val Ala Asn Ala Ser Pro Asp Cys Ile Gly Pro Phe Ala Ser Tyr Val Leu Phe Ala Phe Thr Thr Cys Ile Cys Cys Cys Ser Ile Val Cys Leu Leu Ile Thr Phe Phe Gln Phe Ile Asp Trp Ile the Val Arg Ile Ala Tyr Leu Arg His His Pro Gln Tyr Arg Asp Gln Arg Val Ala Arg Leu Leu Arg Leu Leu <210> 25 <211> 143 <212> PRT
<213> chimpanzee C68 adenovirus protein <220>
<221> MISC FEATURE
<222> (5)._(5) <223> Xaa can be any amino acid <400> 25 Met Arg Ala Val Xaa Leu Leu Ala Leu Leu Leu Leu Val Leu Pro Arg Pro Val Asp Pro Arg Ser Pro Thr Gln Ser Pro Glu Glu Val Arg Lys Cys Lys Phe Gln Glu Pro Trp Lys Phe Leu Lys Cys Tyr Arg Gln Lys Ser Asp Met His Pro Ser Trp Ile Met Ile Ile Gly Ile Val Asn Ile Leu Ala Cys Thr Leu Ile Ser Phe Val Ile Tyr Pro Cys Phe Asp Phe Gly Trp Asn Ser Pro Glu Ala Leu Tyr Leu Pro Pro Glu Pro Asp Thr Pro Pro Gln Gln Pro Gln Ala His Ala Leu Pro Pro Leu Gln Pro Arg Pro Gln Tyr Met Pro Ile Leu Asp Tyr Glu Ala Glu Pro Gln Arg Pro Met Leu Pro Ala Ile Ser Tyr Phe Asn Leu Thr Gly Gly Asp Asp <210> 26 <211> 135 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 26 Met Thr Asp Pro Leu Ala Asn Asn Asn Val Asn Asp Leu Leu Leu Asp Met Asp Gly Arg Ala Ser Glu Gln Arg Leu Ala Gln Leu Arg Ile Arg Gln Gln Gln Glu Arg Ala Val Lys Glu Leu Gln Asp Ala Val Ala Ile His Gln Cys Lys Arg Gly Ile Phe Cys Leu Val Lys Gln Ala Lys Ile Ser Tyr Glu Val Thr Pro Asn Asp His Arg Leu Ser Tyr Glu Leu Leu Gln Gln Arg Gln Lys Phe Thr Cys Leu Val Gly Val Asn Pro Ile Val Ile Thr Gln Gln Ser Gly Asp Thr Lys Gly Cys Ile His Cys Ser Cys Asp Ser Pro Asp Cys Val His Thr Leu Ile Lys Thr Leu Cys Gly Leu Arg Asp Leu Leu Pro Met Asn <210> 27 <211> 425 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 27 Met Ser Lys Lys Arg Val Arg Val Asp Asp Asp Phe Asp Pro Val Tyr Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val Pro Phe Ile Asn Pro Pro Phe Val Ser Ser Asp Gly Phe Gln Glu Lys Pro Leu Gly Val Leu Ser Leu Arg Leu Ala Asp Pro Val Thr Thr Lys Asn Gly Glu Ile Thr Leu Lys Leu Gly Glu Gly Val Asp Leu Asp Ser Ser Gly Lys Leu Ile Ser Asn Thr Ala Thr Lys Ala Ala Ala Pro Leu Ser Phe Ser Asn Asn Thr Ile Ser Leu Asn Met Asp His Pro Phe Tyr Thr Lys Asp Gly Lys Leu Ser Leu Gln Val Ser Pro Pro Leu Asn Ile Leu Arg Thr Ser Ile Leu Asn Thr Leu Ala Leu Gly Phe Gly Ser Gly Leu Gly Leu Arg Gly Ser Ala Leu Ala Val Gln Leu Val Ser Pro Leu Thr Phe Asp Thr Asp Gly Asn Ile Lys Leu Thr Leu Asp Arg Gly Leu His Val Thr Thr Gly Asp Ala Ile Glu Ser Asn Ile Ser Trp Ala Lys Gly Leu Lys Phe Glu Asp Gly Ala Ile Ala Thr Asn Ile Gly Asn Gly Leu Glu Phe Gly Ser Ser Ser Thr Glu Thr Gly Val Asp Asp Ala Tyr Pro Ile Gln Val Lys Leu Gly Ser Gly Leu Ser Phe Asp Ser Thr Gly Ala Ile Met Ala Gly Asn Lys Glu Asp Asp Lys Leu Thr Leu Trp Thr Thr Pro Asp Pro Ser Pro Asn Cys Gln Ile Leu Ala Glu Asn Asp Ala Lys Leu Thr Leu Cys Leu Thr Lys Cys Gly Ser Gln Ile Leu Ala Thr Val Ser Val Leu Val Val Gly Ser Gly Asn Leu Asn Pro Ile Thr Gly Thr Val Ser Ser Ala Gln Val Phe Leu Arg Phe Asp Ala Asn Gly Val Leu Leu Thr Glu His Ser Thr Leu Lys Lys Tyr Trp Gly Tyr Arg Gln Gly Asp Ser Ile Asp Gly Thr Pro Tyr Thr Asn Ala Val Gly Phe Met Pro Asn Leu Lys Ala Tyr Pro Lys Ser Gln Ser Ser Thr Thr Lys Asn Asn Ile Val Gly Gln Val Tyr Met Asn Gly Asp Val Ser Lys Pro Met Leu Leu Thr Ile Thr Leu Asn Gly Thr Asp Asp Ser Asn Ser Thr Tyr Ser Met Ser Phe Ser Tyr Thr Trp Thr Asn Gly Ser Tyr Val Gly Ala Thr Phe Gly Ala Asn Ser Tyr Thr Phe Ser Tyr Ile Ala Gln Glu <210> 28 <211> 83 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 28 Ile Thr Val Ile Pro Thr Thr Glu Asp Asn Pro Gln Leu Leu Ser Cys Glu Val Gln Met Arg Glu Cys Pro Glu Gly Phe Ile Ser Leu Thr Asp Pro Arg Leu Ala Arg Ser Glu Thr Val Trp Asn Val Glu Thr Lys Ser Met Ser Ile Thr Asn Gly Ile Gin Met Phe Lys Ala Val Arg Gly Glu Arg Val Val Tyr Ser Met Ser Trp Glu Gly Giy Gly Lys Ile Thr Ala Arg Ile Leu <210> 29 <211> 301 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 29 Met Ser Glu Ser Asn Cys Ile Met Thr Arg Ser Arg Thr Arg Ser Ala Ala Ser Arg His His Pro Tyr Arg Pro Ala Pro Leu Pro Arg Cys Glu Glu Thr Glu Thr Arg Ala Ser Leu Val Glu Asp His Pro Val Leu Pro Asp Cys Asp Thr Leu Ser Met His Asn Val Ser Ser Val Arg Gly Leu Pro Cys Ser Ala Gly Phe Ala Val Leu Gln Glu Phe Pro Val Pro Trp Asp Met Val Leu Thr Pro Glu Glu Leu Arg Val Leu Lys Arg Cys Met Ser Ile Cys Leu Cys Cys Ala Asn Ile Asp Leu Phe Ser Ser Gln Met Ile His Gly Tyr Glu Arg Trp Val Leu His Cys His Cys Arg Asp Pro Gly Ser Leu Arg Cys Met Ala Gly Gly Ala Val Leu Ala Leu Trp Phe Arg Arg Ile Ile Arg Gly Cys Met Phe Asn Gln Arg Val Met Trp Tyr Arg Glu Val Val Asn Arg His Met Pro Lys Glu Ile Met Tyr Val Gly Ser Val Phe Trp Arg Gly His His Leu Ile Tyr Leu Arg Ile Trp Tyr Asp Gly His Val Gly Ser Ile Leu Pro Ala Met Ser Phe Gly Trp Ser Val Leu Asn Tyr Gly Leu Leu Asn Asn Leu Val Val Leu Cys Cys Thr Tyr Cys Ser Asp Leu Ser Glu Ile Arg Met Arg Cys Cys Ala Arg Arg Thr Arg Arg Leu Met Leu Arg Ala Val Gly Ile Met Leu Arg Glu Ser Leu Asp Pro Asp Pro Leu Ser Ser Ser Leu Thr Glu Arg Arg Arg Gln Arg Leu Leu Arg Gly Leu Met Arg His His Arg Pro Ile Pro Phe Ala Asp Tyr Asp Ser His Arg Arg Ser Ser Ala Ser Ser Arg <210> 30 <211> 121 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 30 Met Val Leu Pro Val Leu Pro Ser Pro Ala Val Thr Glu Thr Gln Gln Asn Cys Ile Ile Trp Leu Gly Leu Ala His Ser Thr Val Val Asp Val Ile Arg Ala Ile Arg His Asp Gly Ile Phe Ile Thr Pro Glu Ala Leu Asp Leu Leu His Gly Leu Arg Glu Trp Leu Phe Tyr Asn Phe Asn Thr Glu Arg Ser Lys Arg Arg Asp Arg Arg Arg Arg Ser Val Cys Ser Ala Arg Thr Arg Phe Cys Tyr Ser Lys Tyr Glu Asn Val Arg Lys Gln Leu His His Asp Thr Val Ala Asn Thr Ile Ser Arg Val Pro Pro Ser Pro Val Ser Ala Gly Pro Leu Thr Thr Leu <210> 31 <211> 117 <212> PRT
<213> chimpanzee C68 adenovirus protein <220> <221> MISC FEATURE
<222> (45)..(45) <223> Xaa can be any amino acid <400> 31 Met Arg Val Cys Leu Arg Met Pro Val Glu Gly Ala Leu Arg Glu Leu Phe Ile Met Ala Gly Leu Asp Leu Pro His Glu Leu Val Arg Ile Ile Gln Gly Trp Lys Asn Glu Asn Tyr Leu Gly Met Val Xaa Glu Cys Asn Met Met Ile Glu Glu Leu Glu Asn Pro Pro Ala Phe Ala Ile Val Leu Phe Leu Asp Val Arg Val Glu Ala Leu Leu Glu Ala Thr Val Glu His Leu Glu Asn Arg Ile Thr Phe Asp Leu Ala Val Ile Phe His Gln His Ser Gly Gly Glu Arg Cys His Leu Arg Asp Leu His Phe Glu Val Leu Arg Asp Arg Leu Asp <210> 32 <211> 129 <212> PRT
<213> chimpanzee C68 adenovirus protein <400> 32 Met Leu Glu Arg Thr Ala Cys Ile Tyr Phe Ile Val Val Pro Glu Ala Leu Asn Val His Leu Glu Asp Phe Ser Phe Val Asp Phe Leu Lys Asn Cys Leu Gly Asp Phe Leu Ser Ser Tyr Leu Glu Asp Ile Thr Gly Ser Ser Gln His Ala Tyr Ser Ser Leu Ala Phe Gly Asn Ala His Trp Gly Gly Leu Arg Phe Ile Cys Thr Val Ala Cys Pro Asn Leu Ile Pro Gly Gly Pro Met Ala Lys Asn Phe Gly Glu Asp Met Lys Glu Tyr Leu Gln Leu Leu Leu Arg Glu Glu Leu Arg Asp Arg Gly Arg Asp Phe Asp Ile Pro Leu Val Asn Leu Leu Gln Val Asn Gln Glu Gln Asn Ile Leu Glu Leu <210> 33 <211> 36521 <212> DNA
<213> chimpanzee C68 adenovirus <220>
<221> misc feature <222> (8268)..(8268) <223> can be a or c or g or t <220>
<221> misc feature <222> (8322)..(8322) <223> can be a or c or g or t <220>
<221> misc feature <222> (8535)..(8535) <223> can be a or c or g or t <220>
<221> misc feature <222> (16753)..(16753) <223> can be a or c or g or t <220>
<221> misc feature <222> (28095)..(28095) <223> can be a or c or g or t <220>
<221> misc feature <222> (29373)..(29373) <223> can be a or c or g or t <220>
<221> misc feature <222> (30447)..(30447) <223> can be a or c or g or t <220>
<221> misc feature <222> (31015)..(31015) <223> can be a or c or g or t <400> 33 ccttcttcaa taatatacct tcaaactttt tgtgcgcgtt aatatgcaaa tgaggcgttt 60 gaatttgggg aggaagggcg gtgattggtc gagggatgag cgaccgttag gggcggggcg 120 agtgacgttt tgatgacgtg gttgcgagga ggagccagtt tgcaagttct cgtgggaaaa 180 gtgacgtcaa acgaggtgtg gtttgaacac ggaaatactc aattttcccg cgctctctga 240 caggaaatga ggtgtttctg ggcggatgca agtgaaaacg ggccattttc gcgcgaaaac 300 tgaatgagga agtgaaaatc tgagtaattt cgcgtttatg gcagggagga gtatttgccg 360 agggccgagt agactttgac cgatcacgtg ggggtttcga ttaccgtgtt tttcacctaa 420 atttccgcgt acggtgtcaa agtccggtgt ttttacgtag gtgtcagctg atcgccaggg 480 tatttaaacc tgcgctctcc agtcaagagg ccactcttga gtgccagcga gaagagtttt 540 ctcctccgcg ccgcgagtca gatctacact ttgaaagatg aggcacctga gagacctgcc 600 cgatgagaaa atcatcatcg cttccgggaa cgagattctg gaactggtgg taaatgccat 660 gatgggcgac gaccctccgg agccccccac cccatttgag acaccttcgc tgcacgattt 720 gtatgatctg gaggtggatg tgcccgagga cgatcccaat gaggaggcgg taaatgattt 780 ttttagcgat gccgcgctgc tagctgccga ggaggcttcg agctctagct cagacagcga 840 ctcttcactg cataccccta gacccggcag aggtgagaaa aagatccccg agcttaaagg 900 ggaagagatg gacttgcgct gctatgagga atgcttgccc ccgagcgatg atgaggacga 960 gcagggaatc cagaacgcag cgagccaggg agtgcaagcc gccagcgaga gctttgcgct 1020 ggactgcccg cctctgcccg gacacggctg taagtcttgt gaatttcatc gcatgaatac 1080 tggagataaa gctgtgttgt gtgcactttg ctatatgaga gcttacaacc attgtgttta 1140 cagtaagtgt gattaagttg aactttagag ggaggcagag agcagggtga ctgggcgatg 1200 actggtttat ttatgtatat atgttcttta tataggtccc gtctctgacg cagatgatga 1260 gacccccact acaaagtcca cttcgtcacc cccagaaatt ggcacatctc cacctgagaa 1320 tattgttaga ccagttcctg ttagagccac tgggaggaga gcagctgtgg aatgtttgga 1380 tgacttgcta cagggtgggg ttgaaccttt ggacttgtgt acccggaaac gccccaggca 1440 ctaagtgcca cacatgtgtg tttacttgag gtgatgtcag tatttatagg gtgtggagtg 1500 caataaaaaa tgtgttgact ttaagtgcgt ggtttatgac tcaggggtgg ggactgtgag 1560 tatataagca ggtgcagacc tgtgtggtta gctcagagcg gcatggagat ttggacggtc 1620 ttggaagact ttcacaagac tagacagctg ctagagaacg cctcgaacgg agtctcttac 1680 ctgtggagat tctgcttcgg tggcgaccta gctaggctag tctacagggc caaacaggat 1740 tatagtgaac aatttgaggt tattttgaga gagtgttctg gtctttttga cgctcttaac 1800 ttgggccatc agtctcactt taaccagagg atttcgagag cccttgattt tactactcct 1860 ggcagaacca ctgcagcagt agcctttttt gcttttattc ttgacaaatg gagtcaagaa 1920 acccatttca gcagggatta ccagctggat ttcttagcag tagctttgtg gagaacatgg 1980 aagtgccagc gcctgaatgc aatctccggc tacttgccgg tacagccgct agacactctg 2040 aggatcctga atctccagga gagtcccagg gcacgccaac gtcgccagca gcagcagcag 2100 gaggaggatc aagaagagaa cccgagagcc ggcctggacc ctccggcgga ggaggaggag 2160 tagctgacct gtttcctgaa ctgcgccggg tgctgactag gtcttcgagt ggtcgggaga 2220 gggggattaa gcgggagagg catgatgaga ctaatcacag aactgaactg actgtgggtc 2280 tgatgagtcg caagcgccca gaaacagtgt ggtggcatga ggtgcagtcg actggcacag 2340 atgaggtgtc ggtgatgcat gagaggtttt ctctagaaca agtcaagact tgttggttag 2400 agcctgagga tgattgggag gtagccatca ggaattatgc caagctggct ctgaggccag 2460 acaagaagta caagattact aagctgataa atatcagaaa tgcctgctac atctcaggga 2520 atggggctga agtggagatc tgtctccagg aaagggtggc tttcagatgc tgcatgatga 2580 atatgtaccc gggagtggtg ggcatggatg gggttacctt tatgaacatg aggttcaggg 2640 gagatgggta taatggcacg gtctttatgg ccaataccaa gctgacagtc catggctgct 2700 ccttctttgg gtttaataac acctgcatcg aggcctgggg tcaggtcggt gtgaggggct 2760 gcagtttttc agccaactgg atgggggtcg tgggcaggac caagagtatg ctgtccgtga 2820 agaaatgctt gtttgagagg tgccacctgg gggtgatgag cgagggcgaa gccagaatcc 2880 gccactgcgc ctctaccgag acgggctgct ttgtgctgtg caagggcaat gctaagatca 2940 agcataatat gatctgtgga gcctcggacg agcgcggcta ccagatgctg acctgcgccg 3000 gcgggaacag ccatatgctg gccaccgtac atgtggcttc ccatgctcgc aagccctggc 3060 ccgagttcga gcacaatgtc atgaccaggt gcaatatgca tctggggtcc cgccgaggca 3120 tgttcatgcc ctaccagtgc aacctgaatt atgtgaaggt gctgctggag cccgatgcca 3180 tgtccagagt gagcctgacg ggggtgtttg acatgaatgt ggaggtgtgg aagattctga 3240 gatatgatga atccaagacc aggtgccgag cctgcgagtg cggagggaag catgccaggt 3300 tccagcccgt gtgtgtggat gtgacggagg acctgcgacc cgatcatttg gtgttgccct 3360 gcaccgggac ggagttcggt tccagcgggg aagaatctga ctagagtgag tagtgttctg 3420 gggcggggga ggacctgcat gagggccaga ataactgaaa tctgtgcttt tctgtgtgtt 3480 gcagcagcat gagcggaagc ggctcctttg agggaggggt attcagccct tatctgacgg 3540 ggcgtctccc ctcctgggcg ggagtgcgtc agaatgtgat gggatccacg gtggacggcc 3600 ggcccgtgca gcccgcgaac tcttcaaccc tgacctatgc aaccctgagc tcttcgtcgt 3660 tggacgcagc tgccgccgca gctgctgcat ctgCCgccag ccccgtgcgc ggaatggcca 3720 tgggcgccgg ctactacggc actctggtgg ccaactcgag ttccaccaat aatcccgcca 3780 gcctgaacga ggagaagctg ttgctgctga tggcccagct cgaggccttg acccagcgcc 3840 tgggcgagct gacccagcag gtggctcagc tgcaggagca gacgcgggcc gcggttgcca 3900 cggtgaaatc caaataaaaa atgaatcaat aaataaacgg agacggttgt tgattttaac 3960 acagagtctg aatctttatt tgatttttcg cgcgcggtag gccctggacc accggtctcg 4020 atcattgagc acccggtgga tcttttccag gacccggtag aggtgggctt ggatgttgag 4080 gtacatgggc atgagcccgt cccgggggtg gaggtagctc cattgcaggg cctcgtgctc 4140 gggggtggtg ttgtaaatca cccagtcata gcaggggcgc agggcatggt gttgcacaat 4200 atctttgagg aggagactga tggccacggg cagccctttg gtgtaggtgt ttacaaatct 4260 gttgagctgg gagggatgca tgcgggggga gatgaggtgc atcttggcct ggatcttgag 4320 attggcgatg ttaccgccca gatcccgcct ggggttcatg ttgtgcagga ccaccagcac 4380 ggtgtatccg gtgcacttgg ggaatttatc atgcaacttg gaagggaagg cgtgaaagaa 4440 tttggcgacg cctttgtgcc cgcccaggtt ttccatgcac tcatccatga tgatggcgat 4500 gggcccgtgg gcggcggcct gggcaaagac gtttcggggg tcggacacat catagttgtg 4560 gtcctgggtg aggtcatcat aggccatttt aatgaatttg gggcggaggg tgccggactg 4620 ggggacaaag gtaccctcga tcccgggggc gtagttcccc tcacagatct gcatctccca 4680 ggctttgagc tcggaggggg ggatcatgtc cacctgcggg gcgataaaga acacggtttc 4740 cggggcgggg gagatgagct gggccgaaag caagttccgg agcagctggg acttgccgca 4800 gccggtgggg ccgtagatga ccccgatgac cggctgcagg tggtagttga gggagagaca 4860 gctgccgtcc tcccggagga ggggggccac ctcgttcatc atctcgcgca cgtgcatgtt 4920 ctcgcgcacc agttccgcca ggaggcgctc tccccccagg gataggagct cctggagcga 4980 ggcgaagttt ttcagcggct tgagtccgtc ggccatgggc attttggaga gggtttgttg 5040 caagagttcc aggcggtccc agagctcggt gatgtgctct acggcatctc gatccagcag 5100 acctcctcgt ttcgcgggtt gggacggctg cgggagtatg gcaccagacg atgggcgtcc 5160 agcgcagcca gggtccggtc cttccagggt cgcagcgtcc gcgtcagggt ggtctccgtc 5220 acggtgaagg ggtgcgcgcc gggctgggcg cttgcgaggg tgcgcttcag gctcatccgg 5280 ctggtcgaaa accgctcccg atcggcgccc tgcgcgtcgg ccaggtagca attgaccatg 5340 agttcgtagt tgagcgcctc ggccgcgtgg cctttggcgc ggagcttacc tttggaagtc 5400 tgcccgcagg cgggacagag gagggacttg agggcgtaga gcttgggggc gaggaagacg 5460 gactcggggg cgtaggcgtc cgcgccgcag tgggcgcaga cggtctcgca ctccacgagc 5520 caggtgaggt cgggctggtc ggggtcaaaa accagtttcc cgccgttctt tttgatgcgt 5580 ttcttacctt tggtctccat gagctcgtgt ctccgctggg tgacaaagag ctgtccgtgt 5640 ccccgtagac cgactttatg ggccggtcct cgagcggtgt gccgcggtcc tcctcgtaga 5700 ggaaccccgc ccactccgag acgaaagccc gggtccaggc cagcacgaaw gaggccacgt 5760 gggacgggta tcggtcgttg tccaccagcg ggtccacctt ttccagggta tgcaaacaca 5820 tgtccccctc gtccacatcc aggaaggtga ttggcttgta agtgtakgcc acgtgaccgg 5880 gggtcccggc cgggggggta taaaagggtg cgggtccctg ctcgtcctca ctgtcttccg 5940 gatcgctgtc cacgagcgcc agctgttggg gtaggtattc cctctcgaag gcgggcatga 6000 cctcggcact caggttgtca gtttctagaa acgaggagga tttgatattg acggtgccgg 6060 cggagatgcc tttcaagagc ccctcgtcca tctggtcaga aaagacgatc tttttgttgt 6120 cgagcttggt ggcgaaggag ccgtakaggg cgttggagag gagcttggcg atggagcgca 6180 tggtctggtt tttttccttg tcggcgcgct ccttggcggc gatgttgagc tgcacgtact 6240 cgcgcgccac gcacttccat tcggggaaga cgtggtcagc tcgtcgggca cgattctgac 6300 ctgccagccc cgattatgca gggtgatgag gtccacactg gtggccacct cgccgcgcag 6360 gggctcatta ktccagcaga ggcgtccgcc cttgcgcgag cagaaggggg gcagggggtc 6420 cagcatgacc tcgtcggggg ggtcggcatc gatggtgaag atgccgggca ggaggtcggg 6480 gtcaaagtag ctgatggaag tggccagatc gtccagggca gcttgccatt cgcgcacggc 6540 cagcgcgcgc tcgtagggac tgaggggcgt gccccagggc atgggatggg taagcgcgga 6600 ggcgtacatg ccgcagatgt cgtagacgta gaggggctcc tcgaggatgc cgatgtaggt 6660 ggggtagcag cgccccccgc ggatgctggc gcgcacgtag tcatacagct cgtgcgaggg 6720 ggcgaggagc cccgggccca ggttggtgcg actgggcttt tcggcgcggt agacgatctg 6780 gcggaaaatg gcatgcgagt tggaggagat ggtgggcctt tggaagatgt tgaagtgggc 6840 gtggggcagt ccgaccgagt cgcggatgaa gtgggcgtag gagtcttgca gcttggcgac 6900 gagctcggcg gtgactagga cgtccagagc gcagtagtcg agggtctcct ggatgatgtc 6960 atacttgagc tgtccctttt gtttccacag ctcgcggttg agaaggaact cttcgcggtc 7020 cttccagtac tcttcgaggg ggaacccgtc ctgatctgca cggtaagagc ctagcatgta 7080 gaactggttg acggccttgt aggcgcagca gccyttctcc acggggargg cgtaggcctg 7140 ggcggccttg cgcagggagg tgtgcgtgag ggcgaaagtg tccctgacca tgaccttgag 7200 gaactggtgc ttgaagtcga tattgtcgca gcccccccgc tcccagagct ggaagtccgt 7260 gcgcttcttg taggcggggt tgggcaaagc gaaagtaaca tcgttgaaga ggatcttgcc 7320 cgcgcggggc ataaagttgc gagtgatgcg gaaaggttgg ggcacctcgg cccggttgtt 7380 gatgacctgg gcggcgagca cgatctcgtc gaagccgttg atgttgtggc ccacgatgta 7440 gagttccacg aatcgcggac ggcccttgac gtggggcagt ttcttgagct cctcgtaggt 7500 gagctcgtcg gggtcgctga gcccgtgctg ctcgagcgcc cagtcggcga gatgggggtt 7560 ggcgcggagg aaggaagtcc agagatccac ggccagggcg gtttgcagac ggtcccggta 7620 ctgacggaac tgctgcccga cggccatttt ttcgggggtg acgcagtaga aggtgcgggg 7680 gtccacgtgc cagcgatccc atttgagctg gagggcgaga tcgagggcga gctcgacgag 7740 ccggtcgtcc ccggagagtt tcatgaccag catgaagggg acgagctgct tgccgaagga 7800 ccccatccag gtgtaggttt ccacatcgta ggtgaggaag agcctttcgg tgcgaggatg 7860 cgagccgatg gggaagaact ggatctcctg ccaccaattg gaggaatggc tgttgatgtg 7920 atggaagtag aaatgccgac ggcgcgccga acactcgtgc ttgtgtttat acaagcggcc 7980 acagtgctcg caacgctgca cgggatgcac gtgctgcacg agctgtacct gagttccttt 8040 gacgaggaat ttcagtggga agtggagtcg tggcgcctgc atctcgtgct gtactacgtc 8100 gtggtggtcg gcctggccct cttctgcctc gatggtggtc atgctgacga gcccccgcgg 8160 gaggcaggtc cagacctcgg cgcgagcggg tcggagagcg aagacgaagg cgcgcaggcc 8220 ggagctgtcc agggtcctga gacgctgcgg agtcaggtca gtgggcancg gcggcgcgcg 8280 gttgacttgc argagttttt ccagggcgcg cgggaggtcc anatggtact tgatctccac 8340 cgcgccattg gtggcgaact ccatggcttg cagggtcccg tgcccctggg gtgtgaccac 8400 cgtcccccgt ttcttcttgg gcggctgggg cgacgggggc ggtgcctctt ccatggttag 8460 aascggcggc gaagacgcgc gccgggcggc aggggcggct cggggcccgg atgcaggggc 8520 ggcaggggca cttcngcgcc gcgcgcgggt aggttctggt actgcgcccg gagaaaactg 8580 gcgtgagcga cgacgcgacg gttgacgtcc tggatctgac gcctctgggt gaaggccacg 8640 ggacccgtga gtttgaacct gaaagaaagt tcgacagaat caatctcggt atcgttgacg 8700 gcggcctgcc gcaagatctc ttgcacgtcc cccgagttgt cctggtatgc gatctcggtc 8760 atgaactgct cgatctcctc ctcttgaagg tctccgcggc cggcgcgctc cacggtggcc 8820 gcgaagtcgt tggagatgcg gcccatgagc tgcgagaagg cgttcatgcc cgcctcgttc 8880 cagacgcggc tgtagaccac gacgccctcg ggatcgcggg cgcgcatgac cacctgggcg 8940 aggttgagct ccacgtggcg cgtgaagacc gcgtagttgc agaggcgctg gtagaggtag 9000 ttgagcgtgg tggcgatgtg ctcggtgacr aagaaataca tgatccagcg gcggagcggc 9060 atctcgctga cgtcgcccag cgcctccaaa cgttccatgg cctcgtaaaa gtccacggcg 9120 aagttgaaaa actgggagtt gcgcgccgag acggtcaact cctcctccag aagacggatg 9180 agctcggcga tggtggcgcg cacctcgcgc tcgaaggccc ccgggagttc ctccacttcc 9240 tcttcttcct cctccactaa catctcttct acttcctcct caggcggcag tggtggcggg 9300 ggagggggcc tgcgtcgccg gcggcgcacg ggcagacggt cgatgaagcg ctcgatggtc 9360 tcgccgcgcc ggcgtcgcat ggtctcggtg acggcgcgcc cgtcctcgcg gggccgcakc 9420 gtgaagacgc cgccgcgcat ctccaggtgg ccgggggggt cccccgttgg gcagggagag 9480 ggcgctgacg atgcatctta tcaattgccc cgtagggact ccgcgcaagg acctgagcgt 9540 ctcgagatcc acgggatctg aaaaccgctg aacgaaggct tcgaagccag tcgcagtcgc 9600 aaggtakgct gagcacggtt tcttctggcg ggtcatgttg gttgggagcg gggcgggcga 9660 tgctgctggt gatgaagttg aaataggcgg ttctgagacg gcggatggtg gcgargagca 9720 ccaggtcttt gggcccggct tgctggatgc gcagacggtc ggccatgccc caggcgtggt 9780 cctgacacct ggccaggtcc ttgtagtagt cctgcatgag ccgctccaac gggcacctcc 9840 tcctcgcccg cgcggccgtg catgcgcgtg agcccgaagc cgcgctgggg ctggacgagc 9900 gccaggtcgg cgacgacgcg ctcggcgagg atggcttgct ggatctgggt gagggtggtc 9960 tggaagtcat caaagtcgac gaagcggtgg taggctccgg tgttgatggt ggaggagcag 10020 ttggccatga cggaccagtt gacggtctgg tggcccggac gcacgagctc gtggtacttg 10080 aggcgcgagt aggcgcgcgt gtcgaagatg tagtcgttgc aggtgcgcac caggtactgg 10140 tagccgatga ggaagtgcgg cggcggctgg cggtagagcg gccatcgctc ggtggcgggg 10200 gcgccgggcg cgaggtcctc gagcatggtg cggtggtagc cgtagatgta cctggacatc 10260 caggtgatgc cggcggcggt ggtggaggcg cgcgggaact cgcggacgcg ttccagatgt 10320 tgcgcagcgg caggaagtag ttcatggtgg gcacggtctg gcccgtgagg cgcgcgcagt 10380 cgtggatgct ctatacgggc aaaaacgaaa gcggtcagcg gctcgactcc gtggcctgga 10440 ggctaagcga acgggttggg ctgcgcgtgt accccggttc gaatctcgaa tcaggctgga 10500 gccgcagcta acgtggtatt ggcactcccg tctmgaccca agcbtgcacc aaccctccag 10560 gatacggagg cgggtcgttt tgcaactttt ttttggaggc cggatgagac tagtaagcgc 10620 ggaaagcggc cgaccgcgat ggctcgtctg ccgtagtctg gagaagaatc gccagggttg 10680 cgttgcggtg tgccccggtt cgaggccggc cggattccgc ggctaacgag ggcgtggctg 10740 ccccgtcgtt tccaagaccc catagccagc cgacttctcc agttacggag cgaggtcctc 10800 ttttgttttg tttgtttttg ccagatgcat cccgtactgc ggcagatgcg cccccaccac 10860 cctccaccgc aacaacagcc ccctccacag ccggcgcttc tgcccccgcc ccagcagcaa 10920 cttccagcca cgaccgccgc ggccgccgtg agcggggctg gacagagtta tgatcaccag 10980 ctggccttgg aagagggcga ggggctggcg cgcctggggg cgtcgtcgcc ggagcggcac 11040 ccgcgcgtgc agatgaaaag ggacgctcgc gaggcatacg tgcccaagca gaacctgttc 11100 agagacagga gcggcgagga gcccgaggag atgcgcgcgg cccggttcca cgcggggcgg 11160 gagctgcggc gcggcctgga ccgaaagagg gtgctgaggg acgaggattt cgaggcggac 11220 gagctgacgg ggatcagccc cgcgcgcgcg cacgtggccg cggccaacct ggtcacggcg 11280 tacgagcaga ccgtgaagga ggagagcaac ttccaaaaat ccttcaacaa ccacgtgcgc 11340 accctgatcg cgcgcgagga ggtgaccctg ggcctgatgc acctgtggga cctgctggag 11400 gccatcgtgc agaaccccac cagcaagccg ctgacggcgc agctgttcct ggtggtgcag 11460 catagtcggg acaacgaagc gttcagggag gcgctgctga atatcaccga gcccgagggc 11520 cgctggctcc tggacctggt gaacattctg cagagcatcg tggtgcagga gcgcgggctg 11580 ccgctgtccg agaagctggc ggccatcaac ttctcggtgc tgagtttggg caagtactac 11640 gctaggaaga tctacaagac cccgtacgtg cccatagaca aggaggtgaa gatcgacggg 11700 ttttacatgc gcatgaccct gaaagtgctg accctgaggg acgatctggg ggtgtaccgc 11760 aacgacagga tgcaccgtgc ggtgagcgcc agcaggcggc gcgagctgag cgaccaggag 11820 ctgatgcata gtctgcagcg ggccctgacc ggggccggga ccgaggggga gagctacttt 11880 gacatgggcg cggacctgca ctggcagccc agccgccggg ccttggaggc ggcggcagga 11940 ccctacgtag aagaggtgga cgatgaggtg gacgaggagg gcgagtacct ggaagactga 12000 tggcgcgacc gtatttttgc tagatgcaac aacaacagcc acctcctgat cccgcgatgc 12060 gggcggcgct gcagagccag ccgtccggca ttaactcctc ggacgattgg acccaggcca 12120 tgcaacgcat catggcgctg acgacccgca accccgaagc ctttagacag cagccccagg 12180 ccaaccggct ctcggccatc ctggaggccg tggtgccctc gggctccaac cccacgcacg 12240 agaaggtcct ggccatcgtg aacgcgctgg tggagaacaa ggccatccgc ggcgacgagg 12300 ccggcctggt gtacaacgcg ctgctggagc gcgtggcccg ctacaacagc accaacgtgc 12360 agaccaacct ggaccgcatg gtgaccgacg tgcgcgaggc cgtggcccag cgcgagcggt 12420 tccaccgcga gtccaacctg ggatccatgg tggcgctgaa cgccttcctc agcacccagc 12480 ccgccaacgt gccccggggc caggaggact acaccaactt catcagcgcc ctgcgcctga 12540 tggtgaccga ggtgccccag agcgaggtgt accagtccgg gccggactac ttcttccaga 12600 ccagtcgcca gggcttgcag accgtgaacc tgagccaggc tttcaagaac ttgcagggcc 12660 tgtggggcgt gcaggccccg gtcggggacc gcgcgacggt gtcgagcctg ctgacgccga 12720 actcgcgcct gctgctgctg ctggtggccc ccttcacgga cagcggcagc atcaaccgca 12780 actcgtacct gggctacctg attaacctgt accgcgaggc catcggccag gcgcacgtgg 12840 acgagcagac ctaccaggag atcacccacg tgagccgcgc cctgggccag gacgacccgg 12900 gcaacctgga agccaccctg aactttttgc tgaccaaccg gtcgcagaag atcccgcccc 12960 agtacgcgct cagcaccgag gaggagcgca tcctgcgtta cgtgcagcag aagcgtgggc 13020 ctgttcctga tgcaggaggg ggccaccccc agcgccgcgc tcgacatgac cgcgcgcaac 13080 atggtagccc atcatgtacg ccagcaaccg cccgttcatc aataaactga tggactactt 13140 gcatcgggcg gccgccatga actctgacta tttcaccaac gccatcctga atccccactg 13200 gctcccgccg ccggggttct acacgggcga gtacgacatg cccgacccca atgacgggtt 13260 cctgtgggac gatgtggaca gcagcgtgtt ctccccccga ccgggtgcta acgagcgccc 13320 cttgtggaag aaggaaggca gcgaccgacg cccgtcctcg gcgctgtccg gccgcgaggg 13380 tgctgccgcg gcgctgtccg aggccgccag tcctttcccg agcttgccct tctcgctgaa 13440 cagtatccgc agcagcgagc tgggcaggat cacgcgcccg cgcttgctgg gcgaagagga 13500 gtacttgaat gactcgctgt tgagacccga gcgggagaag aacttcccca ataacgggat 13560 agaaagcctg gtggacaaga tgagccgctg gaagacgtat gcgcaggagc acagggacga 13620 tccccgggcg tcgcaggggg ccacgagccg gggcagcgcc gcccgtaaac gccggtggca 13680 cgacaggcag cggggacaga tgtgggacga tgaggactcc gccgacgaca gcagcgtgtt 13740 ggacttgggt gggagtggta acccgttcgc tcacctgcgc ccccgtatcg ggcgcatgat 13800 gtaagagaaa ccgaaaataa atgatactca ccaaggccat ggcgaccagc gtgcgttcgt 13860 ttcttctctg ttgttgttgt atctagtatg atgaggcgtg cgtacccgga gggtcctcct 13920 ccctcgtacg agagcgtgat gcagcaggcg atggcggcgg cggcgatgca gcccccgctg 13980 gaggctcctt acgtgccccc gcggtacctg gcgcctacgg aggggcggaa cagcattcgt 14040 tactcggagc tggcaccctt gtacgatacc acccggttgt acctggtgga caacaagtcg 14100 gcggacatcg cctcgctgaa ctaccagaac gaccacagca acttcctgac caccgtggtg 14160 cagaacaatg acttcacccc cacggaggcc agcacccaga ccatcaactt tgacgagcgc 14220 tcgcggtggg gcggccagct gaaaaccatc atgcacacca acatgcccaa cgtgaacgag 14280 ttcatgtaca gcaacaagtt caaggcgcgg gtgatggtct cccgcaagac ccccaatggg 14340 gtgacagtga cagaggatta tgatggtagt caggatgagc tgaagtatga atgggtggaa 14400 tttgagctgc ccgaaggcaa cttctcggtg accatgacca tcgacctgat gaacaacgcc 14460 atcatcgaca attacttggc ggtggggcgg cagaacgggg tgctggagag cgacatcggc 14520 gtgaagttcg acactaggaa cttcaggctg ggctgggacc ccgtgaccga gctggtcatg 14580 cccggggtgt acaccaacga ggctttccat cccgatattg tcttgctgcc cggctgcggg 14640 gtggacttca ccgagagccg cctcagcaac ctgctgggca ttcgcaagag gcagcccttc 14700 caggaaggct tccagatcat gtacgaggat ctggaggggg gcaacatccc cgcgctcctg 14760 gatgtcgacg cctatgagaa aagcaaggag gatgcagcag ctgaagcaac tgcagccgta 14820 gctaccgcct ctaccgaggt caggggcgat aattttgcaa gcgccgcagc agtggcagcg 14880 gccgaggcgg ctgaaaccga aagtaagata gtcattcagc cggtggagaa ggatagcaag 14940 aacaggagct acaacgtact accggacaag ataaacaccg cctaccgcag ctggtaccta 15000 gcctacaact atggcgaccc cgagaagggc gtgcgctcct ggacgctgct caccacctcg 15060 gacgtcacct gcggcgtgga gcaagtctac tggtcgctgc ccgacatgat gcaagacccg 15120 gtcaccttcc gctccacgcg tcaagttagc aactacccgg tggtgggcgc cgagctcctg 15180 cccgtctact ccaagagctt cttcaacgag caggccgtct actcgcagca gctgcgcgcc 15240 ttcacctcgc ttacgcacgt cttcaaccgc ttccccgaga accagatcct cgtccgcccg 15300 cccgcgccca ccattaccac cgtcagtgaa aacgttcctg ctctcacaga tcacgggacc 15360 ctgccgctgc gcagcagtat ccggggagtc cagcgcgtga ccgttactga cgccagacgc 15420 cgcacctgcc cctacgtcta caaggccctg ggcatagtcg cgccgcgcgt cctctcgagc 15480 cgcaccttct aaatgtccat tctcatctcg cccagtaata acaccggttg gggcctgcgc 15540 gcgcccagca agatgtacgg aggcgctcgc caacgctcca cgcaacaccc cgtgcgcgtg 15600 cgcgggcact tccgcgctcc ctggggcgcc ctcaaaggcc gcgtgcggtc gcgcaccacc 15660 gtcgacgacg tgatcgacca ggtggtggcs gacgcgcgca amtacacccc cgccgcggcg 15720 cctgtttcca cygtggacgc cgtcatcgac agcgtggtgg cggacgcgcg ccggtacgcc 15780 cgcgccaaga gccggcggcg gcgcatcgcc cggcggcacc ggagcacccc cgccatgcgc. 15840 gcggcgcgga gccttgttgc gcagggccag gcgcacggga cgcagggcca tgttcagggc 15900 ggccagacgc gcggcttcag gcgccagcgc cggcaggacc cggagacgcg cggccacggc 15960 ggcggcagcg gccatcgcca gcatgtcccg cccgcggcga gggaacgtgt actgggtgcg 16020 cgacgccgcc accggtgtgc gcgtgcccgt gcgcacccgc ccccctcgca cttgaagatg 16080 ttcacttcgc gatgttgatg tgtcccagcg gcgaggagga tgtccaagcg caaattcaag 16140 gaagagatgc tccaggtcat cgcgcctgag atatacggcc ctgcggtggt gaaggaggaa 16200 agaaagcccc gcaaaatcaa gcgggtcaaa aaggacaaaa aggaagaaga aagtgatgtg 16260 gacggattgg tggagtttgt gcgcgagttc gccccccggc ggcgcgtgca gtggcgcggg 16320 cggaaggtgc aaccggtgct gagacccggc accaccgtgg tcttcacgcc cggcgagcgc 16380 tccggcaccg cttccaagcg ctcctacgac gaggtgtacg gggatgatga tattctggag 16440 caggcggccg akcgcctggg cgagtttgct tacggcaagc gcagccgttc cgcaccgaag 16500 gaagaggcgg tgtccatccc gctggaccac ggcaacccca cgccgagcct caagcccgtg 16560 accttgcagc aggtgctgcc gaccgcggcg ccgcgccggg ggttcaagcg cgagggcgag 16620 gatctgtacc ccaccatgca gctgatggtg cccaagcgcc agaaghtgga agacgtgctg 16680 gagaccatga aggtggaccc ggacgtgcag cccgaggtca aggtgcggcc catcaagcag 16740 gtggccccgg gcntgggcgt gcagaccgtg gacatcwaga ttcccacgga gcccatggaa 16800 acgcagaccg agcccatgat caagcccagc accagcacca tggaggtgca gacggatccc 16860 tggatgccat cggctcctag tcgaagaccc cggcgcaagt acggcgcggc cagcctgctg 16920 atgcccaact acgcgctgca tccttccatc atccccacgc cgggctactg cggcacgcgc 16980 ttctaccgcg gtcataccag cagccgccgc cgcaagacca ccactcgccg ctcgccgtcg 17040 ccgcaccgcc gctgcaacca cccctgccgc cctggtgcgg agagtgtacc gccgcggccg 17100 cgcacctctg accctgccgc gcgcgcgcta ccacccgagc atcgccattt aaactttcgc 17160 cagctttgca gatcaatggc cctcacatga ccgccttcgc gttcccatta cgggctaccg 17220 aggaagaaaa ccgcgccgta gaaggctggc ggggaacggg atgcgtcgcc accaccaccg 17280 gcggcggcgc gccatcagca agcggttggg gggaggcttc ctgCCCgcgc tgatccccat 17340 catcgccgcg gcgatcgggg cgatccccgg cattgcttcc gtggcggtgc aggcctctca 17400 gcgccactga gacacacttg gaaacatctt gtaatagacc ratggactct gacgctcctg 17460 gtcctgtgat gtgttttcgt agacagatgg aagacatcaa tttttcgtcc ctggctccgc 17520 gacacggcac gcggccgttc atgggcacct ggagcgacat cggcaccagc caactgaacg 17580 ggggcgcctt caattggagc agtctctgga gcgggcttaa gaatttcggg tccacgctta 17640 aaacctatgg cagcaaggcg tggaacagca ccacagggca ggcgctgagg gataagctga 17700 aagagcagaa cttccagcag aaggtggtcg atgggctcgc ctcgggcatc aacggggtgg 17760 tggacctggc caaccaggcc gtgcagcggc agatcaacag ccgcctggac ccggtgccgc 17820 ccgccggctc cgtggagatg ccgcaggtgg aggaggagct gcctcccctg gacaagcggg 17880 gcgagaagcg accccgcccc gatgcggagg agacgctgct gacgcacacg gacgagccgc 17940 ccccgtacga ggaggcggtg aaactgggtc tgcccaccac gcggcccatc gcgcccctgg 18000 ccaccggggt gctgaaaccc gaaaagcccg cgaccctgga cttgcytcct ccccagcctt 18060 cccgcccatv tacagtggct aagcccctgc cgccggtggc cgtggcccgc gcgcgacccg 18120 ggggcaccgc ccgccctcat gcgaactggc agagcactct gaacagcatc gtgggtctgg 18180 gagtgcagag tgtgaagcgc cgccgctgmt attaaaccta ccgtagcgct taacttgctt 18240 gtctgtgtgt gtatgtatta tgtcgccgcc gcygctgtcc accagaagga ggagtgaaga 18300 ggggcggtgc cgagttgcra gatggccacc ccatcgatgc tgccccagtg ggcgtacatg 18360 cacatcgccg gacaggacgc ttcggagtac ctgagtccgg gtctggtgaa gtttgcccgc 18420 gccacagaca cctacttcag tctggggaac aagtttagga accccacggt ggcgcccacg 18480 caygatgtga ccaccgaccg cagccagcgg ctgacgctgc gcttcgtgcc cgtggaccgc 18540 gaggacaaca cctacttgta caaagtgcgc tacacgctgg ccgtgggcga caaccgcgtg 18600 ctggacatgg ccagcaccta ctttgacatc cgcggcgtgc tggatcgggg ccctagcttc 18660 aaaccctact ccggcaccgc ctacaacagt ctggccccca agggagcacc caacacttgt 18720 cagtggacat ataaagccga tggtgaaact gccacagaaa aaacctatac atatggaaat 18780 gcacccgtgc agggcattaa catcacaaaa gatggtattc aacttggaac tgacaccgat 18840 gatcagccaa tctacgcaga taaaacctat cagcctgaac ctcaagtggg tgatgctgaa 18900 tggcatgaca tcactggtac tgatgaaaag tatggaggca gagctcttaa gcctgatacc 18960 aaaatgaagc cttgttatgg ttcttttgcc aagcctacta ataaagaagg aggtcaggca 19020 aatgtgaaaa caggaacagg cactactaaa gaatatgaca tagacatggc tttctttgac 19080 aacagaagtg cggctgctgc tggcctagct ccagaaattg ttttgtatac tgaaaatgtg 19140 gatttggaaa ctgcagatac ccatattgta tacaaagcag gcacagatga cagcagctct 19200 tctattaatt tgggtcagca agccatgccc aacagaccta actacattgg tttcagagac 19260 aactttatcg ggctcatgta ctacaacagc actggcaata tgggggtgct ggccggtcag 19320 gcttctcagc tgaatgctgt ggttgacttg caagacagaa acaccgagct gtcctaccag 19380 ctcttgcttg actctctggg tgacagaacc cggtatttca gtatgtggaa tcaggcggtg 19440 gacagctatg atcctgatgt gcgcattatt gaaaatcatg gtgtggagga tgaacttccc 19500 aactattgtt tccctctgga tgctgttggc agaacagata cttatcaggg aattaaggct 19560 aatggaactg atcaaaccac atggaccaaa gatgacagtg tcaatgatgc taatgagata 19620 ggcaagggta atccattcgc catggaaatc aacatccaag ccaacctgtg gaggaacttc 19680 ctctacgcca acgtggccct gtacctgccc gactcttaca agtacacgcc ggccaatgtt 19740 accctgccca ccaacaccaa cacctacgat tacatgaacg gccgggtggt ggcgccctcg 19800 ctggtggact chtacatcaa catcggggcg cgctggtcgc tggatcccat ggacaacgtg 19860 aaccccttca accaccaccg caatggggcg ctgcgctacc gctccatgct cctgggcaac 19920 gggcgctacg tgcccttcca catccaggtg ccccagaaat ttttcgccat caagagcctc 19980 ctgctcctgc ccgggtccta cacctacgag tggaacttcc gcaaggacgt caacatgatc 20040 ctgcagagct ccctcggcaa cgacctgcgc acggacgggg cctccatctc cttcaccagc 20100 atcaacctct acgccacctt cttccccatg gcgcacaaca cggcctccac gctcgaggcc 20160 atgctgcgca acgacaccaa cgaccagtcc ttcaacgact acctctcggc ggccaacatg 20220 ctctacccca tcccggccaa cgccaccaac gtgcccatct ccatcccctc gcgcaactgg 20280 gccgccttcc gcggctggtc cttcacgcgt ctcaagacca aggagacgcc ctcgctgggc 20340 tccgggttcg acccctactt cgtctactcg ggctccatcc cctacctcga cggcaccttc 20400 tacctcaacc acaccttcaa gaaggtctcc atcaccttcg actcctccgt cagctggccc 20460 ggcaacgacc ggctcctgac gcccaacgag ttcgaaatca agcgcaccgt cgacggcgag 20520 ggctacaacg tggcccagtg caacatgacc aaggactggt tcctggtcca gatgctggcc 20580 cactacaaca tcggctacca gggcttctac gtgcccgagg gctacaagga ccgcatgtac 20640 tccttcttcc gcaacttcca gcccatgagc cgccaggtgg tggacgaggt caactacaag 20700 gactaccagg ccgtcaccct ggcctaccag cacaacaact cgggcttcgt cggctacctc 20760 gcgcccacca tgcgccaggg ccagccmtac cccgccaamt acccmtcccc gctcatcggc 20820 aagagcgccg tcaccagcgt cacccagaaa aagttcctct gcgacagggt catgtggcgc 20880 atccccttct ccagcaactt catgtccatg ggcgcgctca ccgacctcgg ccagaacatg 20940 ctctatgcca actccgccca cgcgctagac atgaatttcg aagtcgaccc catggatgag 21000 tccacccttc tctatgttgt cttcgaagtc ttcgacgtcg tccgagtgca ccagccccac 21060 cgcggcgtca tcgaggccgt ctacmtgcgc acccccttct cggccggtaa cgccaccacc 21120 taagctcttg cttcttgcaa gccatggccg cgggctccgg cgagcaggag ctcagggcca 21180 tcatccgcga cctgggctgc gggccmtact tcctgggcac sttcgataag cgcttcccgg 21240 gattcatggc cccgcacaag ctggcctgcg ccatcgtcaa cacggccggc cgcgagaccg 21300 ggggcgagca ctggctggcc ttcgcctgaa cccgcgctcg aacacctgct acctcttcga 21360 ccccttcggg ttctcggacg agcgcctcaa gcagatctac cagttcgagt acgagggcct 21420 gctgcgccgc agcgccctgg ccaccgagga ccgctgcgtc accctggaaa agtccaccca 21480 gaccgtgcag ggtccgcgct cggccgcctg cgggctcttc tgctgcatgt tcctgcacgc 21540 cttcgtgcac tggcccgacc gccccatgga caagaacccc accatgaact tgctgaaggg 21600 ggtgcccaac ggcatgctcc agtcgcccca ggtggaaccc accctgcgcc gcaaccagga 21660 ggcgctytac cgcttcctca actcccactc cgcmtacttt cgctcccacc gcgcgcgcat 21720 cgagaaggcc accgccttcg accgcatgaa tcaagacatg taaaccgtgt gtgtatgtta 21780 aatgtcttta ataaacagca ctttcatgtt acacatgcat ctgagatgat ttatttagaa 21840 atcsaaaggg ttcttccggg tctcggcatg gcccgcgggc agggacacgt tgcggaactg 21900 gtacttggcc agccacttga actcggggat cagcagtttg ggcagcgggg tgtcggggaa 21960 ggagtcggtc cacagcttcc gcgtcagttg cagggcgccc agcaggtcgg gcgcggagat 22020 cttgaaatcg cagttgggac ccgcgttctg cgcgcgggag ttgcggtaca cggggttgca 22080 gcactggaac accatcaggg ccgggtgctt cacgctcgcc agcaccgtcg cgtcggtgat 22140 gctctccacg tcgaggtcct cggcgttggC catcccgaag ggggtcatct tgcaggtctg 22200 ccttcccatg gtgggcacgc acccgggctt gtggttgcaa tcgcagtgca gggggatcag 22260 catcatctgg gcctggtcgg cgttcatccc cgggtacatg gccttcatga aagcctccaa 22320 ttgcctgaac gcctgctggg ccttggctcc ctcggtgaag aagaccccgc aggacttgct 22380 agagaactgg ttggtggcgc acccggcgtc gtgcacgcag cagcgcgcgt cgttgttggc 22440 cagctgcacc acgctgcgcc cccagcggtt ctgggtgatc ttggcccggt cggggttctc 22500 cttcagcgcg cgctgcccgt tctcgctcgc cacatccatc tcgatcatgt gctccttctg 22560 gatcatggtg gtcccgtgca ggcaccgcag cttgccctcg gcctcggtgc acccgtgcag 22620 ccacagcgcg cacccggtgc actcccagtt cttgtgggcg atctgggaat gcgcgtgcac 22680 gaagccctgc aggaagcggc ccatcatggt ggtcagggtc ttgttgctag tgaaggtcag 22740 cggaatgccg cggtgctcct cgttgatgta caggtggcag atgcggcggt acacctcgcc 22800 ctgctcgggc atcagctgga agttggcttt caggtcggtc tccacgcggt agcggtccat 22860 cagcatagtc atgatttcca tacccttctc ccaggccgag acgatgggca ggctcatagg 22920 gttcttcacc atcatcttag cgctagcagc cgcggccagg gggtcgctct cgtccagggt 22980 ctcaaagctc cgcttgccgt ccttctcggt gatccgcacc ggggggtagc tgaagcccac 23040 ggccgccagc tcctcctcgg cctgtctttc gtcctcgctg tcctggctga cgtcctgcag 23100 gaccacatgc ttggtcttgc ggggtttctt cttgggcggc agcggcggcg gagatgttgg 23160 agatggcgag ggggagcgcg agttctcgct caccactact atctcttcct cttcttggtc 23220 cgaggccacg cggcggtagg tatgtctctt cgggggcaga ggcggaggcg acgggctctc 23280 gccgccgcga cttggcggat ggctggcaga gccccttccg cgttcggggg tgcgctcccg 23340 gcggcgctct gactgacttc ctccgcggcc ggccattgtg ttctcctagg gaggaacaac 23400 aagcatggag actcagccat cgccaacctc gccatctgcc cccaccgccg acgagaagca 23460 gcagcagcag aatgaaagct taaccgcccc gccgcccagc cccgccacct ccgacgcggc 23520 cgtcccagac atgcaagaga tggaggaatc catcgagatt gacctgggct atgtgacgcc 23580 cgcggagcac gaggaggagc tggcagtgcg cttttcacaa gaagagatac accaagaaca 23640 gccagagcag gaagcagaga atgagcagag tcaggctggg ctcgagcatg acggcgacta 23700 cctccacctg agcggggggg aggacgcgct catcaagcat ctggcccggc aggccaccat 23760 cgtcaaggat gcgctgctcg accgcaccga ggtgcccctc agcgtggagg agctcagccg 23820 cgcctacgag ttgaacctct tctcgccgcg cgtgcccccc aagcgccagc ccaatggcac 23880 ctgcgagccc aacccgcgcc tcaacttcta cccggtcttc gcggtgcccg aggccctggc 23940 cacctaccac atctttttca agaaccaaaa gatccccgtc tcctgccgcg ccaaccgcac 24000 ccgcgccgac gcccttttca acctgggtcc cggcgcccgc ctacctgata tcgcctcctt 24060 ggaagaggtt cccaagatct tcgagggtct gggcagcgac gagactcggg ccgcgaacgc 24120 tctgcaagga gaaggaggag agcatgagca ccacagcgcc ctggtcgagt tggaaggcga 24180 caacgcgcgg ctggcggtgc tcaaacgcac ggtcgagctg acccatttcg cgtacccggc 24240 tctgaacctg ccccccaaag tcatgagcgc ggtcatggac caggtgctca tcaagcgcgc 24300 gtcgcccatc tccgaggacg agggcatgca agactccgag gagggcaagc ccgtggtcag 24360 cgacgagcag ctggcccggt ggctgggtcc taatgctagt ccccagagtt tggaagagcg 24420 gcgcaaactc atgatggccg tggtcctggt gaccgtggag ctggagtgcc tgcgccgctt 24480 cttcgccgac gcggagaccc tgcgcaaggt cgaggagaac ctgcactacc tcttcaggca 24540 cgggttcgtg cgccaggcct gcaagatctc caacgtggag ctgaccaacc tggtctcgta 24600 catgggcatc ttgcacgaga accgcctggg gcagaacgtg ctgcacacca ccctgcgcgg 24660 ggaggcccgg cgcgactaca tccgcgactg cgtctacctc tacctctgcc acacctggca 24720 gacgggcatg ggcgtgtggc agcagtgtct ggaggagcag aacctgaaag agctctgcaa 24780 gctcctgcag aagaactcaa gggtctgtgg accgggttcg acgagcgcac caccgcctcg 24840 gacctggccg acctcatttt ccccgagcgc ctcaggctga cgctgcgcaa cggcctgccc 24900 gactttatga gccaaagcat gttgcaaaac tttcgctctt tcatcctcga acgctccgga 24960 atcctgcccg ccacctgctc cgggctgccc tcggacttcg tgccgctgac cttccgcgag 25020 tgccccccgc cgctgtggag ccactgctac ctgctgcgcc tggccaacta cctggcctac 25080 cactcggacg tgattgagga cgtcagcggc gagggcctgc tcgagtgcca ctgccgctgc 25140 aacctctgca cgCcgcaccg ctccctggcc tgcaaccccc agctgytgag cgagacccag 25200 atcatcggca ccttcgagtt gcaagggccc agcgaaggcg agggttcagc cgccaagggg 25260 ggtctgaaac tcaccccggg gctgtggacc tcggcctact tgcgcaagtt cgtgcccgag 25320 gactaccatc ccttcgagat caggttctac gaggaccaat cccatccgcc caaggccgag 25380 ctgtcggcct gcgtcatcac ccagggggcg atcctggccc aattgcaagc catccagaaa 25440 tcccgccaag aattcttgct gaaaaagggc cgcggggtct acctcgaccc ccagaccggt 25500 gaggagCtCa accccggctt cccccaggat gccccgagga aacaagaagc tgaaagtgga 25560 gctgccgccc gtggaggatt tggaggaaga ctgggagaac agcagtcagg cagaggagga 25620 ggagatggag gaagactggg acagcactca ggcagaggag gacagcctgc aagacagtct 25680 ggaggaagac gaggaggagg cagaggagga ggtggaagaa gcagccgccg ccagaccgtc 25740 gtcctcggcg ggggagaaag caagcagcac ggataccatc tccgctccgg gtcggggtcc 25800 cgctcgacca cacagtagat gggacgagac cggacgattc ccgaacccca ccacccagac 25860 cggtaagaag gagcggcagg gatacaagtc ctggcggggg cacaaaaacg ccatcgtctc 25920 ctgcttgcag gcctgcgggg gcaacatctc cttcacccgg cgctacctgc tcttccaccg 25980 cggggtgaac tttccccgca acatcttgca ttactaccgt cacctccaca gcccctacta 26040 cttccaagaa gaggcagcag cagcagaaaa agaccagcag aaaaccagca gctagaaaat 26100 ccacagcggc ggcagcaggt ggactgagga tcgcggcgaa cgagccggcg caaacccggg 26160 agctgaggaa ccggatcttt cccaccctct atgccatctt ccagcagagt cgggggcagg 26220 agcaggaact gaaagtcaag aaccgttctc tgcgctcgct cacccgcagt tgtctgtatc 26280 acaagagcga agaccaactt cagcgcactc tcgaggacgc cgaggctctc ttcaacaagt 26340 actgcgcgct cactcttaaa gagtagcccg cgcccgccca gtcgcagaaa aaggcgggaa 26400 ttacgtcacc tgtgcccttc gccctagccg cctccaccca tcatcatgag caaagagatt 26460 cccacgcctt acatgtggag ctaccagccc cagatgggcc tggccgccgg tgccgcccag 26520 gactactcca cccgcatgaa ttggctcagc gccgggcccg cgatgatctc acgggtgaat 26580 gacatccgcg cccaccgaaa ccagatactc ctagaacagt cagcgctcac cgccacgccc 26640 cgcaatcacc taaatccgcg taattggccc gccgccctgg tgtaccagga aattccccag 26700 cccacgaccg tactacttcc gcgagacgcc caggccgaag tccagctgac taactcaggt 26760 gtccagctgg cgggcggcgc caccctgtgt cgtcaccgcc ccgctcaggg tataaagcgg 26820 ctggtgatcc ggggcagaag cacacagctc aacgacgaag tggtgagctc ttcgctgggt 26880 ctgcgacctg acggagtctt ccaactcgcc ggatcgggga gatcttcctt cacgcctcgt 26940 caggccgtcc tgactttgga gagttcgtcc tcgcagcccc gctcgggtgg catcggcact 27000 ctccagttcg tggaggagtt cactccctcg gtctacttca accccttctc cggctccccc 27060 ggccactacc cggacgagtt catcccgaac ttcgacgcca tcagcgagtc ggtggacggc 27120 tacgattgaa tgtcccatgg tggcgcagct gacctagctc ggcttcgaca cctggaccac 27180 tgccgccgct tccgctgctt cgctcgggat ctcgccgagt ttgcctactt tgagctgccc 27240 gaggagcacc ctcagggccc ggcccacgga gtgcggatcg tcgtcgaagg gggcctcgac 27300 tcccacctgc ttcggatctt cagccagcgt ccgatcctgg tcgagcgcga gcaaggacag 27360 acccttctga ctctgtactg catctgcaac caccccggcc tgcatgaaag tctttgttgt 27420 ctgctgtgta ctgagtataa taaaagctga gacagcgact actccggact tccgtttgtt 27480 cctgaatcca tcaaccagtc tttgttcttc accgggaacg agaccgagct ccagctccag 27540 tgtaagcccc acaagaagta cctcacctgg ctgttccagg gctccccgat cgccgttgtc 27600 aaccactgcg acaacgacgg agtcctgstg agcggccctg ccaaccwtac tttttccacc 27660 cgcagaagca agctccagct sttccaaccc ttcctccccg ggacctatca gtgcgtctcg 27720 ggaccctgcc atcacacctt ccacctgatc ccgaatacca cagcgtcgct ccccgmtact 27780 aacaaccaaa ctaacctcca ccaacgccac cgtcgcgacc tttctgaatc taatactacc 27840 acccacaccg gaggtgagct ccgaggtcaa ccaacctctg ggatttacta cggcccctgg 27900 gaggtggttg ggttaataac gctaggccta gttgcgggtg ggcttttggt tctctgctac 27960 ctatacctcc cttgctgttc gtacttagtg gtgctgtgtt gctggtttaa gaaatgggga 28020 agatcaccct agtgagctgc ggtgcgctgg tggcggtgtt gctttcgatt gtgggactgg 28080 gcggtgcggc tgtantgaaa gagaaggccg atccctgctt gcatttcaat cccaacaaat 28140 gccagctgag ttttcagccc gatggcaatc ggtgtgcggt actgatcaag tgcggatggg 28200 aatgcgagaa cgtgagaatc gagtacaata acaagactcg gaacaatact ctcgcgtccg 28260 tgtggcagcc cggggacccc gagtggtaca ccgtctctgt ccccggtgct gacggctccc 28320 cgcgcaccgt gaataatact ttcatttttg cgcacatgtg cgacacggtc atgtggatga 28380 gcaagcagta cgatatgtgg ccccccacga aggagaacat cgtggtcttc tccatcgctt 28440 acagcctgtg cacggcgcta atcaccgcta tcgtgtgcct gagcattcac atgctcatcg 28500 ctattcgccc cagaaataat gccgaaaaag aaaaacagcc ataacgtttt ttttcacacc 28560 tttttcagac catggcctct gttaaatttt tgcttttatt tgccagtctc attgccgtca 28620 ttcatggaat gagtaatgag aaaattacta tttacactgg cactaatcac acattgaaag 28680 gtccagaaaa agccacagaa gtttcatggt attgttattt taatgaatca gatgtatcta 28740 ctgaactctg tggaaacaat aacaaaaaaa atgagagcat tactctcatc aagtttcaat 28800 gtggatctga cttaacccta attaacatca ctagagacta tgtaggtatg tattatggaa 28860 ctacagcagg catttcggac atggaatttt atcaagtttc tgtgtctgaa cccaccacgc 28920 ctagaatgac cacaaccaca aaaactacac ctgttaccac tatgcagctc actaccaata 28980 acatttttgc catgcgtcaa atggtcaaca atagcactca acccacccca cccagtgagg 29040 aaattcccaa atccatgatt ggcattattg ttgctgtagt ggtgtgcatg ttgatcatcg 29100 ccttgtgcat ggtgtactat gccttctgct acagaaagca cagactgaac gacaagctgg 29160 aacacttact aagtgttgaa ttttaatttt ttagaaccat gaagatccta ggccttttaa 29220 ttttttctat cattacctct gctctatgca attctgacaa tgaggacgtt actgtcgttg 29280 tcggatcaaa ttatacactg aaaggtccag cgaagggtat gctttcgtgg tattgctatt 29340 ttggatctga cactacagaa actgaattat gcnatcttaa gaatggcaaa attcaaaatt 29400 cttaaaatta acaattatat atgcaatggt actgatctaa tactcctcaa tatcacgaaa 29460 tcatatgstg gcagttacac ctgccctgga gatgatgctg acagtatgat tttttacaaa 29520 gtaactgttg ttgatcccat actccacctc cacccaccac aattactcac accacacaca 29580 cagatcaaac cgcagcagag gaggcagcaa agttagcctt gcaggtccaa gacagttcat 29640 ttgttggcat tacccctaca catgatcagc ggtgtccggg gctgctagtc agcggcattg 29700 tcggtgtgct ttcgggatta gcagtcataa tcatctgcat gttcattttt gcttgctgct 29760 atagaaggct ttaccgacaa aaatcagacc cactgctgaa cctctatgtt taattttttc 29820 cagagtcatg aaggcagtta gcgctctagt tttttgttct wtgattggca ttgttttttg 29880 caatcctatt cctaaagtta gctttattaa agatgtgaat gttactgagg ggggcaatgt 29940 gacactggta ggtgtagagg gtgctgaaaa caccacctgg acaaaatacc acctcaatgg 30000 gtggaaagat atttgcaatt ggagtgtatt agtttataca tgtgagggag ttaatcttac 30060 cattgtcaat gccacctcag ctcaaaatgg tagaattcaa ggacaaagtg tcagtgtatc 30120 taatgggtat tttacccaac atacttttat ctatgacgtt aaagtcatac cactgcwtac 30180 gcttagccca cttagcatta ccacacagac aacccacatt acacagacaa ccacatacag 30240 tacattaaat cagcbtacca ccactacagc agcagaggtt gccagctcgt ctggggtccg 30300 agtggcattt ttgatgtggg ccccatmtag cagtcccact gctagtacca atgagcagac 30360 tactgaattt ttgtccactg tcgagagcca caccacagct acctccagtg ccttctctag 30420 caccgccaat ctctcctcgc tttcctntac accaatcagt cccgytaata ctcctagccc 30480 cgtcctcttc ccactcccct gaagcaaaca gacggcggca tgcaatggca gatcaccctg 30540 ctcattgtga tcgggttggt catcctggcc gtgttgctct actacatctt ctgccgccgc 30600 attcccaacg cgcaccgcaa gccggtatac aagcccatca ttgtcgggca gccggagccg 30660 cttcaggtgg aagggggtct aaggaatctt ctcttctctt ttacagtatg gtgattgaac 30720 tatgattcct agacaattct tgatcactat tcttatctgc ctcctccaag tctgtgccac 30780 cctcgctctg gtggccaacg ccagtccaga ctgtattggg cccttcgcct cctacgtgct 30840 ctttgccttc accacctgca tctgctgctg tagcatagtc tgcctgctta tcaccttctt 30900 ccagttcatt gactggatct ttgtgcgcat cgcmtacctg cgccaccacc cccagtaccg 30960 cgaccagcga gtggcgcggc tgctcaggct cctctgataa gcatgcgggc tgtgntactt 31020 ctcgcgcttc tgctgttagt gctcccccgt cccgtcgacc cccggtcccc cacccagtcc 31080 cccgaggagg tccgcaaatg caaattccaa gaaccctgga aattcctcaa atgctaccgc 31140 caaaaatcag acatgcatcc cagctggatc atgatcattg ggatcgtgaa cattctggcc 31200 tgcaccctca tctcctttgt gatttacccc tgctttgact ttggttggaa ctcgccagag 31260 gcgctctatc tcccgcctga acctgacaca ccaccacagc aacctcaggc acacgcacta 31320 ccaccactac agcctaggcc acaatacatg cccatattag actatgaggc cgagccacag 31380 cgacccatgc tccccgctat tagttacttc aatctaaccg gcggagatga ctgacccact 31440 ggccaacaac aacgtcaacg accttctcct ggacatggac ggccgcgcct cggagcagcg 31500 actcgcccaa cttcgcattc gccagcagca ggagagagcc gtcaaggagc tgcaggatgc 31560 ggtggccatc caccagtgca agagaggcat cttctgcctg gtgaaacagg ccaagatctc 31620 ctacgaggtc actccaaacg accatcgcct ctcctacgag ctcctgcagc agcgccagaa 31680 gttcaccttc ctggtcggag tcaaccccat cgtcatcacc cagcagtctg gcgataccaa 31740 ggggtccatc cactgctcct gcgactcccc cgactgcgtc cacactctga tcaagaccct 31800 stgcggcctc cgcgacctcc tccccatgaa ctaatcaccc ccttatccag tgaaataaag 31860 atcatattga tgatgatttt acagaaataa aaaataatca tttgatttga aataaagata 31920 caatcatatt gatgatttga gtttaacaaa aaaataaaga atcacttact tgaaatctga 31980 taccaggtct ctgtccatgt tttctgccaa caccacttca ctcccctctt cccagctctg 32040 gtactgcagg ccccggcggg ctgcaaactt cctccacacg ctgaagggga tgtcaaattc 32100 ctcctgtccc tcaatcttca ttttatcttc tatcagatgt ccaaaaagcg cgtccgggtg 32160 gatgatgact tcgaccccgt ctacccctac gatgcagaca acgcaccgac cgtgcccttc 32220 atcaaccccc ccttcgtctc ttcagatgga ttccaagaga agcccctggg ggtgttgtcc 32280 ctgcgactgg ccgaccccgt caccaccaag aacggggaaa taaccctcaa gctgggagag 32340 ggggtggacc tcgattcctc gggaaaactc atctccaaca cggccaccaa ggccgccgcc 32400 cctctcagtt tttccaacaa caccatttcc cttaacatgg atcacccctt ttacactaaa 32460 gatggaaaat tatccttaca agtttctcca ccattaaata tactgagaac aagcattcta 32520 aacacactag ctttaggttt tggatcaggt ttaggactcc gtggctctgc cttggcagta 32580 cagttagtct ctccacttac atttgatact gatggaaaca taaagcttac cttagacaga 32640 ggtttgcatg ttacaacagg agatgcaatt gaaagcaaca taagctgggc taaaggttta 32700 aaatttgaag atggagccat agcaaccaac attggaaatg ggttagagtt tggaagcagt 32760 agtacagaaa caggtgttga tgatgcttac ccaatccaag ttaaacttgg atctggcctt 32820 agctttgaca gtacaggagc cataatggct ggtaacaaag aagacgataa actcactttg 32880 tggacaacac ctgatccatc accaaactgt caaatactcg cagaaaatga tgcaaaacta 32940 acactttgct tgactaaatg tggtagtcaa atactggcca ctgtgtcagt cttagttgta 33000 ggaagtggaa acctaaaccc cattactggc accgtaagca gtgctcaggt gtttctacgt 33060 tttgatgcaa acggtgttct tttaacagaa cattctacac taaaaaaata ctgggggtat 33120 aggcagggag atagcataga tggcactcca tataccaatg ctgtaggatt catgcccaat 33180 ttaaaagctt atccaaagtc acaaagttct actactaaaa ataatatagt agggcaagta 33240 tacatgaatg gagatgtttc aaaacctatg cttctcacta taaccctcaa tggtactgat 33300 gacagcaaca gtacatattc aatgtcattt tcatacacct ggactaatgg aagctatgtt 33360 ggagcaacat ttggggctaa ctcttatacc ttctcataca tcgcccaaga atgaacactg 33420 tatcccaccc tgcatgccaa cccttcccac cccactctgt ggaacaaact ctgaaacaca 33480 aaataaaata aagttcaagt gttttattga ttcaacagtt ttacaggatt cgagcagtta 33540 tttttcctcc accctcccag gacatggaat acaccaccct ctccccccgc acagccttga 33600 acatctgaat gccattggtg atggacatgc ttttggtctc cacgttccac acagtttcag 33660 atggagccag tctcgggtcg gtcagggaga tgaaaccctc cgggcactcc cgcatctgca 33720 cctcacaggt caacagctga ggattgtcct cggtggtcgg gatcacggtt atctggaaga 33780 agcagaagag cggcggtggg aatcatagtc cgcgaacggg atcggccggt ggtgtcgcat 33840 caggccccgc agcagtcgct gccgccgccg ctccgtcaag ctgctgctca gggggtccgg 33900 gtccagggac tccctcagca tgatgcccac ggccctcagc atcagtcgtc tggtgcggcg 33960 ggcgcagcag cgcatgcgga tctcgctcag gtcgctgcag tacgtgcaac acagaaccac 34020 caggttgttc aacagtccat agttcaacac gctCcagccg aaactcatcg cgggaaggat 34080 gctacccacg tggccgtcgt accagatcct caggtaaatc aagtggtgcc ccctccagaa 34140 cacgctgccc acgtacatga tctccttggg catgtggcgg ttcaccacct cccggtacca 34200 catcaccctc tggttgaaca tgcagccccg gatgatcctg cggaaccaca gggccagcac 34260 cgccccgccc gccatgcagc gaagagaccc cgggtcccgg caatggcaat ggaggaccca 34320 ccgctcgtac ccgtggatca tctgggagct gaacaagtct atgttggcac agcacaggca 34380 tatgctcatg catctcttca gcactctcaa ctcctcgggg gtcaaaacca tatcccaggg 34440 cacggggaac tcttgcagga cagcgaaccc cgcagaacag ggcaatcctc gcacagaact 34500 tacattgtgc atggacaggg tatcgcaatc aggcagcacc gggtgatcct ccaccagaga 34560 agcgcgggtc tcgttctcct cacagcgtgg taagggggcc ggccgatacg ggtgatggcg 34620 ggacgcggct gatcgtgttc gcgaccgtgt catgatgcag ttgctttcgg acattttcgt 34680 acttgctgta gcagaacctg gtccgggcgC tgcacaccga tcgacggcgg cggtctcggc 34740 gcttggaacg ctcggtgttg aaattgtaaa acagccactc tctcagaccg tgcagcagat 34800 ctagggcctc aggagtgatg aagatcccat catgcctgat ggctctgatc acatcgacca 34860 ccgtggaatg ggccagaccc agccagatga tgcaattttg ttgggtttcg gtgacggcgg 34920 gggagggaag aacaggaaga accatgatta acttttaatc caaacggtct cggagtactt 34980 caaaatgaag atcgcggaga tggcacctct cgcccccgct gtgttggtgg aaaataacag 35040 ccaggtcaaa ggtgatacgg ttctcgagat gttccacggt ggctttcagc aaagcctcca 35100 cgcgcacatc cagaaacaag acaatagcga aagcgggagg gttctctaat tcctcaatca 35160 tcatgttaca ctcstgcacc atccccagat aattttcatt tttccagcct tgaatgattc 35220 gaactagttc gtgaggtaaa tccaagccag ccatgataaa gagctcgcgc agagcgccct 35280 ccaccggcat tcttaagcac accctcataa ttccaagata ttctgctcct ggttcacctg 35340 cagcagattg acaagcggaa tatcaaaatc tctgccgcga tccctgagct cctccctcag 35400 caataactgt aagtactctt tcatatcctc tccgaaattt ttagccatag gaccaccagg 35460 aataagatta gggcaagcca cagtacagat aaaccgaagt cctccccagt gagcattgcc 35520 aaatgcaaga ctgctataag catgctggct agacccggtg atatcttcca gataactgga 35580 cagaaaatcg cccaggcaat ttttaagaaa atcaacaaaa gaaaaatcct ccaggtggac 35640 gtttagagcc tcgggaacaa cgatgaagta aatgcaagcg gtgcgttcca gcatggttag 35700 ttagctgatc tgtagaaaaa acaaaaatga acattaaacc atgctagcct ggcgaacagg 35760 tgggtaaatc gttctctcca gcaccaggca ggccacgggg tctccggcgc gaccctcgta 35820 aaaattgtcg ctatgattga aaaccatcac agagagacgt tcccggtggc cggcgtgaat 35880 gattcgacaa gatgaataca cccccggaac attggcgtcc gcgagtgaaa aaaagcgccc 35940 gaggaagcaa taaggcacta caatgctcag tctcaagtcc agcaaagcga tgccatgcgg 36000 atgaagcaca aaattctcag gtgcgtacaa aatgtaatta ctcccctcct gcacaggcag 36060 caaagccccc gatccctcca ggtacacata caaagcctca gcgtccatag cttaccgagc 36120 agcagcacac aacaggcgca agagtcagag aaaggctgag ctctaacctg tccacccgct 36180 ctctgctcaa tatatagccc agatctacac tgacgtaaag gccaaagtct aaaaataccc 36240 gccaaataat cacacacgcc cagcacacgc ccagaaaccg gtgacacact caaaaaaata 36300 cgcgcacttc ctcaaacgcc caaaactgcc gtcatttccg ggttcccacg ctacgtcatc 36360 aaaacacgac tttcaaattc cgtcgaccgt taaaaacgtc acccgccccg cccctaacgg 36420 tcgcccgtct ctcagccaat cagcgccccg catccccaaa ttcaaacacc tcatttgcat 36480 attaacgcgc acaaaaagtt tgaggtatat tattgatgat g 36521 <210> 34 <211> 314 <212> PRT
<213> Human adenovirus type 4 <400> 34 Asn Thr Cys Gln Trp Lys Asp Ser Asp Ser Lys Met His Thr Phe Gly Ala Ala Ala Met Pro Gly Val Thr Gly Lys Lys Ile Glu Ala Asp Gly Leu Pro Ile Arg Ile Asp Ser Thr Ser Gly Thr Asp Thr Val Ile Tyr Ala Asp Lys Thr Phe Gln Pro Glu Pro Gln Val Gly Asn Asp Ser Trp Val Asp Thr Asn Gly Ala Glu Glu Lys Tyr Gly Gly Arg Ala Leu Lys Asp Thr Thr Lys Met Asn Pro Cys Tyr Gly Ser Phe Ala Lys Pro Thr Asn Lys Glu Gly Gly Gln Ala Asn Leu Lys Asp Ser Glu Pro Ala Ala Thr Thr Pro Asn Tyr Asp Ile Asp Leu Ala Phe Phe Asp Ser Lys Thr Ile Val Ala Asn Tyr Asp Pro Asp Ile Val Met Tyr Thr Glu Asn Val Asp Leu Gln Thr Pro Asp Thr His Ile Val Tyr Lys Pro Gly Thr Glu Asp Thr Ser Ser Glu Ser Asn Leu Gly Gln Gln Ala Met Pro Asn Arg Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser Gln Leu Asn Ala Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser Tyr Gln Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asn Gly Val Gly Leu Thr Asp Thr Tyr Gln Gly Val Lys Val Lys Thr Asp Ala Gly Ser Glu Lys Trp Asp Lys Asp Asp Thr Thr Val Ser Asn Ala Asn Glu Ile His Val Gly Asn Pro Phe Ala Met <210> 35 <211> 318 <212> PRT
<213> Human adenovirus type 16 <400> 35 Asn Thr Cys Gln Trp Lys Asp Ser Asp Ser Lys Met His Thr Phe Gly Val Ala Ala Met Pro Gly Val Thr Gly Lys Lys Ile Glu Ala Asp Gly Leu Pro Ile Gly Ile Asp Ser Thr Ser Gly Thr Asp Thr Val Ile Tyr Ala Asp Lys Thr Phe Gln Pro Glu Pro Gln Val Gly Asn Ala Ser Trp Val Asp Ala Asn Gly Thr Glu Glu Lys Tyr Gly Gly Arg Ala Leu Lys Asp Thr Thr Lys Met Lys Pro Cys Tyr Gly Ser Phe Ala Lys Pro Thr Asn Lys Glu Gly Gly Gln Ala Asn Leu Lys Asp Ser Glu Thr Ala Ala Thr Thr Pro Asn Tyr Asp Ile Asp Leu Ala Phe Phe Asp Asn Lys Asn Ile Ala Ala Asn Tyr Asp Pro Asp Ile Val Met Tyr Thr Glu Asn Val Asp Leu Gln Thr Pro Asp Thr His Ile Val Tyr Lys Pro Gly Thr Glu Asp Thr Ser Ser Glu Ser Asn Leu Gly Gln Gln Ala Met Pro Asn Arg Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser Gln Leu Asn Ala Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser Tyr Gln Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asn Gly Val Gly Phe Thr Asp Thr Tyr Gln Gly Val Lys Val Lys Thr Asp Ala Val Ala Gly Thr Ser Gly Thr Gln Trp Asp Lys Asp Asp Thr Thr Val Ser Thr Ala Asn Glu Ile His Gly Gly Asn Pro Phe Ala Met <210> 36 <211> 323 <212> PRT
<213> Human adenovirus type 3 <400> 36 Asn Thr Ser Gln Trp Ile Val Thr Thr Asn Gly Asp Asn Ala Val Thr Thr Thr Thr Asn Thr Phe Gly Ile Ala Ser Met Lys Gly Gly Asn Ile Thr Lys Glu Gly Leu Gln Ile Gly Lys Asp Ile Thr Thr Thr Glu Gly Glu Glu Lys Pro Ile Tyr Ala Asp Lys Thr Tyr Gln Pro Glu Pro Gln Val Gly Glu Glu Ser Trp Thr Asp Thr Asp Gly Thr Asn Glu Lys Phe Gly Gly Arg Ala Leu Lys Pro Ala Thr Asn Met Lys Pro Cys Tyr Gly Ser Phe Ala Arg Pro Thr Asn Ile Lys Gly Gly Gln Ala Lys Asn Arg Lys Val Lys Pro Thr Thr Glu Gly Gly Val Glu Thr Glu Glu Pro Asp Ile Asp Met Glu Phe Phe Asp Gly Arg Asp Ala Val Ala Gly Ala Leu Ala Pro Glu Ile Val Leu Tyr Thr Glu Asn Val Asn Leu Glu Thr Pro Asp Ser His Val Val Tyr Lys Pro Glu Thr Ser Asn Asn Ser His Ala Asn Leu Gly Gln Gln Ala Met Pro Asn Arg Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe Val Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser Gln Leu Asn Ala Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser Tyr Gln Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn His Gly Ile Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asn Gly Ile Gly Pro Gly His Thr Tyr Gln Gly Ile Lys Lys Val Lys Thr Asp Asp Thr Asn Gly Trp Glu Lys Asp Ala Asn Val Ala Pro Ala Asn Glu Ile Thr Ile Gly Asn Asn Leu Ala Met <210> 37 <211> 315 <212> PRT
<213> Human adenovirus type 7 <400> 37 Asn Thr Ser Gln Trp Ile Val Thr Ala Gly Glu Glu Arg Ala Val Thr Thr Thr Thr Asn Thr Phe Gly Ile Ala Ser Met Lys Gly Asp Asn Ile Thr Lys Glu Gly Leu Glu Ile Gly Lys Asp Ile Thr Ala Asp Asn Lys Pro Ile Tyr Ala Asp Lys Thr Tyr Gln Pro Glu Pro Gln Val Gly Glu Glu Ser Trp Thr Asp Thr Asp Gly Thr Asn Glu Lys Phe Gly Gly Arg Ala Leu Lys Pro Ala Thr Lys Met Lys Pro Cys Tyr Gly Ser Phe Ala Arg Pro Thr Asn Ile Lys Gly Gly Gln Ala Lys Asn Arg Lys Val Lys Pro Thr Glu Gly Asp Val Glu Thr Glu Glu Pro Asp Ile Asp Met Glu Phe Phe Asp Gly Arg Glu Ala Ala Asp Ala Phe Ser Pro Glu Ile Val Leu Tyr Thr Glu Asn Val Asn Leu Glu Thr Pro Asp Ser His Val Val Tyr Lys Pro Gly Thr Ser Asp Asp Asn Ser His Ala Asn Leu Gly Gln Gln Ala Met Pro Asn Arg Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe Val Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Giy Val Leu Ala Gly Gln Ala Ser Gln Leu Asn Ala Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser Tyr Gln Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn His Gly Ile Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asp Gly Ile Gly Pro Ala Lys Thr Tyr Gln Gly Ile Lys Ser Lys Asp Asn Giy Trp Glu Lys Asp Asp Asn Val Ser Lys Ser Asn Glu Ile Ala Ile Gly Asn Asn Gln Ala Met <210> 38 <211> 345 <212> PRT
<213> Human adenovirus type 2 <400> 38 Asn Ser Cys Glu Trp Glu Gln Thr Glu Asp Ser Gly Arg Ala Val Ala Glu Asp Glu Glu Glu Glu Asp Glu Asp Glu Glu Glu Glu Glu Glu Glu Gln Asn Ala Arg Asp Gln Ala Thr Lys Lys Thr His Val Tyr Ala Gln Ala Pro Leu Ser Gly Glu Thr Leu Thr Lys Ser Gly Leu Gln Ile Gly Ser Lys Asn Ala Glu Thr Gln Ala Lys Pro Val Tyr Ala Asp Pro Ser Tyr Gln Pro Glu Pro Gln Ile Gly Glu Ser Gln Trp Asn Giu Ala Asp Ala Asn Ala Ala Gly Gly Arg Val Leu Lys Lys Thr Thr Pro Met Lys Pro Tyr Gly Ser Tyr Ala Arg Pro Thr Asn Pro Phe Gly Gly Gln Ser Val Leu Val Pro Asp Glu Lys Gly Val Pro Leu Pro Lys Val Asp Leu Gln Phe Phe Ser Asn Thr Thr Ser Leu Asn Asp Arg Gln Gly Asn Ala Thr Lys Pro Lys Val Val Leu Tyr Ser Glu Asp Val Asn Met Glu Thr Pro Asp Thr His Leu Ser Tyr Lys Pro Gly Lys Gly Asp Glu Asn Ser Lys Ala Met Leu Gly Gln Gln Ser Met Pro Asn Arg Pro Asn Tyr Ile Ala Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser Gln Leu Asn Ala Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser Tyr Gln Leu Leu Leu Asp Ser Ile Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn His Gly Thr Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Gly Gly Ile Gly Val Thr Asp Thr Tyr Gln Ala Ile Lys Ala Asn Gly Asn Gly Ser Gly Asp Asn Gly Asp Thr Thr Trp Thr Lys Asp Glu Thr Phe Ala Thr Arg Asn Glu Ile Gly Val Gly Asn Asn Phe Ala Met <210> 39 <211> 183 <212> PRT
<213> human adenovirus protein <400> 39 Thr Leu Trp Thr Thr Pro Asp Pro Ser Pro Asn Cys Arg Ile His Ser Asp Asn Asp Cys Lys Phe Thr Leu Val Leu Thr Lys Cys Gly Ser Gln Val Leu Ala Thr Val Ala Ala Leu Ala Val Ser Gly Asp Leu Ser Ser Met Thr Gly Thr Val Ala Ser Val Ser Ile Phe Leu Arg Phe Asp Gln Asn Gly Val Leu Met Glu Asn Ser Ser Leu Lys Lys His Tyr Trp Asn Phe Arg Asn Gly Asn Ser Thr Asn Ala Asn Pro Tyr Thr Asn Ala Val Gly Phe Met Pro Asn Leu Leu Ala Tyr Pro Lys Thr Gln Ser Gln Thr Ala Lys Asn Asn Ile Val Ser Gln Val Tyr Leu His Gly Asp Lys Thr Lys Pro Met Ile Leu Thr Ile Thr Leu Asn Gly Thr Ser Glu Ser Thr Glu Thr Ser Glu Val Ser Thr Tyr Ser Met Ser Phe Thr Trp Ser Trp Glu Ser Gly Lys Tyr Thr Thr Glu Thr Phe Ala Thr Asn Ser Tyr Thr Phe Ser Tyr Ile Ala Gln Glu <210> 40 <211> 182 <212> PRT
<213> human adenovirus protein <400> 40 Thr Leu Trp Thr Thr Pro Ala Pro Ser Pro Asn Cys Arg Leu Asn Ala Glu Lys Asp Ala Lys Leu Thr Leu Val Leu Thr Lys Cys Gly Ser Gln Ile Leu Ala Thr Val Ser Val Leu Ala Val Lys Gly Ser Leu Ala Pro Ile Ser Gly Thr Val Gln Ser Ala His Leu Ile Ile Arg Phe Asp Glu Asn Gly Val Leu Leu Asn Asn Ser Phe Leu Asp Pro Glu Tyr Trp Asn Phe Arg Asn Gly Asp Leu Thr Glu Gly Thr Ala Tyr Thr Asn Ala Val Gly Phe Met Pro Asn Leu Ser Ala Tyr Pro Lys Ser His Gly Lys Thr Ala Lys Ser Asn Ile Val Ser Gln Val Tyr Leu Asn Gly Asp Lys Thr Lys Pro Val Thr Leu Thr Ile Thr Leu Asn Gly Thr Gln Glu Thr Gly Asp Thr Thr Pro Ser Ala Tyr Ser Met Ser Phe Ser Trp Asp Trp Ser Gly His Asn Tyr Ile Asn Glu Ile Phe Ala Thr Ser Ser Tyr Thr Phe Ser Tyr Ile Ala Gln Glu <210> 41 <211> 338 <212> PRT
<213> human adenovirus protein <400> 41 Ala Pro Lys Gly Ala Pro Asn Pro Cys Glu Trp Asp Glu Ala Ala Thr Ala Leu Glu Ile Asn Leu Glu Glu Glu Asp Asp Asp Asn Glu Asp Glu Val Asp Glu Gln Ala Glu Gln Gln Lys Thr His Val Phe Gly Gln Ala Pro Tyr Ser Gly Ile Asn Ile Thr Lys Glu Gly Ile Gln Ile Gly Val Glu Gly Gln Thr Pro Lys Tyr Ala Asp Lys Thr Phe Gln Pro Glu Pro Gln Ile Gly Glu Ser Gin Trp Tyr Glu Thr Glu Ile Asn His Ala Ala Gly Arg Val Leu Lys Lys Thr Thr Pro Met Lys Pro Cys Tyr Gly Ser Tyr Ala Lys Pro Thr Asn Glu Asn Gly Gly Gln Gly Ile Leu Val Lys Gln Gln Asn Gly Lys Leu Glu Ser Gln Val Glu Met Gln Phe Phe Ser Thr Thr Glu Ala Thr Ala Gly Asn Gly Asp Asn Leu Thr Pro Lys Val Val Leu Tyr Ser Glu Asp Val Asp Ile Glu Thr Pro Asp Thr His Ile Ser Tyr Met Pro Thr Ile Lys Glu Gly Asn Ser Arg Glu Leu Met Gly Gln Gln Ser Met Pro Asn Arg Pro Asn Tyr Ile Ala Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser Gln Leu Asn Ala Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser Tyr Gln Leu Leu Leu Asp Ser Ile Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn His Gly Thr Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Gly Gly Val Ile Asn Thr Glu Thr Leu Thr Lys Val Lys Pro Lys Thr Gly Gln Glu Asn Gly Trp Glu Lys Asp Ala Thr Glu Phe Ser Asp Lys Asn Glu Ile Arg Val Gly Asn Asn Phe Ala Met Glu Ile
Claims (17)
1. A recombinant adenovirus comprising an adenovirus capsid wherein the capsid comprises a hexon protein composed of one or more fragments of the C68 hexon protein of SEQ ID NO: 16 fused to a heterologous adenovirus hexon peptide, said capsid encapsidating a molecule for delivery to a target cell, wherein the molecule comprises an adenovirus 5' inverted terminal repeat sequence (ITRs), a minigene, and an adenovirus 3' ITR, wherein said one or more fragments of the C68 hexon protein is selected from the group consisting of:
(a) amino acids 125 to 443 of SEQ ID NO:16;
(b) amino acids 131 to 441 of SEQ ID NO: 16;
(c) amino acids 138 to 441 of SEQ ID NO:16;
(d) amino acids 138 to 163 of SEQ ID NO:16;
(e) amino acids 170 to 176 of SEQ ID NO: 16;
(f) amino acids 195 to 203 of SEQ ID NO: 16;
(g) amino acids 233 to 246 of SEQ ID NO: 16;
(h) amino acids 253 to 264 of SEQ ID NO:16;
(i) amino acids 287 to 297 of SEQ ID NO: 16; and (j) amino acids 404 to 430 of SEQ ID NO: 16.
(a) amino acids 125 to 443 of SEQ ID NO:16;
(b) amino acids 131 to 441 of SEQ ID NO: 16;
(c) amino acids 138 to 441 of SEQ ID NO:16;
(d) amino acids 138 to 163 of SEQ ID NO:16;
(e) amino acids 170 to 176 of SEQ ID NO: 16;
(f) amino acids 195 to 203 of SEQ ID NO: 16;
(g) amino acids 233 to 246 of SEQ ID NO: 16;
(h) amino acids 253 to 264 of SEQ ID NO:16;
(i) amino acids 287 to 297 of SEQ ID NO: 16; and (j) amino acids 404 to 430 of SEQ ID NO: 16.
2. The recombinant adenovirus according to claim 1, wherein the C68 fragment is a loop region of the hexon.
3. The recombinant adenovirus according to claim 2, wherein at least one loop region of the C68 hexon protein is substituted with a loop region from a heterologous adenovirus serotype.
4. The recombinant adenovirus according to any one of claims 1 to 3, wherein the capsid further comprises a fiber protein, a penton protein, or both a fiber protein and a penton protein from an adenovirus other than C68.
5. The recombinant adenovirus according to claim 4, wherein the penton protein comprises the amino acid sequence of SEQ ID NO: 12, or the fiber protein comprises the amino acid sequence of SEQ ID NO:27.
6. The recombinant adenovirus according to claim 4, wherein the fiber protein comprises amino acids 247 to 425 of SEQ ID NO:27.
7. The adenoviral capsid according to claim 4, wherein the penton and fiber are from a human adenovirus serotype.
8. The recombinant adenovirus according to any one of claims 1 to 7, wherein the adenovirus ITRs are from a serotype heterologous to C68.
9. The recombinant adenovirus according to any one of claims 1 to 8, wherein the one or more fragments of the C68 hexon protein SEQ ID NO:16 is fused to a heterologous hexon peptide via a linker.
10. The recombinant adenovirus according to any one of claims 1 to 9, wherein the adenovirus is functionally deleted in adenovirus E1a gene and adenovirus Elb gene.
11. A pharmaceutical composition comprising a physiologically acceptable carrier and a recombinant virus according to any one of claims I to 10.
12. Use of a recombinant adenovirus according to any one of claims I to 10 or a pharmaceutical composition according to claim 11 for delivering a molecule to a target cell.
13. Use according to claim 12, wherein said molecule is an immunogen.
14. Use according to claim 12, wherein said molecule is a therapeutic molecule.
15. Use of a recombinant adenovirus according to any one of claims 1 to 10 for the manufacture of a medicament for delivering a molecule to a target cell.
16. Use according to claim 15, wherein said molecule is an immunogen.
17. Use according to claim 15, wherein said molecule is a therapeutic molecule.
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US30050101P | 2001-06-22 | 2001-06-22 | |
US60/300,501 | 2001-06-22 | ||
US38563202P | 2002-06-04 | 2002-06-04 | |
US60/385,632 | 2002-06-04 | ||
PCT/US2002/019735 WO2003000851A2 (en) | 2001-06-22 | 2002-06-20 | Method for rapid screening of bacterial transformants and novel simian adenovirus proteins |
Publications (2)
Publication Number | Publication Date |
---|---|
CA2450470A1 CA2450470A1 (en) | 2003-01-03 |
CA2450470C true CA2450470C (en) | 2012-08-28 |
Family
ID=26971814
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA2450470A Expired - Fee Related CA2450470C (en) | 2001-06-22 | 2002-06-20 | Method for rapid screening of bacterial transformants and novel simian adenovirus proteins |
Country Status (8)
Country | Link |
---|---|
US (2) | US7344872B2 (en) |
EP (1) | EP1409748B1 (en) |
JP (1) | JP4399255B2 (en) |
AT (1) | ATE530672T1 (en) |
AU (1) | AU2002322285A1 (en) |
CA (1) | CA2450470C (en) |
ES (1) | ES2375557T3 (en) |
WO (1) | WO2003000851A2 (en) |
Families Citing this family (75)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE3928858A1 (en) * | 1989-08-31 | 1991-03-07 | Beiersdorf Ag | NETWORKED HYDROGELES AND METHOD FOR THE PRODUCTION THEREOF |
US20040136963A1 (en) * | 2001-06-22 | 2004-07-15 | The Trustees Of The University Of Pennsylvania | Simian adenovirus vectors and methods of use |
CA2450470C (en) | 2001-06-22 | 2012-08-28 | The Trustees Of The University Of Pennsylvania | Method for rapid screening of bacterial transformants and novel simian adenovirus proteins |
CA2466431C (en) | 2001-11-21 | 2014-08-05 | The Trustees Of The University Of Pennsylvania | Simian adenovirus nucleic acid and amino acid sequences, vectors containing same, and methods of use |
US7291498B2 (en) | 2003-06-20 | 2007-11-06 | The Trustees Of The University Of Pennsylvania | Methods of generating chimeric adenoviruses and uses for such chimeric adenoviruses |
US7491508B2 (en) | 2003-06-20 | 2009-02-17 | The Trustees Of The University Of Pennsylvania | Methods of generating chimeric adenoviruses and uses for such chimeric adenoviruses |
AU2011247887B2 (en) * | 2004-01-23 | 2014-11-20 | Msd Italia S.R.L. | Chimpanzee adenovirus vaccine carriers |
LT2163260T (en) * | 2004-01-23 | 2017-06-26 | Msd Italia S.R.L. | Chimpanzee adenovirus vaccine carriers |
JP2008500364A (en) * | 2004-05-25 | 2008-01-10 | キメラコア, インコーポレイテッド | Self-assembling nanoparticle drug delivery system |
ATE492643T1 (en) * | 2004-10-13 | 2011-01-15 | Crucell Holland Bv | IMPROVED ADENOVIRUS VECTORS AND THEIR USE |
NZ581306A (en) | 2004-11-16 | 2011-03-31 | Crucell Holland Bv | Multivalent vaccines comprising recombinant viral vectors |
EP1893636A2 (en) | 2005-06-17 | 2008-03-05 | Istituto Di Ricerche Di Biologia Molecolare P. Angeletti S.P.A. | Hepatitis c virus nucleic acid vaccine |
AU2007348315A1 (en) * | 2006-04-07 | 2008-09-12 | Chimeros, Inc. | Compositions and methods for treating B- cell malignancies |
CA2683063A1 (en) * | 2007-04-09 | 2008-10-16 | Chimeros, Inc. | Self-assembling nanoparticle drug delivery system |
KR101761683B1 (en) | 2007-11-28 | 2017-07-26 | 더 트러스티스 오브 더 유니버시티 오브 펜실바니아 | Simian subfamily b adenoviruses sadv-28,27,-29,-32,-33, and -35 and uses thereof |
EP2463362B1 (en) * | 2007-11-28 | 2017-11-08 | The Trustees Of The University Of Pennsylvania | Simian subfamily c adenovirus SAdv-31 and uses thereof |
PL2220241T3 (en) * | 2007-11-28 | 2017-06-30 | The Trustees Of The University Of Pennsylvania | Adenovirus comprising a Simian E Adenovirus SAdV-39 capsid hexon protein and uses thereof |
EP2250255A2 (en) | 2008-03-04 | 2010-11-17 | The Trustees of the University of Pennsylvania | Simian adenoviruses sadv-36,-42.1, -42.2, and -44 and uses thereof |
US9217155B2 (en) | 2008-05-28 | 2015-12-22 | University Of Massachusetts | Isolation of novel AAV'S and uses thereof |
JP5809978B2 (en) * | 2008-10-31 | 2015-11-11 | ザ・トラステイーズ・オブ・ザ・ユニバーシテイ・オブ・ペンシルベニア | Simian adenovirus SAdV-43, -45, -46, -47, -48, -49 and -50 and their uses |
DK2391638T3 (en) * | 2009-02-02 | 2018-08-27 | Glaxosmithkline Biologicals Sa | Abeadenovirus nucleic acid and amino acid sequences, vectors containing them, and uses thereof. |
SG172935A1 (en) | 2009-02-02 | 2011-08-29 | Okairos Ag | Simian adenovirus nucleic acid- and amino acid-sequences, vectors containing same, and uses thereof |
WO2010120874A2 (en) | 2009-04-14 | 2010-10-21 | Chimeros, Inc. | Chimeric therapeutics, compositions, and methods for using same |
WO2010138263A2 (en) | 2009-05-28 | 2010-12-02 | University Of Massachusetts | Novel aav 's and uses thereof |
CA2762203A1 (en) | 2009-05-29 | 2010-12-02 | Soumitra Roy | Simian adenovirus 41 and uses thereof |
WO2011133890A1 (en) | 2010-04-23 | 2011-10-27 | University Of Massachusetts | Cns targeting aav vectors and methods of use thereof |
CA2833905C (en) | 2010-04-23 | 2019-09-10 | University Of Massachusetts | Multicistronic expression constructs |
JP2013533847A (en) | 2010-04-23 | 2013-08-29 | ユニバーシティ オブ マサチューセッツ | AAV-based treatment of cholesterol-related disorders |
CN103118702A (en) | 2010-09-20 | 2013-05-22 | 克鲁塞尔荷兰公司 | Therapeutic vaccination against active tuberculosis |
WO2012071318A2 (en) | 2010-11-23 | 2012-05-31 | The Trustees Of The University Of Pennsylvania | Subfamily e simian adenoviruses a1321, a1325, a1295, a1309, a1316 and a1322 and uses thereof |
JP2014527072A (en) | 2011-09-09 | 2014-10-09 | バイオメド リアルティー, エル.ピー. | Methods and compositions for controlling the assembly of viral proteins |
MX358019B (en) | 2012-05-18 | 2018-08-02 | Univ Pennsylvania | Subfamily e simian adenoviruses a1302, a1320, a1331 and a1337 and uses thereof. |
KR102089121B1 (en) | 2013-03-14 | 2020-03-13 | 더 솔크 인스티튜트 포 바이올로지칼 스터디즈 | Oncolytic adenovirus compositions |
BR112016008806A2 (en) | 2013-11-01 | 2017-10-03 | Pfizer | VECTORS FOR EXPRESSION OF PROSTATE-ASSOCIATED ANTIGENS |
WO2015191508A1 (en) | 2014-06-09 | 2015-12-17 | Voyager Therapeutics, Inc. | Chimeric capsids |
CN107073051B (en) | 2014-10-21 | 2021-08-24 | 马萨诸塞大学 | Recombinant AAV variants and uses thereof |
JP6401871B2 (en) | 2014-11-05 | 2018-10-10 | ボイジャー セラピューティクス インコーポレイテッドVoyager Therapeutics,Inc. | AADC polynucleotide for the treatment of Parkinson's disease |
SG11201703419UA (en) | 2014-11-14 | 2017-05-30 | Voyager Therapeutics Inc | Modulatory polynucleotides |
EP3218484A4 (en) | 2014-11-14 | 2018-05-30 | Voyager Therapeutics, Inc. | Compositions and methods of treating amyotrophic lateral sclerosis (als) |
EP3230441A4 (en) | 2014-12-12 | 2018-10-03 | Voyager Therapeutics, Inc. | Compositions and methods for the production of scaav |
US10983110B2 (en) | 2015-12-02 | 2021-04-20 | Voyager Therapeutics, Inc. | Assays for the detection of AAV neutralizing antibodies |
AU2017222568B2 (en) | 2016-02-23 | 2020-09-10 | Salk Institute For Biological Studies | High throughput assay for measuring adenovirus replication kinetics |
KR20220163505A (en) | 2016-02-23 | 2022-12-09 | 솔크 인스티튜트 포 바이올로지칼 스터디즈 | Exogenous gene expression in therapeutic adenovirus for minimal impact on viral kinetics |
WO2017189964A2 (en) | 2016-04-29 | 2017-11-02 | Voyager Therapeutics, Inc. | Compositions for the treatment of disease |
EP3448874A4 (en) | 2016-04-29 | 2020-04-22 | Voyager Therapeutics, Inc. | Compositions for the treatment of disease |
EP3458588A4 (en) | 2016-05-18 | 2020-01-15 | Voyager Therapeutics, Inc. | Modulatory polynucleotides |
WO2017201258A1 (en) | 2016-05-18 | 2017-11-23 | Voyager Therapeutics, Inc. | Compositions and methods of treating huntington's disease |
WO2018035388A1 (en) | 2016-08-17 | 2018-02-22 | The Broad Institute, Inc. | Novel crispr enzymes and systems |
US20200283743A1 (en) | 2016-08-17 | 2020-09-10 | The Broad Institute, Inc. | Novel crispr enzymes and systems |
EP3831281A1 (en) | 2016-08-30 | 2021-06-09 | The Regents of The University of California | Methods for biomedical targeting and delivery and devices and systems for practicing the same |
AU2017341849B2 (en) | 2016-10-13 | 2024-03-21 | University Of Massachusetts | AAV capsid designs |
AU2017375633C1 (en) | 2016-12-12 | 2023-04-27 | Salk Institute For Biological Studies | Tumor-targeting synthetic adenoviruses and uses thereof |
US20200405639A1 (en) | 2017-04-14 | 2020-12-31 | The Broad Institute, Inc. | Novel delivery of large payloads |
JP2020518258A (en) | 2017-05-05 | 2020-06-25 | ボイジャー セラピューティクス インコーポレイテッドVoyager Therapeutics,Inc. | Amyotrophic lateral sclerosis (ALS) treatment composition and method |
WO2018204803A1 (en) | 2017-05-05 | 2018-11-08 | Voyager Therapeutics, Inc. | Compositions and methods of treating huntington's disease |
TW202333779A (en) | 2017-05-08 | 2023-09-01 | 美商磨石生物公司 | Alphavirus neoantigen vectors |
JOP20190269A1 (en) | 2017-06-15 | 2019-11-20 | Voyager Therapeutics Inc | Aadc polynucleotides for the treatment of parkinson's disease |
US11497576B2 (en) | 2017-07-17 | 2022-11-15 | Voyager Therapeutics, Inc. | Trajectory array guide system |
EP3808849A1 (en) | 2017-08-03 | 2021-04-21 | Voyager Therapeutics, Inc. | Compositions and methods for delivery of aav |
AU2018352236A1 (en) | 2017-10-16 | 2020-04-23 | The Curators Of The University Of Missouri | Treatment of amyotrophic lateral sclerosis (ALS) |
WO2019079242A1 (en) | 2017-10-16 | 2019-04-25 | Voyager Therapeutics, Inc. | Treatment of amyotrophic lateral sclerosis (als) |
EP3710039A4 (en) | 2017-11-13 | 2021-08-04 | The Broad Institute, Inc. | Methods and compositions for treating cancer by targeting the clec2d-klrb1 pathway |
US20220153871A1 (en) | 2018-01-04 | 2022-05-19 | Iconic Therapeutics, Inc. | Anti-Tissue Factor Antibodies, Antibody-Drug Conjugates, and Related Methods |
EP3807404A1 (en) | 2018-06-13 | 2021-04-21 | Voyager Therapeutics, Inc. | Engineered 5' untranslated regions (5' utr) for aav production |
EP3826719A1 (en) | 2018-07-24 | 2021-06-02 | Voyager Therapeutics, Inc. | Systems and methods for producing gene therapy formulations |
TW202035689A (en) | 2018-10-04 | 2020-10-01 | 美商航海家醫療公司 | Methods for measuring the titer and potency of viral vector particles |
SG11202103425YA (en) | 2018-10-05 | 2021-05-28 | Voyager Therapeutics Inc | Engineered nucleic acid constructs encoding aav production proteins |
WO2020081490A1 (en) | 2018-10-15 | 2020-04-23 | Voyager Therapeutics, Inc. | EXPRESSION VECTORS FOR LARGE-SCALE PRODUCTION OF rAAV IN THE BACULOVIRUS/Sf9 SYSTEM |
AU2019406778A1 (en) | 2018-12-17 | 2021-07-22 | Massachusetts Institute Of Technology | Crispr-associated transposase systems and methods of use thereof |
EP3917566A4 (en) | 2019-01-31 | 2022-10-26 | Oregon Health & Science University | Methods for using transcription-dependent directed evolution of aav capsids |
US20220220469A1 (en) | 2019-05-20 | 2022-07-14 | The Broad Institute, Inc. | Non-class i multi-component nucleic acid targeting systems |
BR122024002387A2 (en) | 2019-05-30 | 2024-03-12 | Gritstone Bio, Inc. | ADENOVIRUS VECTORS, PHARMACEUTICAL COMPOSITION, ISOLATED NUCLEOTIDE SEQUENCE, ISOLATED CELL, VECTOR, KIT, USES OF A VECTOR, METHOD FOR MAKING THE VECTOR, METHODS FOR PRODUCING A VIRUS AND VIRAL VECTOR |
AU2021320896A1 (en) | 2020-08-06 | 2023-03-23 | Gritstone Bio, Inc. | Multiepitope vaccine cassettes |
BR112023015303A2 (en) | 2021-02-01 | 2023-11-14 | Regenxbio Inc | METHOD TO TREAT CLN2 DISEASE DUE TO TPP1 DEFICIENCY IN A SUBJECT |
WO2023196818A1 (en) | 2022-04-04 | 2023-10-12 | The Regents Of The University Of California | Genetic complementation compositions and methods |
Family Cites Families (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5770442A (en) | 1995-02-21 | 1998-06-23 | Cornell Research Foundation, Inc. | Chimeric adenoviral fiber protein and methods of using same |
WO1998010087A1 (en) | 1996-09-06 | 1998-03-12 | Trustees Of The University Of Pennsylvania | Chimpanzee adenovirus vectors |
US5922315A (en) | 1997-01-24 | 1999-07-13 | Genetic Therapy, Inc. | Adenoviruses having altered hexon proteins |
CA2321135A1 (en) | 1998-02-17 | 1999-08-19 | Uab Research Foundation | Modified adenovirus containing a fiber replacement protein |
US20030017138A1 (en) * | 1998-07-08 | 2003-01-23 | Menzo Havenga | Chimeric adenoviruses |
CA2450470C (en) | 2001-06-22 | 2012-08-28 | The Trustees Of The University Of Pennsylvania | Method for rapid screening of bacterial transformants and novel simian adenovirus proteins |
US20030092161A1 (en) * | 2001-09-19 | 2003-05-15 | The Trustees Of The University Of Pennsylvania | Compositions and methods for production of recombinant viruses, and uses therefor |
CA2466431C (en) | 2001-11-21 | 2014-08-05 | The Trustees Of The University Of Pennsylvania | Simian adenovirus nucleic acid and amino acid sequences, vectors containing same, and methods of use |
US7291498B2 (en) * | 2003-06-20 | 2007-11-06 | The Trustees Of The University Of Pennsylvania | Methods of generating chimeric adenoviruses and uses for such chimeric adenoviruses |
US7491508B2 (en) * | 2003-06-20 | 2009-02-17 | The Trustees Of The University Of Pennsylvania | Methods of generating chimeric adenoviruses and uses for such chimeric adenoviruses |
FR2860004A1 (en) | 2003-09-18 | 2005-03-25 | Roussy Inst Gustave | Adenoviral vector encoding modified capsid protein, useful in gene therapy of e.g. cancer and cystic fibrosis, can infect cells deficient in, or lacking, the common receptor for Coxsackie virus B3 and adenovirus |
DE602007004470D1 (en) * | 2006-04-28 | 2010-03-11 | Univ Pennsylvania | MODIFIED ADENOVIRUS HEXON PROTEIN AND APPLICATIONS THEREOF |
-
2002
- 2002-06-20 CA CA2450470A patent/CA2450470C/en not_active Expired - Fee Related
- 2002-06-20 US US10/477,527 patent/US7344872B2/en not_active Expired - Lifetime
- 2002-06-20 AU AU2002322285A patent/AU2002322285A1/en not_active Abandoned
- 2002-06-20 ES ES02756264T patent/ES2375557T3/en not_active Expired - Lifetime
- 2002-06-20 EP EP02756264A patent/EP1409748B1/en not_active Expired - Lifetime
- 2002-06-20 AT AT02756264T patent/ATE530672T1/en not_active IP Right Cessation
- 2002-06-20 JP JP2003507238A patent/JP4399255B2/en not_active Expired - Fee Related
- 2002-06-20 WO PCT/US2002/019735 patent/WO2003000851A2/en active Application Filing
-
2008
- 2008-01-25 US US12/011,377 patent/US7838277B2/en not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
EP1409748B1 (en) | 2011-10-26 |
WO2003000851A3 (en) | 2003-02-20 |
US20040171807A1 (en) | 2004-09-02 |
JP4399255B2 (en) | 2010-01-13 |
CA2450470A1 (en) | 2003-01-03 |
US20080219954A1 (en) | 2008-09-11 |
ES2375557T3 (en) | 2012-03-02 |
JP2004537300A (en) | 2004-12-16 |
WO2003000851A2 (en) | 2003-01-03 |
EP1409748A4 (en) | 2005-03-30 |
AU2002322285A1 (en) | 2003-01-08 |
US7344872B2 (en) | 2008-03-18 |
EP1409748A2 (en) | 2004-04-21 |
US7838277B2 (en) | 2010-11-23 |
ATE530672T1 (en) | 2011-11-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CA2450470C (en) | Method for rapid screening of bacterial transformants and novel simian adenovirus proteins | |
AU2008331905B2 (en) | Simian subfamily B adenovirus SAdV-28 and uses thereof | |
US7491508B2 (en) | Methods of generating chimeric adenoviruses and uses for such chimeric adenoviruses | |
US9133483B2 (en) | Simian adenovirus nucleic acid and amino acid sequences, vectors containing same, and methods of use | |
US9617561B2 (en) | Simian adenovirus 41 and uses thereof | |
AU2008350937B2 (en) | Simian subfamily C adenovirus SAdV-31 and uses thereof | |
US7291498B2 (en) | Methods of generating chimeric adenoviruses and uses for such chimeric adenoviruses | |
US20170183636A1 (en) | Simian Adenoviruses SAdV-36, -42.1, -42.2, and -44 and Uses Thereof | |
AU2008331906B2 (en) | Simian E adenovirus SAdV-39 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
EEER | Examination request | ||
MKLA | Lapsed |
Effective date: 20180620 |