CA2351110A1 - Helicobacter felis vaccine - Google Patents
Helicobacter felis vaccine Download PDFInfo
- Publication number
- CA2351110A1 CA2351110A1 CA002351110A CA2351110A CA2351110A1 CA 2351110 A1 CA2351110 A1 CA 2351110A1 CA 002351110 A CA002351110 A CA 002351110A CA 2351110 A CA2351110 A CA 2351110A CA 2351110 A1 CA2351110 A1 CA 2351110A1
- Authority
- CA
- Canada
- Prior art keywords
- gly
- lys
- thr
- val
- glu
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 229960005486 vaccine Drugs 0.000 title claims abstract description 53
- 241000590017 Helicobacter felis Species 0.000 title claims abstract description 34
- 229920001184 polypeptide Polymers 0.000 claims abstract description 109
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 109
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 109
- 239000012634 fragment Substances 0.000 claims abstract description 68
- 150000007523 nucleic acids Chemical group 0.000 claims abstract description 60
- 108020004414 DNA Proteins 0.000 claims abstract description 51
- 108010046334 Urease Proteins 0.000 claims abstract description 46
- 238000000034 method Methods 0.000 claims abstract description 26
- 238000001514 detection method Methods 0.000 claims abstract description 19
- 238000002405 diagnostic procedure Methods 0.000 claims abstract description 15
- 230000000890 antigenic effect Effects 0.000 claims abstract description 13
- 108020004511 Recombinant DNA Proteins 0.000 claims abstract description 8
- 239000000463 material Substances 0.000 claims abstract description 7
- 238000004519 manufacturing process Methods 0.000 claims abstract description 5
- 238000002360 preparation method Methods 0.000 claims abstract description 3
- 150000001413 amino acids Chemical class 0.000 claims description 30
- 230000002163 immunogen Effects 0.000 claims description 28
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 26
- 208000015181 infectious disease Diseases 0.000 claims description 19
- 241000700605 Viruses Species 0.000 claims description 17
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 16
- 230000028993 immune response Effects 0.000 claims description 15
- 230000001939 inductive effect Effects 0.000 claims description 13
- 238000012360 testing method Methods 0.000 claims description 9
- 239000000427 antigen Substances 0.000 claims description 7
- 102000036639 antigens Human genes 0.000 claims description 7
- 108091007433 antigens Proteins 0.000 claims description 7
- 244000005700 microbiome Species 0.000 claims description 7
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 6
- 102000053602 DNA Human genes 0.000 claims description 6
- 239000002671 adjuvant Substances 0.000 claims description 6
- 239000003937 drug carrier Substances 0.000 claims description 6
- 241000124008 Mammalia Species 0.000 claims description 5
- 230000002068 genetic effect Effects 0.000 claims description 5
- 239000002773 nucleotide Substances 0.000 claims description 5
- 125000003729 nucleotide group Chemical group 0.000 claims description 5
- 241000606660 Bartonella Species 0.000 claims description 2
- 241000588779 Bordetella bronchiseptica Species 0.000 claims description 2
- 241000606161 Chlamydia Species 0.000 claims description 2
- 208000000655 Distemper Diseases 0.000 claims description 2
- 241000711475 Feline infectious peritonitis virus Species 0.000 claims description 2
- 241000701925 Feline parvovirus Species 0.000 claims description 2
- 241000282324 Felis Species 0.000 claims description 2
- 241000589929 Leptospira interrogans Species 0.000 claims description 2
- 230000007812 deficiency Effects 0.000 claims description 2
- 241000701161 unidentified adenovirus Species 0.000 claims description 2
- 230000001717 pathogenic effect Effects 0.000 claims 2
- 241000589969 Borreliella burgdorferi Species 0.000 claims 1
- 241000701931 Canine parvovirus Species 0.000 claims 1
- 239000000969 carrier Substances 0.000 abstract description 3
- ZZUFCTLCJUWOSV-UHFFFAOYSA-N furosemide Chemical compound C1=C(Cl)C(S(=O)(=O)N)=CC(C(O)=O)=C1NCC1=CC=CO1 ZZUFCTLCJUWOSV-UHFFFAOYSA-N 0.000 description 64
- 229940054541 urex Drugs 0.000 description 64
- 241000589989 Helicobacter Species 0.000 description 53
- 108090000623 proteins and genes Proteins 0.000 description 40
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 31
- 239000004202 carbamide Substances 0.000 description 31
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 29
- 108010015792 glycyllysine Proteins 0.000 description 24
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 23
- 210000004027 cell Anatomy 0.000 description 22
- 108010054155 lysyllysine Proteins 0.000 description 21
- 108020004707 nucleic acids Proteins 0.000 description 20
- 102000039446 nucleic acids Human genes 0.000 description 20
- 108010092854 aspartyllysine Proteins 0.000 description 17
- 108010048818 seryl-histidine Proteins 0.000 description 17
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 16
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 16
- 108010047857 aspartylglycine Proteins 0.000 description 15
- 108010092114 histidylphenylalanine Proteins 0.000 description 15
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 14
- PYFNONMJYNJENN-AVGNSLFASA-N Lys-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PYFNONMJYNJENN-AVGNSLFASA-N 0.000 description 14
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 14
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 14
- 108010049041 glutamylalanine Proteins 0.000 description 14
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 13
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 12
- 241001465754 Metazoa Species 0.000 description 12
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 12
- 108010013835 arginine glutamate Proteins 0.000 description 12
- 108010038633 aspartylglutamate Proteins 0.000 description 12
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 11
- 108010047495 alanylglycine Proteins 0.000 description 11
- 108010050848 glycylleucine Proteins 0.000 description 11
- 108010036413 histidylglycine Proteins 0.000 description 11
- 239000000523 sample Substances 0.000 description 11
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 10
- 241000894006 Bacteria Species 0.000 description 10
- 241000282326 Felis catus Species 0.000 description 10
- 108010009504 Gly-Phe-Leu-Gly Proteins 0.000 description 10
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 10
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 10
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 description 10
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 10
- 108010005233 alanylglutamic acid Proteins 0.000 description 10
- 108010003700 lysyl aspartic acid Proteins 0.000 description 10
- 108010009298 lysylglutamic acid Proteins 0.000 description 10
- 108010031719 prolyl-serine Proteins 0.000 description 10
- 238000002255 vaccination Methods 0.000 description 10
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 9
- SEMWSADZTMJELF-BYULHYEWSA-N Asp-Ile-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O SEMWSADZTMJELF-BYULHYEWSA-N 0.000 description 9
- VOORMNJKNBGYGK-YUMQZZPRSA-N Glu-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N VOORMNJKNBGYGK-YUMQZZPRSA-N 0.000 description 9
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 9
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 9
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 9
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 9
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 9
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 9
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 9
- LNMKRJJLEFASGA-BZSNNMDCSA-N Lys-Phe-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LNMKRJJLEFASGA-BZSNNMDCSA-N 0.000 description 9
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 9
- 108010079364 N-glycylalanine Proteins 0.000 description 9
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 9
- GFRIEEKFXOVPIR-RHYQMDGZSA-N Thr-Pro-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O GFRIEEKFXOVPIR-RHYQMDGZSA-N 0.000 description 9
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 9
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 9
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 9
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 9
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 9
- 108010057821 leucylproline Proteins 0.000 description 9
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 9
- 239000000047 product Substances 0.000 description 9
- WEZDRVHTDXTVLT-GJZGRUSLSA-N 2-[[(2s)-2-[[(2s)-2-[(2-aminoacetyl)amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]acetic acid Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 WEZDRVHTDXTVLT-GJZGRUSLSA-N 0.000 description 8
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 8
- ORXCYAFUCSTQGY-FXQIFTODSA-N Asn-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N ORXCYAFUCSTQGY-FXQIFTODSA-N 0.000 description 8
- KZYSHAMXEBPJBD-JRQIVUDYSA-N Asn-Thr-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZYSHAMXEBPJBD-JRQIVUDYSA-N 0.000 description 8
- CMCIMCAQIULNDJ-CIUDSAMLSA-N Asp-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N CMCIMCAQIULNDJ-CIUDSAMLSA-N 0.000 description 8
- XHUCVVHRLNPZSZ-CIUDSAMLSA-N Glu-Gln-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XHUCVVHRLNPZSZ-CIUDSAMLSA-N 0.000 description 8
- HBMRTXJZQDVRFT-DZKIICNBSA-N Glu-Tyr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HBMRTXJZQDVRFT-DZKIICNBSA-N 0.000 description 8
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 8
- LBHOVGUGOBINDL-KKUMJFAQSA-N His-Asp-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O LBHOVGUGOBINDL-KKUMJFAQSA-N 0.000 description 8
- IXQGOKWTQPCIQM-YJRXYDGGSA-N His-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O IXQGOKWTQPCIQM-YJRXYDGGSA-N 0.000 description 8
- DCQMJRSOGCYKTR-GHCJXIJMSA-N Ile-Asp-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DCQMJRSOGCYKTR-GHCJXIJMSA-N 0.000 description 8
- SVZFKLBRCYCIIY-CYDGBPFRSA-N Ile-Pro-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SVZFKLBRCYCIIY-CYDGBPFRSA-N 0.000 description 8
- FGZVGOAAROXFAB-IXOXFDKPSA-N Leu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N)O FGZVGOAAROXFAB-IXOXFDKPSA-N 0.000 description 8
- YIRIDPUGZKHMHT-ACRUOGEOSA-N Leu-Tyr-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YIRIDPUGZKHMHT-ACRUOGEOSA-N 0.000 description 8
- OHMKUHXCDSCOMT-QXEWZRGKSA-N Met-Asn-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHMKUHXCDSCOMT-QXEWZRGKSA-N 0.000 description 8
- YCCUXNNKXDGMAM-KKUMJFAQSA-N Phe-Leu-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YCCUXNNKXDGMAM-KKUMJFAQSA-N 0.000 description 8
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 8
- CGCMNOIQVAXYMA-UNQGMJICSA-N Thr-Met-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CGCMNOIQVAXYMA-UNQGMJICSA-N 0.000 description 8
- GRIUMVXCJDKVPI-IZPVPAKOSA-N Thr-Thr-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GRIUMVXCJDKVPI-IZPVPAKOSA-N 0.000 description 8
- 108010062796 arginyllysine Proteins 0.000 description 8
- 238000006243 chemical reaction Methods 0.000 description 8
- 108010054813 diprotin B Proteins 0.000 description 8
- 108010081985 glycyl-cystinyl-aspartic acid Proteins 0.000 description 8
- 238000009396 hybridization Methods 0.000 description 8
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 8
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 8
- 102000004169 proteins and genes Human genes 0.000 description 8
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 8
- 108010073969 valyllysine Proteins 0.000 description 8
- 239000013598 vector Substances 0.000 description 8
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 7
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 7
- BGGAIXWIZCIFSG-XDTLVQLUSA-N Ala-Tyr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O BGGAIXWIZCIFSG-XDTLVQLUSA-N 0.000 description 7
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 7
- NVUIWHJLPSZZQC-CYDGBPFRSA-N Arg-Ile-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NVUIWHJLPSZZQC-CYDGBPFRSA-N 0.000 description 7
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 7
- RRKCPMGSRIDLNC-AVGNSLFASA-N Asp-Glu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RRKCPMGSRIDLNC-AVGNSLFASA-N 0.000 description 7
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 7
- LTARLVHGOGBRHN-AAEUAGOBSA-N Asp-Trp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O LTARLVHGOGBRHN-AAEUAGOBSA-N 0.000 description 7
- 241000282472 Canis lupus familiaris Species 0.000 description 7
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 7
- QMOSCLNJVKSHHU-YUMQZZPRSA-N Glu-Met-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QMOSCLNJVKSHHU-YUMQZZPRSA-N 0.000 description 7
- HJTSRYLPAYGEEC-SIUGBPQLSA-N Glu-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N HJTSRYLPAYGEEC-SIUGBPQLSA-N 0.000 description 7
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 7
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 7
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 7
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 7
- FJWSJWACLMTDMI-WPRPVWTQSA-N Gly-Met-Val Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O FJWSJWACLMTDMI-WPRPVWTQSA-N 0.000 description 7
- GLACUWHUYFBSPJ-FJXKBIBVSA-N Gly-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GLACUWHUYFBSPJ-FJXKBIBVSA-N 0.000 description 7
- OVDKXUDMKXAZIV-ZPFDUUQYSA-N Ile-Lys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OVDKXUDMKXAZIV-ZPFDUUQYSA-N 0.000 description 7
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 7
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 7
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 7
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 7
- ZQCVMVCVPFYXHZ-SRVKXCTJSA-N Lys-Asn-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN ZQCVMVCVPFYXHZ-SRVKXCTJSA-N 0.000 description 7
- DUTMKEAPLLUGNO-JYJNAYRXSA-N Lys-Glu-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DUTMKEAPLLUGNO-JYJNAYRXSA-N 0.000 description 7
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 7
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 7
- VBGGTAPDGFQMKF-AVGNSLFASA-N Met-Lys-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O VBGGTAPDGFQMKF-AVGNSLFASA-N 0.000 description 7
- 108010047562 NGR peptide Proteins 0.000 description 7
- FISHYTLIMUYTQY-GUBZILKMSA-N Pro-Gln-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 FISHYTLIMUYTQY-GUBZILKMSA-N 0.000 description 7
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 7
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 7
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 7
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 7
- SOUPNXUJAJENFU-SWRJLBSHSA-N Thr-Trp-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O SOUPNXUJAJENFU-SWRJLBSHSA-N 0.000 description 7
- BKIOKSLLAAZYTC-KKHAAJSZSA-N Thr-Val-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O BKIOKSLLAAZYTC-KKHAAJSZSA-N 0.000 description 7
- QARCDOCCDOLJSF-HJPIBITLSA-N Tyr-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QARCDOCCDOLJSF-HJPIBITLSA-N 0.000 description 7
- JAGGEZACYAAMIL-CQDKDKBSSA-N Tyr-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JAGGEZACYAAMIL-CQDKDKBSSA-N 0.000 description 7
- NZBSVMQZQMEUHI-WZLNRYEVSA-N Tyr-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NZBSVMQZQMEUHI-WZLNRYEVSA-N 0.000 description 7
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 7
- NXRAUQGGHPCJIB-RCOVLWMOSA-N Val-Gly-Asn Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O NXRAUQGGHPCJIB-RCOVLWMOSA-N 0.000 description 7
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 7
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 7
- WHNSHJJNWNSTSU-BZSNNMDCSA-N Val-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 WHNSHJJNWNSTSU-BZSNNMDCSA-N 0.000 description 7
- 108010041407 alanylaspartic acid Proteins 0.000 description 7
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 7
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 7
- 235000018102 proteins Nutrition 0.000 description 7
- 108010005834 tyrosyl-alanyl-glycine Proteins 0.000 description 7
- 108010003137 tyrosyltyrosine Proteins 0.000 description 7
- 101150030895 ureAB gene Proteins 0.000 description 7
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 6
- BLTRAARCJYVJKV-QEJZJMRPSA-N Ala-Lys-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccccc1)C(O)=O BLTRAARCJYVJKV-QEJZJMRPSA-N 0.000 description 6
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 6
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 6
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 6
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 6
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 6
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 6
- SCOPAVYZWHPDBA-DCAQKATOSA-N Cys-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CS)N SCOPAVYZWHPDBA-DCAQKATOSA-N 0.000 description 6
- PSERKXGRRADTKA-MNXVOIDGSA-N Gln-Leu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PSERKXGRRADTKA-MNXVOIDGSA-N 0.000 description 6
- BBFCMGBMYIAGRS-AUTRQRHGSA-N Gln-Val-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BBFCMGBMYIAGRS-AUTRQRHGSA-N 0.000 description 6
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 6
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 6
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 6
- VXEFAWJTFAUDJK-AVGNSLFASA-N Glu-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O VXEFAWJTFAUDJK-AVGNSLFASA-N 0.000 description 6
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 6
- RAVLQPXCMRCLKT-KBPBESRZSA-N His-Gly-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RAVLQPXCMRCLKT-KBPBESRZSA-N 0.000 description 6
- PMWSGVRIMIFXQH-KKUMJFAQSA-N His-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1NC=NC=1)C1=CN=CN1 PMWSGVRIMIFXQH-KKUMJFAQSA-N 0.000 description 6
- VUUFXXGKMPLKNH-BZSNNMDCSA-N His-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N VUUFXXGKMPLKNH-BZSNNMDCSA-N 0.000 description 6
- LPXHYGGZJOCAFR-MNXVOIDGSA-N Ile-Glu-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N LPXHYGGZJOCAFR-MNXVOIDGSA-N 0.000 description 6
- ODPKZZLRDNXTJZ-WHOFXGATSA-N Ile-Gly-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ODPKZZLRDNXTJZ-WHOFXGATSA-N 0.000 description 6
- PWDSHAAAFXISLE-SXTJYALSSA-N Ile-Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O PWDSHAAAFXISLE-SXTJYALSSA-N 0.000 description 6
- NZGTYCMLUGYMCV-XUXIUFHCSA-N Ile-Lys-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N NZGTYCMLUGYMCV-XUXIUFHCSA-N 0.000 description 6
- WXLYNEHOGRYNFU-URLPEUOOSA-N Ile-Thr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N WXLYNEHOGRYNFU-URLPEUOOSA-N 0.000 description 6
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 6
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 6
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 6
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 6
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 6
- SJNZALDHDUYDBU-IHRRRGAJSA-N Lys-Arg-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(O)=O SJNZALDHDUYDBU-IHRRRGAJSA-N 0.000 description 6
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 6
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 6
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 6
- HLYIDXAXQIJYIG-CIUDSAMLSA-N Met-Gln-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HLYIDXAXQIJYIG-CIUDSAMLSA-N 0.000 description 6
- GPAHWYRSHCKICP-GUBZILKMSA-N Met-Glu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GPAHWYRSHCKICP-GUBZILKMSA-N 0.000 description 6
- KRLKICLNEICJGV-STQMWFEESA-N Met-Phe-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 KRLKICLNEICJGV-STQMWFEESA-N 0.000 description 6
- VSJAPSMRFYUOKS-IUCAKERBSA-N Met-Pro-Gly Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O VSJAPSMRFYUOKS-IUCAKERBSA-N 0.000 description 6
- WSPQHZOMTFFWGH-XGEHTFHBSA-N Met-Thr-Cys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(O)=O WSPQHZOMTFFWGH-XGEHTFHBSA-N 0.000 description 6
- VYDLZDRMOFYOGV-TUAOUCFPSA-N Met-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N VYDLZDRMOFYOGV-TUAOUCFPSA-N 0.000 description 6
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 6
- RYQWALWYQWBUKN-FHWLQOOXSA-N Phe-Phe-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RYQWALWYQWBUKN-FHWLQOOXSA-N 0.000 description 6
- IWZRODDWOSIXPZ-IRXDYDNUSA-N Phe-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 IWZRODDWOSIXPZ-IRXDYDNUSA-N 0.000 description 6
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 6
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 6
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 6
- LKJCABTUFGTPPY-HJGDQZAQSA-N Thr-Pro-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O LKJCABTUFGTPPY-HJGDQZAQSA-N 0.000 description 6
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 6
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 6
- PQSNETRGCRUOGP-KKHAAJSZSA-N Val-Thr-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O PQSNETRGCRUOGP-KKHAAJSZSA-N 0.000 description 6
- 230000001580 bacterial effect Effects 0.000 description 6
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 6
- 108010038320 lysylphenylalanine Proteins 0.000 description 6
- 210000002966 serum Anatomy 0.000 description 6
- 241000894007 species Species 0.000 description 6
- 239000003981 vehicle Substances 0.000 description 6
- XQGIRPGAVLFKBJ-CIUDSAMLSA-N Ala-Asn-Lys Chemical compound N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)O XQGIRPGAVLFKBJ-CIUDSAMLSA-N 0.000 description 5
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 5
- NIZKGBJVCMRDKO-KWQFWETISA-N Ala-Gly-Tyr Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NIZKGBJVCMRDKO-KWQFWETISA-N 0.000 description 5
- NMXKFWOEASXOGB-QSFUFRPTSA-N Ala-Ile-His Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NMXKFWOEASXOGB-QSFUFRPTSA-N 0.000 description 5
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 5
- RAAWHFXHAACDFT-FXQIFTODSA-N Ala-Met-Asn Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CC(N)=O)C(O)=O RAAWHFXHAACDFT-FXQIFTODSA-N 0.000 description 5
- GFEDXKNBZMPEDM-KZVJFYERSA-N Ala-Met-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFEDXKNBZMPEDM-KZVJFYERSA-N 0.000 description 5
- UAOSDDXCTBIPCA-QXEWZRGKSA-N Arg-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UAOSDDXCTBIPCA-QXEWZRGKSA-N 0.000 description 5
- OISWSORSLQOGFV-AVGNSLFASA-N Arg-Met-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCCN=C(N)N OISWSORSLQOGFV-AVGNSLFASA-N 0.000 description 5
- PIWWUBYJNONVTJ-ZLUOBGJFSA-N Asn-Asp-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N PIWWUBYJNONVTJ-ZLUOBGJFSA-N 0.000 description 5
- FTSAJSADJCMDHH-CIUDSAMLSA-N Asn-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N FTSAJSADJCMDHH-CIUDSAMLSA-N 0.000 description 5
- OROMFUQQTSWUTI-IHRRRGAJSA-N Asn-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OROMFUQQTSWUTI-IHRRRGAJSA-N 0.000 description 5
- IJHUZMGJRGNXIW-CIUDSAMLSA-N Asp-Glu-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IJHUZMGJRGNXIW-CIUDSAMLSA-N 0.000 description 5
- QCLHLXDWRKOHRR-GUBZILKMSA-N Asp-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N QCLHLXDWRKOHRR-GUBZILKMSA-N 0.000 description 5
- XLILXFRAKOYEJX-GUBZILKMSA-N Asp-Leu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLILXFRAKOYEJX-GUBZILKMSA-N 0.000 description 5
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 5
- UZFHNLYQWMGUHU-DCAQKATOSA-N Asp-Lys-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UZFHNLYQWMGUHU-DCAQKATOSA-N 0.000 description 5
- CTWCFPWFIGRAEP-CIUDSAMLSA-N Asp-Lys-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O CTWCFPWFIGRAEP-CIUDSAMLSA-N 0.000 description 5
- XXAMCEGRCZQGEM-ZLUOBGJFSA-N Asp-Ser-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O XXAMCEGRCZQGEM-ZLUOBGJFSA-N 0.000 description 5
- IWLZBRTUIVXZJD-OLHMAJIHSA-N Asp-Thr-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O IWLZBRTUIVXZJD-OLHMAJIHSA-N 0.000 description 5
- MFDPBZAFCRKYEY-LAEOZQHASA-N Asp-Val-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFDPBZAFCRKYEY-LAEOZQHASA-N 0.000 description 5
- VAZZOGXDUQSVQF-NUMRIWBASA-N Glu-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)O VAZZOGXDUQSVQF-NUMRIWBASA-N 0.000 description 5
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 5
- NPSWCZIRBAYNSB-JHEQGTHGSA-N Gly-Gln-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPSWCZIRBAYNSB-JHEQGTHGSA-N 0.000 description 5
- PDAWDNVHMUKWJR-ZETCQYMHSA-N Gly-Gly-His Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 PDAWDNVHMUKWJR-ZETCQYMHSA-N 0.000 description 5
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 5
- CLNSYANKYVMZNM-UWVGGRQHSA-N Gly-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CLNSYANKYVMZNM-UWVGGRQHSA-N 0.000 description 5
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 5
- XHVONGZZVUUORG-WEDXCCLWSA-N Gly-Thr-Lys Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN XHVONGZZVUUORG-WEDXCCLWSA-N 0.000 description 5
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 5
- NQKRILCJYCASDV-QWRGUYRKSA-N His-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 NQKRILCJYCASDV-QWRGUYRKSA-N 0.000 description 5
- DEOQGJUXUQGUJN-KKUMJFAQSA-N His-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N DEOQGJUXUQGUJN-KKUMJFAQSA-N 0.000 description 5
- XHQYFGPIRUHQIB-PBCZWWQYSA-N His-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CN=CN1 XHQYFGPIRUHQIB-PBCZWWQYSA-N 0.000 description 5
- UKTUOMWSJPXODT-GUDRVLHUSA-N Ile-Asn-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N UKTUOMWSJPXODT-GUDRVLHUSA-N 0.000 description 5
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 5
- PMMMQRVUMVURGJ-XUXIUFHCSA-N Ile-Leu-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O PMMMQRVUMVURGJ-XUXIUFHCSA-N 0.000 description 5
- XOZOSAUOGRPCES-STECZYCISA-N Ile-Pro-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XOZOSAUOGRPCES-STECZYCISA-N 0.000 description 5
- JJQQGCMKLOEGAV-OSUNSFLBSA-N Ile-Thr-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)O)N JJQQGCMKLOEGAV-OSUNSFLBSA-N 0.000 description 5
- 108091029795 Intergenic region Proteins 0.000 description 5
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 5
- 125000003412 L-alanyl group Chemical group [H]N([H])[C@@](C([H])([H])[H])(C(=O)[*])[H] 0.000 description 5
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 5
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 5
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 5
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 5
- VHXMZJGOKIMETG-CQDKDKBSSA-N Lys-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCCN)N VHXMZJGOKIMETG-CQDKDKBSSA-N 0.000 description 5
- QKXZCUCBFPEXNK-KKUMJFAQSA-N Lys-Leu-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 QKXZCUCBFPEXNK-KKUMJFAQSA-N 0.000 description 5
- PLDJDCJLRCYPJB-VOAKCMCISA-N Lys-Lys-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PLDJDCJLRCYPJB-VOAKCMCISA-N 0.000 description 5
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 5
- PPNCMJARTHYNEC-MEYUZBJRSA-N Lys-Tyr-Thr Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)CC1=CC=C(O)C=C1 PPNCMJARTHYNEC-MEYUZBJRSA-N 0.000 description 5
- OHXUUQDOBQKSNB-AVGNSLFASA-N Lys-Val-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OHXUUQDOBQKSNB-AVGNSLFASA-N 0.000 description 5
- GAELMDJMQDUDLJ-BQBZGAKWSA-N Met-Ala-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O GAELMDJMQDUDLJ-BQBZGAKWSA-N 0.000 description 5
- ACYHZNZHIZWLQF-BQBZGAKWSA-N Met-Asn-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ACYHZNZHIZWLQF-BQBZGAKWSA-N 0.000 description 5
- GVIVXNFKJQFTCE-YUMQZZPRSA-N Met-Gly-Gln Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O GVIVXNFKJQFTCE-YUMQZZPRSA-N 0.000 description 5
- IQJMEDDVOGMTKT-SRVKXCTJSA-N Met-Val-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IQJMEDDVOGMTKT-SRVKXCTJSA-N 0.000 description 5
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 5
- XMPUYNHKEPFERE-IHRRRGAJSA-N Phe-Asp-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMPUYNHKEPFERE-IHRRRGAJSA-N 0.000 description 5
- AAERWTUHZKLDLC-IHRRRGAJSA-N Phe-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O AAERWTUHZKLDLC-IHRRRGAJSA-N 0.000 description 5
- XOHJOMKCRLHGCY-UNQGMJICSA-N Phe-Pro-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOHJOMKCRLHGCY-UNQGMJICSA-N 0.000 description 5
- VJLJGKQAOQJXJG-CIUDSAMLSA-N Pro-Asp-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJLJGKQAOQJXJG-CIUDSAMLSA-N 0.000 description 5
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 5
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 5
- PGSWNLRYYONGPE-JYJNAYRXSA-N Pro-Val-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PGSWNLRYYONGPE-JYJNAYRXSA-N 0.000 description 5
- DWUIECHTAMYEFL-XVYDVKMFSA-N Ser-Ala-His Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DWUIECHTAMYEFL-XVYDVKMFSA-N 0.000 description 5
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 5
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 5
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 5
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 5
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 5
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 5
- HTHCZRWCFXMENJ-KKUMJFAQSA-N Tyr-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HTHCZRWCFXMENJ-KKUMJFAQSA-N 0.000 description 5
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 5
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 description 5
- FBVUOEYVGNMRMD-NAKRPEOUSA-N Val-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N FBVUOEYVGNMRMD-NAKRPEOUSA-N 0.000 description 5
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 5
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 5
- 150000001875 compounds Chemical class 0.000 description 5
- 108010037850 glycylvaline Proteins 0.000 description 5
- 108010040030 histidinoalanine Proteins 0.000 description 5
- 108010018006 histidylserine Proteins 0.000 description 5
- 238000011534 incubation Methods 0.000 description 5
- 230000006698 induction Effects 0.000 description 5
- 108010017391 lysylvaline Proteins 0.000 description 5
- 108010005942 methionylglycine Proteins 0.000 description 5
- 108010051242 phenylalanylserine Proteins 0.000 description 5
- 108010029020 prolylglycine Proteins 0.000 description 5
- 108010053725 prolylvaline Proteins 0.000 description 5
- 108010071207 serylmethionine Proteins 0.000 description 5
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 4
- BVSGPHDECMJBDE-HGNGGELXSA-N Ala-Glu-His Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BVSGPHDECMJBDE-HGNGGELXSA-N 0.000 description 4
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 4
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 4
- VYMJAWXRWHJIMS-LKTVYLICSA-N Ala-Tyr-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VYMJAWXRWHJIMS-LKTVYLICSA-N 0.000 description 4
- GCTANJIJJROSLH-GVARAGBVSA-N Ala-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C)N GCTANJIJJROSLH-GVARAGBVSA-N 0.000 description 4
- PHHRSPBBQUFULD-UWVGGRQHSA-N Arg-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N PHHRSPBBQUFULD-UWVGGRQHSA-N 0.000 description 4
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 4
- HGKHPCFTRQDHCU-IUCAKERBSA-N Arg-Pro-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HGKHPCFTRQDHCU-IUCAKERBSA-N 0.000 description 4
- IZSMEUDYADKZTJ-KJEVXHAQSA-N Arg-Tyr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IZSMEUDYADKZTJ-KJEVXHAQSA-N 0.000 description 4
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 4
- FUHFYEKSGWOWGZ-XHNCKOQMSA-N Asn-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O FUHFYEKSGWOWGZ-XHNCKOQMSA-N 0.000 description 4
- ULRPXVNMIIYDDJ-ACZMJKKPSA-N Asn-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N ULRPXVNMIIYDDJ-ACZMJKKPSA-N 0.000 description 4
- MYCSPQIARXTUTP-SRVKXCTJSA-N Asn-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N MYCSPQIARXTUTP-SRVKXCTJSA-N 0.000 description 4
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 4
- ANRZCQXIXGDXLR-CWRNSKLLSA-N Asn-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC(=O)N)N)C(=O)O ANRZCQXIXGDXLR-CWRNSKLLSA-N 0.000 description 4
- WSOKZUVWBXVJHX-CIUDSAMLSA-N Asp-Arg-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O WSOKZUVWBXVJHX-CIUDSAMLSA-N 0.000 description 4
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 4
- PRVVCRZLTJNPCS-FXQIFTODSA-N Cys-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N PRVVCRZLTJNPCS-FXQIFTODSA-N 0.000 description 4
- YMBAVNPKBWHDAW-CIUDSAMLSA-N Cys-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N YMBAVNPKBWHDAW-CIUDSAMLSA-N 0.000 description 4
- 238000002965 ELISA Methods 0.000 description 4
- 241000588724 Escherichia coli Species 0.000 description 4
- YPFFHGRJCUBXPX-NHCYSSNCSA-N Gln-Pro-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O)C(O)=O YPFFHGRJCUBXPX-NHCYSSNCSA-N 0.000 description 4
- FHPXTPQBODWBIY-CIUDSAMLSA-N Glu-Ala-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHPXTPQBODWBIY-CIUDSAMLSA-N 0.000 description 4
- RLZBLVSJDFHDBL-KBIXCLLPSA-N Glu-Ala-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RLZBLVSJDFHDBL-KBIXCLLPSA-N 0.000 description 4
- FYBSCGZLICNOBA-XQXXSGGOSA-N Glu-Ala-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FYBSCGZLICNOBA-XQXXSGGOSA-N 0.000 description 4
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 4
- DVLZZEPUNFEUBW-AVGNSLFASA-N Glu-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N DVLZZEPUNFEUBW-AVGNSLFASA-N 0.000 description 4
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 4
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 description 4
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 4
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 4
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 4
- YKJUITHASJAGHO-HOTGVXAUSA-N Gly-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN YKJUITHASJAGHO-HOTGVXAUSA-N 0.000 description 4
- ICUTTWWCDIIIEE-BQBZGAKWSA-N Gly-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN ICUTTWWCDIIIEE-BQBZGAKWSA-N 0.000 description 4
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 4
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 4
- 241001495142 Helicobacter heilmannii Species 0.000 description 4
- ZJSMFRTVYSLKQU-DJFWLOJKSA-N His-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ZJSMFRTVYSLKQU-DJFWLOJKSA-N 0.000 description 4
- BILZDIPAKWZFSG-PYJNHQTQSA-N His-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N BILZDIPAKWZFSG-PYJNHQTQSA-N 0.000 description 4
- HERITAGIPLEJMT-GVARAGBVSA-N Ile-Ala-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HERITAGIPLEJMT-GVARAGBVSA-N 0.000 description 4
- FVEWRQXNISSYFO-ZPFDUUQYSA-N Ile-Arg-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FVEWRQXNISSYFO-ZPFDUUQYSA-N 0.000 description 4
- XLCZWMJPVGRWHJ-KQXIARHKSA-N Ile-Glu-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N XLCZWMJPVGRWHJ-KQXIARHKSA-N 0.000 description 4
- IALVDKNUFSTICJ-GMOBBJLQSA-N Ile-Met-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IALVDKNUFSTICJ-GMOBBJLQSA-N 0.000 description 4
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 4
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 4
- QGXQHJQPAPMACW-PPCPHDFISA-N Ile-Thr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QGXQHJQPAPMACW-PPCPHDFISA-N 0.000 description 4
- OXKYZSRZKBTVEY-ZPFDUUQYSA-N Leu-Asn-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OXKYZSRZKBTVEY-ZPFDUUQYSA-N 0.000 description 4
- XVSJMWYYLHPDKY-DCAQKATOSA-N Leu-Asp-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O XVSJMWYYLHPDKY-DCAQKATOSA-N 0.000 description 4
- GLBNEGIOFRVRHO-JYJNAYRXSA-N Leu-Gln-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLBNEGIOFRVRHO-JYJNAYRXSA-N 0.000 description 4
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 4
- YWYQSLOTVIRCFE-SRVKXCTJSA-N Leu-His-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O YWYQSLOTVIRCFE-SRVKXCTJSA-N 0.000 description 4
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 4
- CNWDWAMPKVYJJB-NUTKFTJISA-N Leu-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CNWDWAMPKVYJJB-NUTKFTJISA-N 0.000 description 4
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 4
- GGAPIOORBXHMNY-ULQDDVLXSA-N Lys-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)O GGAPIOORBXHMNY-ULQDDVLXSA-N 0.000 description 4
- WLCYCADOWRMSAJ-CIUDSAMLSA-N Lys-Asn-Cys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O WLCYCADOWRMSAJ-CIUDSAMLSA-N 0.000 description 4
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 4
- FGMHXLULNHTPID-KKUMJFAQSA-N Lys-His-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CN=CN1 FGMHXLULNHTPID-KKUMJFAQSA-N 0.000 description 4
- MSSJJDVQTFTLIF-KBPBESRZSA-N Lys-Phe-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O MSSJJDVQTFTLIF-KBPBESRZSA-N 0.000 description 4
- BVRNWWHJYNPJDG-XIRDDKMYSA-N Lys-Trp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N BVRNWWHJYNPJDG-XIRDDKMYSA-N 0.000 description 4
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 4
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 4
- LGBVMDMZZFYSFW-HJWJTTGWSA-N Phe-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CC=CC=C1)N LGBVMDMZZFYSFW-HJWJTTGWSA-N 0.000 description 4
- LWPMGKSZPKFKJD-DZKIICNBSA-N Phe-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O LWPMGKSZPKFKJD-DZKIICNBSA-N 0.000 description 4
- MMYUOSCXBJFUNV-QWRGUYRKSA-N Phe-Gly-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N MMYUOSCXBJFUNV-QWRGUYRKSA-N 0.000 description 4
- XEXSSIBQYNKFBX-KBPBESRZSA-N Phe-Gly-His Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CC=CC=C1 XEXSSIBQYNKFBX-KBPBESRZSA-N 0.000 description 4
- DNAXXTQSTKOHFO-QEJZJMRPSA-N Phe-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DNAXXTQSTKOHFO-QEJZJMRPSA-N 0.000 description 4
- KLXQWABNAWDRAY-ACRUOGEOSA-N Phe-Lys-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 KLXQWABNAWDRAY-ACRUOGEOSA-N 0.000 description 4
- HBXAOEBRGLCLIW-AVGNSLFASA-N Phe-Ser-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HBXAOEBRGLCLIW-AVGNSLFASA-N 0.000 description 4
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 4
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 4
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 4
- KYKKKSWGEPFUMR-NAKRPEOUSA-N Ser-Arg-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KYKKKSWGEPFUMR-NAKRPEOUSA-N 0.000 description 4
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 4
- CRZRTKAVUUGKEQ-ACZMJKKPSA-N Ser-Gln-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CRZRTKAVUUGKEQ-ACZMJKKPSA-N 0.000 description 4
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 4
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 4
- CRJZZXMAADSBBQ-SRVKXCTJSA-N Ser-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO CRJZZXMAADSBBQ-SRVKXCTJSA-N 0.000 description 4
- NAXBBCLCEOTAIG-RHYQMDGZSA-N Thr-Arg-Lys Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O NAXBBCLCEOTAIG-RHYQMDGZSA-N 0.000 description 4
- SWIKDOUVROTZCW-GCJQMDKQSA-N Thr-Asn-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O SWIKDOUVROTZCW-GCJQMDKQSA-N 0.000 description 4
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 4
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 4
- PAXANSWUSVPFNK-IUKAMOBKSA-N Thr-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N PAXANSWUSVPFNK-IUKAMOBKSA-N 0.000 description 4
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 4
- JMBRNXUOLJFURW-BEAPCOKYSA-N Thr-Phe-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N)O JMBRNXUOLJFURW-BEAPCOKYSA-N 0.000 description 4
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 4
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 4
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 4
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 4
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 4
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 4
- CXWJFWAZIVWBOS-XQQFMLRXSA-N Val-Lys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CXWJFWAZIVWBOS-XQQFMLRXSA-N 0.000 description 4
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 4
- 108010025306 histidylleucine Proteins 0.000 description 4
- 108010034529 leucyl-lysine Proteins 0.000 description 4
- 108010064235 lysylglycine Proteins 0.000 description 4
- 239000013612 plasmid Substances 0.000 description 4
- 108010070643 prolylglutamic acid Proteins 0.000 description 4
- 108010026333 seryl-proline Proteins 0.000 description 4
- 238000006467 substitution reaction Methods 0.000 description 4
- 108010084932 tryptophyl-proline Proteins 0.000 description 4
- CXQODNIBUNQWAS-CIUDSAMLSA-N Ala-Gln-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CXQODNIBUNQWAS-CIUDSAMLSA-N 0.000 description 3
- IHRGVZXPTIQNIP-NAKRPEOUSA-N Ala-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C)N IHRGVZXPTIQNIP-NAKRPEOUSA-N 0.000 description 3
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 3
- QPOARHANPULOTM-GMOBBJLQSA-N Arg-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N QPOARHANPULOTM-GMOBBJLQSA-N 0.000 description 3
- XLWSGICNBZGYTA-CIUDSAMLSA-N Arg-Glu-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XLWSGICNBZGYTA-CIUDSAMLSA-N 0.000 description 3
- YKZJPIPFKGYHKY-DCAQKATOSA-N Arg-Leu-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKZJPIPFKGYHKY-DCAQKATOSA-N 0.000 description 3
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 3
- APHUDFFMXFYRKP-CIUDSAMLSA-N Asn-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N APHUDFFMXFYRKP-CIUDSAMLSA-N 0.000 description 3
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 3
- HPNDKUOLNRVRAY-BIIVOSGPSA-N Asn-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N)C(=O)O HPNDKUOLNRVRAY-BIIVOSGPSA-N 0.000 description 3
- BFOYULZBKYOKAN-OLHMAJIHSA-N Asp-Asp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFOYULZBKYOKAN-OLHMAJIHSA-N 0.000 description 3
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 3
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 3
- HSGOFISJLFDMBJ-CIUDSAMLSA-N Asp-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N HSGOFISJLFDMBJ-CIUDSAMLSA-N 0.000 description 3
- UAXIKORUDGGIGA-DCAQKATOSA-N Asp-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O UAXIKORUDGGIGA-DCAQKATOSA-N 0.000 description 3
- ZVGRHIRJLWBWGJ-ACZMJKKPSA-N Asp-Ser-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVGRHIRJLWBWGJ-ACZMJKKPSA-N 0.000 description 3
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 description 3
- OWAFTBLVZNSIFO-SRVKXCTJSA-N Cys-His-His Chemical compound N[C@@H](CS)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O OWAFTBLVZNSIFO-SRVKXCTJSA-N 0.000 description 3
- LVNILKSSFHCSJZ-IHRRRGAJSA-N Gln-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LVNILKSSFHCSJZ-IHRRRGAJSA-N 0.000 description 3
- KHHDJQRWIFHXHS-NRPADANISA-N Gln-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHHDJQRWIFHXHS-NRPADANISA-N 0.000 description 3
- PBEQPAZRHDVJQI-SRVKXCTJSA-N Glu-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N PBEQPAZRHDVJQI-SRVKXCTJSA-N 0.000 description 3
- OJGLIOXAKGFFDW-SRVKXCTJSA-N Glu-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N OJGLIOXAKGFFDW-SRVKXCTJSA-N 0.000 description 3
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 3
- BIHMNDPWRUROFZ-JYJNAYRXSA-N Glu-His-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BIHMNDPWRUROFZ-JYJNAYRXSA-N 0.000 description 3
- UDEPRBFQTWGLCW-CIUDSAMLSA-N Glu-Pro-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O UDEPRBFQTWGLCW-CIUDSAMLSA-N 0.000 description 3
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 3
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 3
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 3
- GNPVTZJUUBPZKW-WDSKDSINSA-N Gly-Gln-Ser Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GNPVTZJUUBPZKW-WDSKDSINSA-N 0.000 description 3
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 3
- YLEIWGJJBFBFHC-KBPBESRZSA-N Gly-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 YLEIWGJJBFBFHC-KBPBESRZSA-N 0.000 description 3
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 3
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 3
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 3
- 241000238631 Hexapoda Species 0.000 description 3
- ZNPRMNDAFQKATM-LKTVYLICSA-N His-Ala-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZNPRMNDAFQKATM-LKTVYLICSA-N 0.000 description 3
- PGTISAJTWZPFGN-PEXQALLHSA-N His-Gly-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O PGTISAJTWZPFGN-PEXQALLHSA-N 0.000 description 3
- JIUYRPFQJJRSJB-QWRGUYRKSA-N His-His-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)NCC(O)=O)C1=CN=CN1 JIUYRPFQJJRSJB-QWRGUYRKSA-N 0.000 description 3
- LBQAHBIVXQSBIR-HVTMNAMFSA-N His-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N LBQAHBIVXQSBIR-HVTMNAMFSA-N 0.000 description 3
- VFBZWZXKCVBTJR-SRVKXCTJSA-N His-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VFBZWZXKCVBTJR-SRVKXCTJSA-N 0.000 description 3
- WYSJPCTWSBJFCO-AVGNSLFASA-N His-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CN=CN1)N WYSJPCTWSBJFCO-AVGNSLFASA-N 0.000 description 3
- SGLXGEDPYJPGIQ-ACRUOGEOSA-N His-Phe-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N SGLXGEDPYJPGIQ-ACRUOGEOSA-N 0.000 description 3
- CGAMSLMBYJHMDY-ONGXEEELSA-N His-Val-Gly Chemical compound CC(C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N CGAMSLMBYJHMDY-ONGXEEELSA-N 0.000 description 3
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 3
- WUEIUSDAECDLQO-NAKRPEOUSA-N Ile-Ala-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)O)N WUEIUSDAECDLQO-NAKRPEOUSA-N 0.000 description 3
- AQTWDZDISVGCAC-CFMVVWHZSA-N Ile-Asp-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AQTWDZDISVGCAC-CFMVVWHZSA-N 0.000 description 3
- MTFVYKQRLXYAQN-LAEOZQHASA-N Ile-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O MTFVYKQRLXYAQN-LAEOZQHASA-N 0.000 description 3
- LWWILHPVAKKLQS-QXEWZRGKSA-N Ile-Gly-Met Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCSC)C(=O)O)N LWWILHPVAKKLQS-QXEWZRGKSA-N 0.000 description 3
- URWXDJAEEGBADB-TUBUOCAGSA-N Ile-His-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N URWXDJAEEGBADB-TUBUOCAGSA-N 0.000 description 3
- CAHCWMVNBZJVAW-NAKRPEOUSA-N Ile-Pro-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)O)N CAHCWMVNBZJVAW-NAKRPEOUSA-N 0.000 description 3
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 3
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 3
- 125000000570 L-alpha-aspartyl group Chemical group [H]OC(=O)C([H])([H])[C@]([H])(N([H])[H])C(*)=O 0.000 description 3
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 3
- FOEHRHOBWFQSNW-KATARQTJSA-N Leu-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(C)C)N)O FOEHRHOBWFQSNW-KATARQTJSA-N 0.000 description 3
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 3
- VZBIUJURDLFFOE-IHRRRGAJSA-N Leu-His-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VZBIUJURDLFFOE-IHRRRGAJSA-N 0.000 description 3
- KXODZBLFVFSLAI-AVGNSLFASA-N Leu-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KXODZBLFVFSLAI-AVGNSLFASA-N 0.000 description 3
- BJWKOATWNQJPSK-SRVKXCTJSA-N Leu-Met-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BJWKOATWNQJPSK-SRVKXCTJSA-N 0.000 description 3
- JVTYXRRFZCEPPK-RHYQMDGZSA-N Leu-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(C)C)N)O JVTYXRRFZCEPPK-RHYQMDGZSA-N 0.000 description 3
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 3
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 3
- OVIVOCSURJYCTM-GUBZILKMSA-N Lys-Asp-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O OVIVOCSURJYCTM-GUBZILKMSA-N 0.000 description 3
- PINHPJWGVBKQII-SRVKXCTJSA-N Lys-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N PINHPJWGVBKQII-SRVKXCTJSA-N 0.000 description 3
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 3
- ALGGDNMLQNFVIZ-SRVKXCTJSA-N Lys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ALGGDNMLQNFVIZ-SRVKXCTJSA-N 0.000 description 3
- LKDXINHHSWFFJC-SRVKXCTJSA-N Lys-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N LKDXINHHSWFFJC-SRVKXCTJSA-N 0.000 description 3
- FYRUJIJAUPHUNB-IUCAKERBSA-N Met-Gly-Arg Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N FYRUJIJAUPHUNB-IUCAKERBSA-N 0.000 description 3
- PZUUMQPMHBJJKE-AVGNSLFASA-N Met-Leu-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCNC(N)=N PZUUMQPMHBJJKE-AVGNSLFASA-N 0.000 description 3
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 3
- SWZKMTDPQXLQRD-XVSYOHENSA-N Phe-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWZKMTDPQXLQRD-XVSYOHENSA-N 0.000 description 3
- LRBSWBVUCLLRLU-BZSNNMDCSA-N Phe-Leu-Lys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1ccccc1)C(=O)N[C@@H](CCCCN)C(O)=O LRBSWBVUCLLRLU-BZSNNMDCSA-N 0.000 description 3
- JSGWNFKWZNPDAV-YDHLFZDLSA-N Phe-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JSGWNFKWZNPDAV-YDHLFZDLSA-N 0.000 description 3
- XUSDDSLCRPUKLP-QXEWZRGKSA-N Pro-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 XUSDDSLCRPUKLP-QXEWZRGKSA-N 0.000 description 3
- JRQCDSNPRNGWRG-AVGNSLFASA-N Pro-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@@H]2CCCN2 JRQCDSNPRNGWRG-AVGNSLFASA-N 0.000 description 3
- MHHQQZIFLWFZGR-DCAQKATOSA-N Pro-Lys-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O MHHQQZIFLWFZGR-DCAQKATOSA-N 0.000 description 3
- WFIVLLFYUZZWOD-RHYQMDGZSA-N Pro-Lys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WFIVLLFYUZZWOD-RHYQMDGZSA-N 0.000 description 3
- RMJZWERKFFNNNS-XGEHTFHBSA-N Pro-Thr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMJZWERKFFNNNS-XGEHTFHBSA-N 0.000 description 3
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 3
- MOQDPPUMFSMYOM-KKUMJFAQSA-N Ser-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CO)N MOQDPPUMFSMYOM-KKUMJFAQSA-N 0.000 description 3
- QPPYAWVLAVXISR-DCAQKATOSA-N Ser-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QPPYAWVLAVXISR-DCAQKATOSA-N 0.000 description 3
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 3
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 3
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 3
- STGXWWBXWXZOER-MBLNEYKQSA-N Thr-Ala-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 STGXWWBXWXZOER-MBLNEYKQSA-N 0.000 description 3
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 3
- WPAKPLPGQNUXGN-OSUNSFLBSA-N Thr-Ile-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WPAKPLPGQNUXGN-OSUNSFLBSA-N 0.000 description 3
- YJCVECXVYHZOBK-KNZXXDILSA-N Thr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H]([C@@H](C)O)N YJCVECXVYHZOBK-KNZXXDILSA-N 0.000 description 3
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 3
- NZRUWPIYECBYRK-HTUGSXCWSA-N Thr-Phe-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O NZRUWPIYECBYRK-HTUGSXCWSA-N 0.000 description 3
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 3
- PELIQFPESHBTMA-WLTAIBSBSA-N Thr-Tyr-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 PELIQFPESHBTMA-WLTAIBSBSA-N 0.000 description 3
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 3
- JKUZFODWJGEQAP-KBPBESRZSA-N Tyr-Gly-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O JKUZFODWJGEQAP-KBPBESRZSA-N 0.000 description 3
- ZZDYJFVIKVSUFA-WLTAIBSBSA-N Tyr-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ZZDYJFVIKVSUFA-WLTAIBSBSA-N 0.000 description 3
- LDKDSFQSEUOCOO-RPTUDFQQSA-N Tyr-Thr-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LDKDSFQSEUOCOO-RPTUDFQQSA-N 0.000 description 3
- GPLTZEMVOCZVAV-UFYCRDLUSA-N Tyr-Tyr-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 GPLTZEMVOCZVAV-UFYCRDLUSA-N 0.000 description 3
- BYOHPUZJVXWHAE-BYULHYEWSA-N Val-Asn-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N BYOHPUZJVXWHAE-BYULHYEWSA-N 0.000 description 3
- NMPXRFYMZDIBRF-ZOBUZTSGSA-N Val-Asn-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N NMPXRFYMZDIBRF-ZOBUZTSGSA-N 0.000 description 3
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 3
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 3
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 3
- WSUWDIVCPOJFCX-TUAOUCFPSA-N Val-Met-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N WSUWDIVCPOJFCX-TUAOUCFPSA-N 0.000 description 3
- MJOUSKQHAIARKI-JYJNAYRXSA-N Val-Phe-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 MJOUSKQHAIARKI-JYJNAYRXSA-N 0.000 description 3
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 3
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 3
- 108010070944 alanylhistidine Proteins 0.000 description 3
- 108010077245 asparaginyl-proline Proteins 0.000 description 3
- 108010068265 aspartyltyrosine Proteins 0.000 description 3
- 239000000872 buffer Substances 0.000 description 3
- 238000004113 cell culture Methods 0.000 description 3
- 239000003795 chemical substances by application Substances 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 3
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 3
- 108010089804 glycyl-threonine Proteins 0.000 description 3
- 108010020688 glycylhistidine Proteins 0.000 description 3
- 108010077515 glycylproline Proteins 0.000 description 3
- 230000035772 mutation Effects 0.000 description 3
- 244000045947 parasite Species 0.000 description 3
- 238000011533 pre-incubation Methods 0.000 description 3
- 108010079317 prolyl-tyrosine Proteins 0.000 description 3
- 238000000746 purification Methods 0.000 description 3
- 210000002784 stomach Anatomy 0.000 description 3
- GVJHHUAWPYXKBD-UHFFFAOYSA-N (±)-α-Tocopherol Chemical compound OC1=C(C)C(C)=C2OC(CCCC(C)CCCC(C)CCCC(C)C)(C)CCC2=C1C GVJHHUAWPYXKBD-UHFFFAOYSA-N 0.000 description 2
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 2
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 2
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 2
- JEPNLGMEZMCFEX-QSFUFRPTSA-N Ala-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C)N JEPNLGMEZMCFEX-QSFUFRPTSA-N 0.000 description 2
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 2
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 2
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 2
- DJAIOAKQIOGULM-DCAQKATOSA-N Arg-Glu-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O DJAIOAKQIOGULM-DCAQKATOSA-N 0.000 description 2
- IOTKDTZEEBZNCM-UGYAYLCHSA-N Asn-Asn-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOTKDTZEEBZNCM-UGYAYLCHSA-N 0.000 description 2
- GNKVBRYFXYWXAB-WDSKDSINSA-N Asn-Glu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O GNKVBRYFXYWXAB-WDSKDSINSA-N 0.000 description 2
- OLVIPTLKNSAYRJ-YUMQZZPRSA-N Asn-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N OLVIPTLKNSAYRJ-YUMQZZPRSA-N 0.000 description 2
- NKLRWRRVYGQNIH-GHCJXIJMSA-N Asn-Ile-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O NKLRWRRVYGQNIH-GHCJXIJMSA-N 0.000 description 2
- PBFXCUOEGVJTMV-QXEWZRGKSA-N Asn-Met-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O PBFXCUOEGVJTMV-QXEWZRGKSA-N 0.000 description 2
- ZAESWDKAMDVHLL-RCOVLWMOSA-N Asn-Val-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O ZAESWDKAMDVHLL-RCOVLWMOSA-N 0.000 description 2
- QRULNKJGYQQZMW-ZLUOBGJFSA-N Asp-Asn-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QRULNKJGYQQZMW-ZLUOBGJFSA-N 0.000 description 2
- XJQRWGXKUSDEFI-ACZMJKKPSA-N Asp-Glu-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XJQRWGXKUSDEFI-ACZMJKKPSA-N 0.000 description 2
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 2
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 2
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 2
- XWSIYTYNLKCLJB-CIUDSAMLSA-N Asp-Lys-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O XWSIYTYNLKCLJB-CIUDSAMLSA-N 0.000 description 2
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 2
- HXVILZUZXFLVEN-DCAQKATOSA-N Asp-Met-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O HXVILZUZXFLVEN-DCAQKATOSA-N 0.000 description 2
- JUWISGAGWSDGDH-KKUMJFAQSA-N Asp-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=CC=C1 JUWISGAGWSDGDH-KKUMJFAQSA-N 0.000 description 2
- BPAUXFVCSYQDQX-JRQIVUDYSA-N Asp-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)O)N)O BPAUXFVCSYQDQX-JRQIVUDYSA-N 0.000 description 2
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 2
- IGNGBUVODQLMRJ-CIUDSAMLSA-N Gln-Ala-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O IGNGBUVODQLMRJ-CIUDSAMLSA-N 0.000 description 2
- HMIXCETWRYDVMO-GUBZILKMSA-N Gln-Pro-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O HMIXCETWRYDVMO-GUBZILKMSA-N 0.000 description 2
- OKARHJKJTKFQBM-ACZMJKKPSA-N Gln-Ser-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OKARHJKJTKFQBM-ACZMJKKPSA-N 0.000 description 2
- ZXLZWUQBRYGDNS-CIUDSAMLSA-N Glu-Cys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N ZXLZWUQBRYGDNS-CIUDSAMLSA-N 0.000 description 2
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 2
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 2
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 2
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 2
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 2
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 2
- LPCKHUXOGVNZRS-YUMQZZPRSA-N Gly-His-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O LPCKHUXOGVNZRS-YUMQZZPRSA-N 0.000 description 2
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 2
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 2
- IBYOLNARKHMLBG-WHOFXGATSA-N Gly-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IBYOLNARKHMLBG-WHOFXGATSA-N 0.000 description 2
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 2
- 241000557057 Helicobacter bizzozeronii Species 0.000 description 2
- 241000590002 Helicobacter pylori Species 0.000 description 2
- 241000557050 Helicobacter salomonis Species 0.000 description 2
- UCDWNBFOZCZSNV-AVGNSLFASA-N His-Arg-Met Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O UCDWNBFOZCZSNV-AVGNSLFASA-N 0.000 description 2
- VLPMGIJPAWENQB-SRVKXCTJSA-N His-Cys-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O VLPMGIJPAWENQB-SRVKXCTJSA-N 0.000 description 2
- AYIZHKDZYOSOGY-IUCAKERBSA-N His-Met Chemical compound CSCC[C@@H](C([O-])=O)NC(=O)[C@@H]([NH3+])CC1=CN=CN1 AYIZHKDZYOSOGY-IUCAKERBSA-N 0.000 description 2
- ZFDKSLBEWYCOCS-BZSNNMDCSA-N His-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1NC=NC=1)C1=CC=CC=C1 ZFDKSLBEWYCOCS-BZSNNMDCSA-N 0.000 description 2
- NCSIQAFSIPHVAN-IUKAMOBKSA-N Ile-Asn-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NCSIQAFSIPHVAN-IUKAMOBKSA-N 0.000 description 2
- VOBYAKCXGQQFLR-LSJOCFKGSA-N Ile-Gly-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VOBYAKCXGQQFLR-LSJOCFKGSA-N 0.000 description 2
- JLWLMGADIQFKRD-QSFUFRPTSA-N Ile-His-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CN=CN1 JLWLMGADIQFKRD-QSFUFRPTSA-N 0.000 description 2
- UASTVUQJMLZWGG-PEXQALLHSA-N Ile-His-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N UASTVUQJMLZWGG-PEXQALLHSA-N 0.000 description 2
- WVUDHMBJNBWZBU-XUXIUFHCSA-N Ile-Lys-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N WVUDHMBJNBWZBU-XUXIUFHCSA-N 0.000 description 2
- ZUWSVOYKBCHLRR-MGHWNKPDSA-N Ile-Tyr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUWSVOYKBCHLRR-MGHWNKPDSA-N 0.000 description 2
- DLEBSGAVWRPTIX-PEDHHIEDSA-N Ile-Val-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)[C@@H](C)CC DLEBSGAVWRPTIX-PEDHHIEDSA-N 0.000 description 2
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 2
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 2
- 125000003440 L-leucyl group Chemical group O=C([*])[C@](N([H])[H])([H])C([H])([H])C(C([H])([H])[H])([H])C([H])([H])[H] 0.000 description 2
- 125000002842 L-seryl group Chemical group O=C([*])[C@](N([H])[H])([H])C([H])([H])O[H] 0.000 description 2
- 125000000769 L-threonyl group Chemical group [H]N([H])[C@]([H])(C(=O)[*])[C@](O[H])(C([H])([H])[H])[H] 0.000 description 2
- 241000880493 Leptailurus serval Species 0.000 description 2
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 2
- RFUBXQQFJFGJFV-GUBZILKMSA-N Leu-Asn-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RFUBXQQFJFGJFV-GUBZILKMSA-N 0.000 description 2
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 2
- SGIIOQQGLUUMDQ-IHRRRGAJSA-N Leu-His-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N SGIIOQQGLUUMDQ-IHRRRGAJSA-N 0.000 description 2
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 2
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 2
- QIJVAFLRMVBHMU-KKUMJFAQSA-N Lys-Asp-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QIJVAFLRMVBHMU-KKUMJFAQSA-N 0.000 description 2
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 2
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 2
- ISHNZELVUVPCHY-ZETCQYMHSA-N Lys-Gly-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O ISHNZELVUVPCHY-ZETCQYMHSA-N 0.000 description 2
- YWJQHDDBFAXNIR-MXAVVETBSA-N Lys-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N YWJQHDDBFAXNIR-MXAVVETBSA-N 0.000 description 2
- NCZIQZYZPUPMKY-PPCPHDFISA-N Lys-Ile-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NCZIQZYZPUPMKY-PPCPHDFISA-N 0.000 description 2
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 2
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 2
- MTBLFIQZECOEBY-IHRRRGAJSA-N Lys-Met-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O MTBLFIQZECOEBY-IHRRRGAJSA-N 0.000 description 2
- ODTZHNZPINULEU-KKUMJFAQSA-N Lys-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N ODTZHNZPINULEU-KKUMJFAQSA-N 0.000 description 2
- TWPCWKVOZDUYAA-KKUMJFAQSA-N Lys-Phe-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O TWPCWKVOZDUYAA-KKUMJFAQSA-N 0.000 description 2
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 2
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 2
- DGNZGCQSVGGYJS-BQBZGAKWSA-N Met-Gly-Asp Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O DGNZGCQSVGGYJS-BQBZGAKWSA-N 0.000 description 2
- BMHIFARYXOJDLD-WPRPVWTQSA-N Met-Gly-Val Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O BMHIFARYXOJDLD-WPRPVWTQSA-N 0.000 description 2
- XPCLRYNQMZOOFB-ULQDDVLXSA-N Met-His-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N XPCLRYNQMZOOFB-ULQDDVLXSA-N 0.000 description 2
- GWADARYJIJDYRC-XGEHTFHBSA-N Met-Thr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GWADARYJIJDYRC-XGEHTFHBSA-N 0.000 description 2
- 241000699670 Mus sp. Species 0.000 description 2
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 2
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 2
- 108091092724 Noncoding DNA Proteins 0.000 description 2
- HCTXJGRYAACKOB-SRVKXCTJSA-N Phe-Asn-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HCTXJGRYAACKOB-SRVKXCTJSA-N 0.000 description 2
- HBGFEEQFVBWYJQ-KBPBESRZSA-N Phe-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HBGFEEQFVBWYJQ-KBPBESRZSA-N 0.000 description 2
- WEMYTDDMDBLPMI-DKIMLUQUSA-N Phe-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N WEMYTDDMDBLPMI-DKIMLUQUSA-N 0.000 description 2
- BONHGTUEEPIMPM-AVGNSLFASA-N Phe-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O BONHGTUEEPIMPM-AVGNSLFASA-N 0.000 description 2
- 241000276498 Pollachius virens Species 0.000 description 2
- CQZNGNCAIXMAIQ-UBHSHLNASA-N Pro-Ala-Phe Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O CQZNGNCAIXMAIQ-UBHSHLNASA-N 0.000 description 2
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 2
- KPDRZQUWJKTMBP-DCAQKATOSA-N Pro-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 KPDRZQUWJKTMBP-DCAQKATOSA-N 0.000 description 2
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 2
- KWMUAKQOVYCQJQ-ZPFDUUQYSA-N Pro-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@@H]1CCCN1 KWMUAKQOVYCQJQ-ZPFDUUQYSA-N 0.000 description 2
- VWHJZETTZDAGOM-XUXIUFHCSA-N Pro-Lys-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VWHJZETTZDAGOM-XUXIUFHCSA-N 0.000 description 2
- CHYAYDLYYIJCKY-OSUNSFLBSA-N Pro-Thr-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CHYAYDLYYIJCKY-OSUNSFLBSA-N 0.000 description 2
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 2
- QKWYXRPICJEQAJ-KJEVXHAQSA-N Pro-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@@H]2CCCN2)O QKWYXRPICJEQAJ-KJEVXHAQSA-N 0.000 description 2
- OOZJHTXCLJUODH-QXEWZRGKSA-N Pro-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 OOZJHTXCLJUODH-QXEWZRGKSA-N 0.000 description 2
- 241000607142 Salmonella Species 0.000 description 2
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 2
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 2
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 2
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 2
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 2
- OZPDGESCTGGNAD-CIUDSAMLSA-N Ser-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CO OZPDGESCTGGNAD-CIUDSAMLSA-N 0.000 description 2
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 2
- 210000001744 T-lymphocyte Anatomy 0.000 description 2
- JTEICXDKGWKRRV-HJGDQZAQSA-N Thr-Asn-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JTEICXDKGWKRRV-HJGDQZAQSA-N 0.000 description 2
- XDARBNMYXKUFOJ-GSSVUCPTSA-N Thr-Asp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDARBNMYXKUFOJ-GSSVUCPTSA-N 0.000 description 2
- UDNVOQMPQBEITB-MEYUZBJRSA-N Thr-His-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O UDNVOQMPQBEITB-MEYUZBJRSA-N 0.000 description 2
- MGJLBZFUXUGMML-VOAKCMCISA-N Thr-Lys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MGJLBZFUXUGMML-VOAKCMCISA-N 0.000 description 2
- MQVGIFJSFFVGFW-XEGUGMAKSA-N Trp-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MQVGIFJSFFVGFW-XEGUGMAKSA-N 0.000 description 2
- ADBFWLXCCKIXBQ-XIRDDKMYSA-N Trp-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N ADBFWLXCCKIXBQ-XIRDDKMYSA-N 0.000 description 2
- VCGOTJGGBXEBFO-FDARSICLSA-N Trp-Pro-Ile Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VCGOTJGGBXEBFO-FDARSICLSA-N 0.000 description 2
- WPXKRJVHBXYLDT-JUKXBJQTSA-N Tyr-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N WPXKRJVHBXYLDT-JUKXBJQTSA-N 0.000 description 2
- XYBNMHRFAUKPAW-IHRRRGAJSA-N Tyr-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XYBNMHRFAUKPAW-IHRRRGAJSA-N 0.000 description 2
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 2
- VLDMQVZZWDOKQF-AUTRQRHGSA-N Val-Glu-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VLDMQVZZWDOKQF-AUTRQRHGSA-N 0.000 description 2
- YDPFWRVQHFWBKI-GVXVVHGQSA-N Val-Glu-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YDPFWRVQHFWBKI-GVXVVHGQSA-N 0.000 description 2
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 2
- XTDDIVQWDXMRJL-IHRRRGAJSA-N Val-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N XTDDIVQWDXMRJL-IHRRRGAJSA-N 0.000 description 2
- HPANGHISDXDUQY-ULQDDVLXSA-N Val-Lys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HPANGHISDXDUQY-ULQDDVLXSA-N 0.000 description 2
- ZXYPHBKIZLAQTL-QXEWZRGKSA-N Val-Pro-Asp Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N ZXYPHBKIZLAQTL-QXEWZRGKSA-N 0.000 description 2
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 description 2
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 2
- LNWSJGJCLFUNTN-ZOBUZTSGSA-N Val-Trp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LNWSJGJCLFUNTN-ZOBUZTSGSA-N 0.000 description 2
- 101710136524 X polypeptide Proteins 0.000 description 2
- 101150044453 Y gene Proteins 0.000 description 2
- 239000002253 acid Substances 0.000 description 2
- 230000002378 acidificating effect Effects 0.000 description 2
- 150000007513 acids Chemical class 0.000 description 2
- PNEYBMLMFCGWSK-UHFFFAOYSA-N aluminium oxide Inorganic materials [O-2].[O-2].[O-2].[Al+3].[Al+3] PNEYBMLMFCGWSK-UHFFFAOYSA-N 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- 230000002238 attenuated effect Effects 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 2
- 239000011248 coating agent Substances 0.000 description 2
- 238000000576 coating method Methods 0.000 description 2
- 230000000295 complement effect Effects 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 239000003085 diluting agent Substances 0.000 description 2
- 230000002708 enhancing effect Effects 0.000 description 2
- 210000000981 epithelium Anatomy 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- 230000002496 gastric effect Effects 0.000 description 2
- 210000001035 gastrointestinal tract Anatomy 0.000 description 2
- 238000001502 gel electrophoresis Methods 0.000 description 2
- 108010079547 glutamylmethionine Proteins 0.000 description 2
- 229940037467 helicobacter pylori Drugs 0.000 description 2
- 230000001900 immune effect Effects 0.000 description 2
- 230000005847 immunogenicity Effects 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 2
- 210000004962 mammalian cell Anatomy 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 108010068488 methionylphenylalanine Proteins 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 238000007857 nested PCR Methods 0.000 description 2
- 230000008506 pathogenesis Effects 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 239000003381 stabilizer Substances 0.000 description 2
- 230000009885 systemic effect Effects 0.000 description 2
- 108010061238 threonyl-glycine Proteins 0.000 description 2
- 238000013518 transcription Methods 0.000 description 2
- 230000035897 transcription Effects 0.000 description 2
- 230000001960 triggered effect Effects 0.000 description 2
- 241000701447 unidentified baculovirus Species 0.000 description 2
- 230000003612 virological effect Effects 0.000 description 2
- 238000001262 western blot Methods 0.000 description 2
- 108010027345 wheylin-1 peptide Proteins 0.000 description 2
- HDTRYLNUVZCQOY-UHFFFAOYSA-N α-D-glucopyranosyl-α-D-glucopyranoside Natural products OC1C(O)C(O)C(CO)OC1OC1C(O)C(O)C(O)C(CO)O1 HDTRYLNUVZCQOY-UHFFFAOYSA-N 0.000 description 1
- DIGQNXIGRZPYDK-WKSCXVIASA-N (2R)-6-amino-2-[[2-[[(2S)-2-[[2-[[(2R)-2-[[(2S)-2-[[(2R,3S)-2-[[2-[[(2S)-2-[[2-[[(2S)-2-[[(2S)-2-[[(2R)-2-[[(2S,3S)-2-[[(2R)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[2-[[(2S)-2-[[(2R)-2-[[2-[[2-[[2-[(2-amino-1-hydroxyethylidene)amino]-3-carboxy-1-hydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1,5-dihydroxy-5-iminopentylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]hexanoic acid Chemical compound C[C@@H]([C@@H](C(=N[C@@H](CS)C(=N[C@@H](C)C(=N[C@@H](CO)C(=NCC(=N[C@@H](CCC(=N)O)C(=NC(CS)C(=N[C@H]([C@H](C)O)C(=N[C@H](CS)C(=N[C@H](CO)C(=NCC(=N[C@H](CS)C(=NCC(=N[C@H](CCCCN)C(=O)O)O)O)O)O)O)O)O)O)O)O)O)O)O)N=C([C@H](CS)N=C([C@H](CO)N=C([C@H](CO)N=C([C@H](C)N=C(CN=C([C@H](CO)N=C([C@H](CS)N=C(CN=C(C(CS)N=C(C(CC(=O)O)N=C(CN)O)O)O)O)O)O)O)O)O)O)O)O DIGQNXIGRZPYDK-WKSCXVIASA-N 0.000 description 1
- COEXAQSTZUWMRI-STQMWFEESA-N (2s)-1-[2-[[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound C([C@H](N)C(=O)NCC(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=C(O)C=C1 COEXAQSTZUWMRI-STQMWFEESA-N 0.000 description 1
- 108010042708 Acetylmuramyl-Alanyl-Isoglutamine Proteins 0.000 description 1
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 1
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 1
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 1
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 1
- HQJKCXHQNUCKMY-GHCJXIJMSA-N Ala-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C)N HQJKCXHQNUCKMY-GHCJXIJMSA-N 0.000 description 1
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 1
- XHNLCGXYBXNRIS-BJDJZHNGSA-N Ala-Lys-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XHNLCGXYBXNRIS-BJDJZHNGSA-N 0.000 description 1
- OMDNCNKNEGFOMM-BQBZGAKWSA-N Ala-Met-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O OMDNCNKNEGFOMM-BQBZGAKWSA-N 0.000 description 1
- JAQNUEWEJWBVAY-WBAXXEDZSA-N Ala-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 JAQNUEWEJWBVAY-WBAXXEDZSA-N 0.000 description 1
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 1
- JJHBEVZAZXZREW-LFSVMHDDSA-N Ala-Thr-Phe Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O JJHBEVZAZXZREW-LFSVMHDDSA-N 0.000 description 1
- PGNNQOJOEGFAOR-KWQFWETISA-N Ala-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 PGNNQOJOEGFAOR-KWQFWETISA-N 0.000 description 1
- 102000009027 Albumins Human genes 0.000 description 1
- 108010088751 Albumins Proteins 0.000 description 1
- 239000005995 Aluminium silicate Substances 0.000 description 1
- 239000004382 Amylase Substances 0.000 description 1
- 241000024188 Andala Species 0.000 description 1
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 1
- LMPKCSXZJSXBBL-NHCYSSNCSA-N Arg-Gln-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O LMPKCSXZJSXBBL-NHCYSSNCSA-N 0.000 description 1
- OGUPCHKBOKJFMA-SRVKXCTJSA-N Arg-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N OGUPCHKBOKJFMA-SRVKXCTJSA-N 0.000 description 1
- RKQRHMKFNBYOTN-IHRRRGAJSA-N Arg-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N RKQRHMKFNBYOTN-IHRRRGAJSA-N 0.000 description 1
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 1
- WTFIFQWLQXZLIZ-UMPQAUOISA-N Arg-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O WTFIFQWLQXZLIZ-UMPQAUOISA-N 0.000 description 1
- QISZHYWZHJRDAO-CIUDSAMLSA-N Asn-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N QISZHYWZHJRDAO-CIUDSAMLSA-N 0.000 description 1
- DDPXDCKYWDGZAL-BQBZGAKWSA-N Asn-Gly-Arg Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N DDPXDCKYWDGZAL-BQBZGAKWSA-N 0.000 description 1
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 1
- RCFGLXMZDYNRSC-CIUDSAMLSA-N Asn-Lys-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O RCFGLXMZDYNRSC-CIUDSAMLSA-N 0.000 description 1
- ZYPWIUFLYMQZBS-SRVKXCTJSA-N Asn-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZYPWIUFLYMQZBS-SRVKXCTJSA-N 0.000 description 1
- AYOAHKWVQLNPDM-HJGDQZAQSA-N Asn-Lys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AYOAHKWVQLNPDM-HJGDQZAQSA-N 0.000 description 1
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 1
- SPKCGKRUYKMDHP-GUDRVLHUSA-N Asp-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N SPKCGKRUYKMDHP-GUDRVLHUSA-N 0.000 description 1
- DONWIPDSZZJHHK-HJGDQZAQSA-N Asp-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)O DONWIPDSZZJHHK-HJGDQZAQSA-N 0.000 description 1
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 1
- ZARXTZFGQZBYFO-JQWIXIFHSA-N Asp-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(O)=O)N)C(O)=O)=CNC2=C1 ZARXTZFGQZBYFO-JQWIXIFHSA-N 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 108020000946 Bacterial DNA Proteins 0.000 description 1
- 241000282465 Canis Species 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- PRHGYQOSEHLDRW-VGDYDELISA-N Cys-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CS)N PRHGYQOSEHLDRW-VGDYDELISA-N 0.000 description 1
- WTXCNOPZMQRTNN-BWBBJGPYSA-N Cys-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)O WTXCNOPZMQRTNN-BWBBJGPYSA-N 0.000 description 1
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 1
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 1
- FBPFZTCFMRRESA-JGWLITMVSA-N D-glucitol Chemical compound OC[C@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-JGWLITMVSA-N 0.000 description 1
- 108010041986 DNA Vaccines Proteins 0.000 description 1
- 229940021995 DNA vaccine Drugs 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- 241000448280 Elates Species 0.000 description 1
- 241000701959 Escherichia virus Lambda Species 0.000 description 1
- 206010056663 Gastric infection Diseases 0.000 description 1
- 208000007882 Gastritis Diseases 0.000 description 1
- MQANCSUBSBJNLU-KKUMJFAQSA-N Gln-Arg-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MQANCSUBSBJNLU-KKUMJFAQSA-N 0.000 description 1
- WQWMZOIPXWSZNE-WDSKDSINSA-N Gln-Asp-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O WQWMZOIPXWSZNE-WDSKDSINSA-N 0.000 description 1
- MAGNEQBFSBREJL-DCAQKATOSA-N Gln-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N MAGNEQBFSBREJL-DCAQKATOSA-N 0.000 description 1
- JHPFPROFOAJRFN-IHRRRGAJSA-N Gln-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O JHPFPROFOAJRFN-IHRRRGAJSA-N 0.000 description 1
- HXOLDXKNWKLDMM-YVNDNENWSA-N Gln-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HXOLDXKNWKLDMM-YVNDNENWSA-N 0.000 description 1
- UESYBOXFJWJVSB-AVGNSLFASA-N Gln-Phe-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O UESYBOXFJWJVSB-AVGNSLFASA-N 0.000 description 1
- NYCVMJGIJYQWDO-CIUDSAMLSA-N Gln-Ser-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NYCVMJGIJYQWDO-CIUDSAMLSA-N 0.000 description 1
- PAOHIZNRJNIXQY-XQXXSGGOSA-N Gln-Thr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PAOHIZNRJNIXQY-XQXXSGGOSA-N 0.000 description 1
- DYVMTEWCGAVKSE-HJGDQZAQSA-N Gln-Thr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O DYVMTEWCGAVKSE-HJGDQZAQSA-N 0.000 description 1
- VYOILACOFPPNQH-UMNHJUIQSA-N Gln-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N VYOILACOFPPNQH-UMNHJUIQSA-N 0.000 description 1
- SZXSSXUNOALWCH-ACZMJKKPSA-N Glu-Ala-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O SZXSSXUNOALWCH-ACZMJKKPSA-N 0.000 description 1
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 1
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 1
- RCCDHXSRMWCOOY-GUBZILKMSA-N Glu-Arg-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O RCCDHXSRMWCOOY-GUBZILKMSA-N 0.000 description 1
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 1
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 1
- PBFGQTGPSKWHJA-QEJZJMRPSA-N Glu-Asp-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O PBFGQTGPSKWHJA-QEJZJMRPSA-N 0.000 description 1
- NUSWUSKZRCGFEX-FXQIFTODSA-N Glu-Glu-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O NUSWUSKZRCGFEX-FXQIFTODSA-N 0.000 description 1
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 1
- DWBBKNPKDHXIAC-SRVKXCTJSA-N Glu-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCC(O)=O DWBBKNPKDHXIAC-SRVKXCTJSA-N 0.000 description 1
- XEKAJTCACGEBOK-KKUMJFAQSA-N Glu-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XEKAJTCACGEBOK-KKUMJFAQSA-N 0.000 description 1
- YRMZCZIRHYCNHX-RYUDHWBXSA-N Glu-Phe-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O YRMZCZIRHYCNHX-RYUDHWBXSA-N 0.000 description 1
- JVYNYWXHZWVJEF-NUMRIWBASA-N Glu-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O JVYNYWXHZWVJEF-NUMRIWBASA-N 0.000 description 1
- HHSKZJZWQFPSKN-AVGNSLFASA-N Glu-Tyr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O HHSKZJZWQFPSKN-AVGNSLFASA-N 0.000 description 1
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 1
- UZWUBBRJWFTHTD-LAEOZQHASA-N Glu-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O UZWUBBRJWFTHTD-LAEOZQHASA-N 0.000 description 1
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 1
- NTNUEBVGKMVANB-NHCYSSNCSA-N Glu-Val-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O NTNUEBVGKMVANB-NHCYSSNCSA-N 0.000 description 1
- FVGOGEGGQLNZGH-DZKIICNBSA-N Glu-Val-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FVGOGEGGQLNZGH-DZKIICNBSA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- GZBZACMXFIPIDX-WHFBIAKZSA-N Gly-Cys-Asp Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN)C(=O)O GZBZACMXFIPIDX-WHFBIAKZSA-N 0.000 description 1
- IEFJWDNGDZAYNZ-BYPYZUCNSA-N Gly-Glu Chemical compound NCC(=O)N[C@H](C(O)=O)CCC(O)=O IEFJWDNGDZAYNZ-BYPYZUCNSA-N 0.000 description 1
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 1
- QPCVIQJVRGXUSA-LURJTMIESA-N Gly-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QPCVIQJVRGXUSA-LURJTMIESA-N 0.000 description 1
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 1
- CQIIXEHDSZUSAG-QWRGUYRKSA-N Gly-His-His Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 CQIIXEHDSZUSAG-QWRGUYRKSA-N 0.000 description 1
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 1
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 1
- LPHQAFLNEHWKFF-QXEWZRGKSA-N Gly-Met-Ile Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LPHQAFLNEHWKFF-QXEWZRGKSA-N 0.000 description 1
- GAAHQHNCMIAYEX-UWVGGRQHSA-N Gly-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GAAHQHNCMIAYEX-UWVGGRQHSA-N 0.000 description 1
- MKIAPEZXQDILRR-YUMQZZPRSA-N Gly-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN MKIAPEZXQDILRR-YUMQZZPRSA-N 0.000 description 1
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 1
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 1
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 1
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 1
- SDTPKSOWFXBACN-GUBZILKMSA-N His-Glu-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O SDTPKSOWFXBACN-GUBZILKMSA-N 0.000 description 1
- NTXIJPDAHXSHNL-ONGXEEELSA-N His-Gly-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NTXIJPDAHXSHNL-ONGXEEELSA-N 0.000 description 1
- PGRPSOUCWRBWKZ-DLOVCJGASA-N His-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CN=CN1 PGRPSOUCWRBWKZ-DLOVCJGASA-N 0.000 description 1
- SVVULKPWDBIPCO-BZSNNMDCSA-N His-Phe-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SVVULKPWDBIPCO-BZSNNMDCSA-N 0.000 description 1
- GIRSNERMXCMDBO-GARJFASQSA-N His-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O GIRSNERMXCMDBO-GARJFASQSA-N 0.000 description 1
- DMHGKBGOUAJRHU-RVMXOQNASA-N Ile-Arg-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N DMHGKBGOUAJRHU-RVMXOQNASA-N 0.000 description 1
- PDTMWFVVNZYWTR-NHCYSSNCSA-N Ile-Gly-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O PDTMWFVVNZYWTR-NHCYSSNCSA-N 0.000 description 1
- LBRCLQMZAHRTLV-ZKWXMUAHSA-N Ile-Gly-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LBRCLQMZAHRTLV-ZKWXMUAHSA-N 0.000 description 1
- UWBDLNOCIDGPQE-GUBZILKMSA-N Ile-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN UWBDLNOCIDGPQE-GUBZILKMSA-N 0.000 description 1
- IVXJIMGDOYRLQU-XUXIUFHCSA-N Ile-Pro-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O IVXJIMGDOYRLQU-XUXIUFHCSA-N 0.000 description 1
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 1
- SAEWJTCJQVZQNZ-IUKAMOBKSA-N Ile-Thr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SAEWJTCJQVZQNZ-IUKAMOBKSA-N 0.000 description 1
- 108010065920 Insulin Lispro Proteins 0.000 description 1
- 125000001176 L-lysyl group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C([H])([H])C([H])([H])C([H])([H])C(N([H])[H])([H])[H] 0.000 description 1
- QOOWRKBDDXQRHC-BQBZGAKWSA-N L-lysyl-L-alanine Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN QOOWRKBDDXQRHC-BQBZGAKWSA-N 0.000 description 1
- 241000186660 Lactobacillus Species 0.000 description 1
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 1
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 1
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 1
- UCNNZELZXFXXJQ-BZSNNMDCSA-N Leu-Leu-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCNNZELZXFXXJQ-BZSNNMDCSA-N 0.000 description 1
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 1
- 206010025323 Lymphomas Diseases 0.000 description 1
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 1
- VHNOAIFVYUQOOY-XUXIUFHCSA-N Lys-Arg-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VHNOAIFVYUQOOY-XUXIUFHCSA-N 0.000 description 1
- JPNRPAJITHRXRH-BQBZGAKWSA-N Lys-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O JPNRPAJITHRXRH-BQBZGAKWSA-N 0.000 description 1
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 1
- SSJBMGCZZXCGJJ-DCAQKATOSA-N Lys-Asp-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O SSJBMGCZZXCGJJ-DCAQKATOSA-N 0.000 description 1
- HWMZUBUEOYAQSC-DCAQKATOSA-N Lys-Gln-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O HWMZUBUEOYAQSC-DCAQKATOSA-N 0.000 description 1
- SLQJJFAVWSZLBL-BJDJZHNGSA-N Lys-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN SLQJJFAVWSZLBL-BJDJZHNGSA-N 0.000 description 1
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 1
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 1
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 1
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 1
- YCJCEMKOZOYBEF-OEAJRASXSA-N Lys-Thr-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YCJCEMKOZOYBEF-OEAJRASXSA-N 0.000 description 1
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 1
- XABXVVSWUVCZST-GVXVVHGQSA-N Lys-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN XABXVVSWUVCZST-GVXVVHGQSA-N 0.000 description 1
- 229930195725 Mannitol Natural products 0.000 description 1
- JMEWFDUAFKVAAT-WDSKDSINSA-N Met-Asn Chemical compound CSCC[C@H]([NH3+])C(=O)N[C@H](C([O-])=O)CC(N)=O JMEWFDUAFKVAAT-WDSKDSINSA-N 0.000 description 1
- GODBLDDYHFTUAH-CIUDSAMLSA-N Met-Asp-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O GODBLDDYHFTUAH-CIUDSAMLSA-N 0.000 description 1
- LBNFTWKGISQVEE-AVGNSLFASA-N Met-Leu-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCSC LBNFTWKGISQVEE-AVGNSLFASA-N 0.000 description 1
- HAQLBBVZAGMESV-IHRRRGAJSA-N Met-Lys-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O HAQLBBVZAGMESV-IHRRRGAJSA-N 0.000 description 1
- 102000003792 Metallothionein Human genes 0.000 description 1
- 108090000157 Metallothionein Proteins 0.000 description 1
- 229910017974 NH40H Inorganic materials 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 101710116435 Outer membrane protein Proteins 0.000 description 1
- 208000008469 Peptic Ulcer Diseases 0.000 description 1
- LJUUGSWZPQOJKD-JYJNAYRXSA-N Phe-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O LJUUGSWZPQOJKD-JYJNAYRXSA-N 0.000 description 1
- BFYHIHGIHGROAT-HTUGSXCWSA-N Phe-Glu-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFYHIHGIHGROAT-HTUGSXCWSA-N 0.000 description 1
- HNFUGJUZJRYUHN-JSGCOSHPSA-N Phe-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HNFUGJUZJRYUHN-JSGCOSHPSA-N 0.000 description 1
- FINLZXKJWTYYLC-ACRUOGEOSA-N Phe-His-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1N=CNC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FINLZXKJWTYYLC-ACRUOGEOSA-N 0.000 description 1
- DOXQMJCSSYZSNM-BZSNNMDCSA-N Phe-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O DOXQMJCSSYZSNM-BZSNNMDCSA-N 0.000 description 1
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 1
- 101710182846 Polyhedrin Proteins 0.000 description 1
- ZSKJPKFTPQCPIH-RCWTZXSCSA-N Pro-Arg-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSKJPKFTPQCPIH-RCWTZXSCSA-N 0.000 description 1
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 1
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 1
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 1
- XQPHBAKJJJZOBX-SRVKXCTJSA-N Pro-Lys-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O XQPHBAKJJJZOBX-SRVKXCTJSA-N 0.000 description 1
- AFWBWPCXSWUCLB-WDSKDSINSA-N Pro-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H]1CCC[NH2+]1 AFWBWPCXSWUCLB-WDSKDSINSA-N 0.000 description 1
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 1
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 1
- WBINSDOPZHQPPM-AVGNSLFASA-N Ser-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)O WBINSDOPZHQPPM-AVGNSLFASA-N 0.000 description 1
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 1
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 1
- IUXGJEIKJBYKOO-SRVKXCTJSA-N Ser-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N IUXGJEIKJBYKOO-SRVKXCTJSA-N 0.000 description 1
- -1 SerIGly Chemical group 0.000 description 1
- 229920002125 Sokalan® Polymers 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- 229930006000 Sucrose Natural products 0.000 description 1
- 108020005038 Terminator Codon Proteins 0.000 description 1
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 1
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 1
- QGXCWPNQVCYJEL-NUMRIWBASA-N Thr-Asn-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGXCWPNQVCYJEL-NUMRIWBASA-N 0.000 description 1
- UCCNDUPVIFOOQX-CUJWVEQBSA-N Thr-Cys-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 UCCNDUPVIFOOQX-CUJWVEQBSA-N 0.000 description 1
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 1
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 1
- WRQLCVIALDUQEQ-UNQGMJICSA-N Thr-Phe-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WRQLCVIALDUQEQ-UNQGMJICSA-N 0.000 description 1
- VEIKMWOMUYMMMK-FCLVOEFKSA-N Thr-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 VEIKMWOMUYMMMK-FCLVOEFKSA-N 0.000 description 1
- NWECYMJLJGCBOD-UNQGMJICSA-N Thr-Phe-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O NWECYMJLJGCBOD-UNQGMJICSA-N 0.000 description 1
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 1
- DOBIBIXIHJKVJF-XKBZYTNZSA-N Thr-Ser-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DOBIBIXIHJKVJF-XKBZYTNZSA-N 0.000 description 1
- NQQMWWVVGIXUOX-SVSWQMSJSA-N Thr-Ser-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NQQMWWVVGIXUOX-SVSWQMSJSA-N 0.000 description 1
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 1
- HDTRYLNUVZCQOY-WSWWMNSNSA-N Trehalose Natural products O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-WSWWMNSNSA-N 0.000 description 1
- QNTBGBCOEYNAPV-CWRNSKLLSA-N Trp-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O QNTBGBCOEYNAPV-CWRNSKLLSA-N 0.000 description 1
- GWQUSADRQCTMHN-NWLDYVSISA-N Trp-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O GWQUSADRQCTMHN-NWLDYVSISA-N 0.000 description 1
- OGXQLUCMJZSJPW-LYSGOOTNSA-N Trp-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O OGXQLUCMJZSJPW-LYSGOOTNSA-N 0.000 description 1
- DLZKEQQWXODGGZ-KWQFWETISA-N Tyr-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DLZKEQQWXODGGZ-KWQFWETISA-N 0.000 description 1
- UABYBEBXFFNCIR-YDHLFZDLSA-N Tyr-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UABYBEBXFFNCIR-YDHLFZDLSA-N 0.000 description 1
- LOOCQRRBKZTPKO-AVGNSLFASA-N Tyr-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LOOCQRRBKZTPKO-AVGNSLFASA-N 0.000 description 1
- GGXUDPQWAWRINY-XEGUGMAKSA-N Tyr-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GGXUDPQWAWRINY-XEGUGMAKSA-N 0.000 description 1
- JAQGKXUEKGKTKX-HOTGVXAUSA-N Tyr-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 JAQGKXUEKGKTKX-HOTGVXAUSA-N 0.000 description 1
- AEOFMCAKYIQQFY-YDHLFZDLSA-N Tyr-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AEOFMCAKYIQQFY-YDHLFZDLSA-N 0.000 description 1
- SQUMHUZLJDUROQ-YDHLFZDLSA-N Tyr-Val-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O SQUMHUZLJDUROQ-YDHLFZDLSA-N 0.000 description 1
- 241000700618 Vaccinia virus Species 0.000 description 1
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 1
- ZMDCGGKHRKNWKD-LAEOZQHASA-N Val-Asn-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZMDCGGKHRKNWKD-LAEOZQHASA-N 0.000 description 1
- LIQJSDDOULTANC-QSFUFRPTSA-N Val-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LIQJSDDOULTANC-QSFUFRPTSA-N 0.000 description 1
- PVPAOIGJYHVWBT-KKHAAJSZSA-N Val-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N)O PVPAOIGJYHVWBT-KKHAAJSZSA-N 0.000 description 1
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 1
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 1
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 1
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 1
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 1
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 1
- OWFGFHQMSBTKLX-UFYCRDLUSA-N Val-Tyr-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N OWFGFHQMSBTKLX-UFYCRDLUSA-N 0.000 description 1
- IOUPEELXVYPCPG-UHFFFAOYSA-N Valylglycine Chemical compound CC(C)C(N)C(=O)NCC(O)=O IOUPEELXVYPCPG-UHFFFAOYSA-N 0.000 description 1
- 108700005077 Viral Genes Proteins 0.000 description 1
- 229930003427 Vitamin E Natural products 0.000 description 1
- 238000007792 addition Methods 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- 229910000318 alkali metal phosphate Inorganic materials 0.000 description 1
- HDTRYLNUVZCQOY-LIZSDCNHSA-N alpha,alpha-trehalose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-LIZSDCNHSA-N 0.000 description 1
- WNROFYMDJYEPJX-UHFFFAOYSA-K aluminium hydroxide Chemical compound [OH-].[OH-].[OH-].[Al+3] WNROFYMDJYEPJX-UHFFFAOYSA-K 0.000 description 1
- 229910021502 aluminium hydroxide Inorganic materials 0.000 description 1
- 229960005363 aluminium oxide Drugs 0.000 description 1
- ILRRQNADMUWWFW-UHFFFAOYSA-K aluminium phosphate Chemical compound O1[Al]2OP1(=O)O2 ILRRQNADMUWWFW-UHFFFAOYSA-K 0.000 description 1
- 229940001007 aluminium phosphate Drugs 0.000 description 1
- 229910000147 aluminium phosphate Inorganic materials 0.000 description 1
- 235000012211 aluminium silicate Nutrition 0.000 description 1
- 229940024545 aluminum hydroxide Drugs 0.000 description 1
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 1
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 1
- 239000001166 ammonium sulphate Substances 0.000 description 1
- 235000011130 ammonium sulphate Nutrition 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 210000000628 antibody-producing cell Anatomy 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 210000003719 b-lymphocyte Anatomy 0.000 description 1
- 239000000440 bentonite Substances 0.000 description 1
- 229910000278 bentonite Inorganic materials 0.000 description 1
- SVPXDRXYRYOSEX-UHFFFAOYSA-N bentoquatam Chemical compound O.O=[Si]=O.O=[Al]O[Al]=O SVPXDRXYRYOSEX-UHFFFAOYSA-N 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 210000001124 body fluid Anatomy 0.000 description 1
- 239000010839 body fluid Substances 0.000 description 1
- 239000002775 capsule Substances 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- 239000005018 casein Substances 0.000 description 1
- BECPQYXYKAMYBN-UHFFFAOYSA-N casein, tech. Chemical compound NCCCCC(C(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(CC(C)C)N=C(O)C(CCC(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(C(C)O)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(COP(O)(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(N)CC1=CC=CC=C1 BECPQYXYKAMYBN-UHFFFAOYSA-N 0.000 description 1
- 235000021240 caseins Nutrition 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- 239000013611 chromosomal DNA Substances 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 239000007857 degradation product Substances 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 239000003995 emulsifying agent Substances 0.000 description 1
- 230000001804 emulsifying effect Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 238000004108 freeze drying Methods 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- WIGCFUFOHFEKBI-UHFFFAOYSA-N gamma-tocopherol Natural products CC(C)CCCC(C)CCCC(C)CCCC1CCC2C(C)C(O)C(C)C(C)C2O1 WIGCFUFOHFEKBI-UHFFFAOYSA-N 0.000 description 1
- 210000004051 gastric juice Anatomy 0.000 description 1
- 201000011587 gastric lymphoma Diseases 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 108010087823 glycyltyrosine Proteins 0.000 description 1
- 108010028295 histidylhistidine Proteins 0.000 description 1
- 230000006801 homologous recombination Effects 0.000 description 1
- 238000002744 homologous recombination Methods 0.000 description 1
- 229920001519 homopolymer Polymers 0.000 description 1
- 230000003301 hydrolyzing effect Effects 0.000 description 1
- 210000000987 immune system Anatomy 0.000 description 1
- 230000036039 immunity Effects 0.000 description 1
- 238000002649 immunization Methods 0.000 description 1
- 230000009851 immunogenic response Effects 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- FZWBNHMXJMCXLU-BLAUPYHCSA-N isomaltotriose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1OC[C@@H]1[C@@H](O)[C@H](O)[C@@H](O)[C@@H](OC[C@@H](O)[C@@H](O)[C@H](O)[C@@H](O)C=O)O1 FZWBNHMXJMCXLU-BLAUPYHCSA-N 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- NLYAJNPCOHFWQQ-UHFFFAOYSA-N kaolin Chemical compound O.O.O=[Al]O[Si](=O)O[Si](=O)O[Al]=O NLYAJNPCOHFWQQ-UHFFFAOYSA-N 0.000 description 1
- 210000003292 kidney cell Anatomy 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 201000004792 malaria Diseases 0.000 description 1
- 239000000594 mannitol Substances 0.000 description 1
- 235000010355 mannitol Nutrition 0.000 description 1
- 235000013372 meat Nutrition 0.000 description 1
- MYWUZJCMWCOHBA-VIFPVBQESA-N methamphetamine Chemical compound CN[C@@H](C)CC1=CC=CC=C1 MYWUZJCMWCOHBA-VIFPVBQESA-N 0.000 description 1
- 108010056582 methionylglutamic acid Proteins 0.000 description 1
- 230000003278 mimic effect Effects 0.000 description 1
- 239000002480 mineral oil Substances 0.000 description 1
- 235000010446 mineral oil Nutrition 0.000 description 1
- 238000001823 molecular biology technique Methods 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 230000020477 pH reduction Effects 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- 244000052769 pathogen Species 0.000 description 1
- 230000007170 pathology Effects 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 108010084572 phenylalanyl-valine Proteins 0.000 description 1
- 108010012581 phenylalanylglutamate Proteins 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 229920000136 polysorbate Polymers 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 210000003296 saliva Anatomy 0.000 description 1
- 239000012266 salt solution Substances 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 239000000377 silicon dioxide Substances 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 239000000600 sorbitol Substances 0.000 description 1
- 238000009987 spinning Methods 0.000 description 1
- 230000003019 stabilising effect Effects 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 235000019698 starch Nutrition 0.000 description 1
- 239000008223 sterile water Substances 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 229940031626 subunit vaccine Drugs 0.000 description 1
- 239000005720 sucrose Substances 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 210000001519 tissue Anatomy 0.000 description 1
- 108010038745 tryptophylglycine Proteins 0.000 description 1
- 241001529453 unidentified herpesvirus Species 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 229940124856 vaccine component Drugs 0.000 description 1
- 235000015112 vegetable and seed oil Nutrition 0.000 description 1
- 239000008158 vegetable oil Substances 0.000 description 1
- 239000013603 viral vector Substances 0.000 description 1
- 235000019165 vitamin E Nutrition 0.000 description 1
- 229940046009 vitamin E Drugs 0.000 description 1
- 239000011709 vitamin E Substances 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 239000012130 whole-cell lysate Substances 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/78—Hydrolases (3) acting on carbon to nitrogen bonds other than peptide bonds (3.5)
- C12N9/80—Hydrolases (3) acting on carbon to nitrogen bonds other than peptide bonds (3.5) acting on amide bonds in linear amides (3.5.1)
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P1/00—Drugs for disorders of the alimentary tract or the digestive system
- A61P1/04—Drugs for disorders of the alimentary tract or the digestive system for ulcers, gastritis or reflux esophagitis, e.g. antacids, inhibitors of acid secretion, mucosal protectants
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
- A61P31/04—Antibacterial agents
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P9/00—Drugs for disorders of the cardiovascular system
- A61P9/10—Drugs for disorders of the cardiovascular system for treating ischaemic or atherosclerotic diseases, e.g. antianginal drugs, coronary vasodilators, drugs for myocardial infarction, retinopathy, cerebrovascula insufficiency, renal arteriosclerosis
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/205—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Campylobacter (G)
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
Abstract
The present invention relates to novel Helicobacter felis urease subunit polypeptides and to nucleic acid sequences encoding these subunit polypeptides, to DNA
fragments and recombinant DNA molecules comprising the nucleic acid sequences encoding these subunit polypeptides, to live recombinant carriers and to host cells comprising nucleic acid sequences encoding these subunit polypeptides. Also, the invention relates to the subunit polypeptides for use in vaccines and the use in the manufacturing thereof, to vaccines comprising said subunit polypeptides and to methods for the preparation of such vaccines. Furthermore, the invention relates to diagnostic methods for the detection of Helicobacter felis specific nucleic acid sequences, Helicobacter felis antigenic material and to antibodies against Helicobacter felis.
fragments and recombinant DNA molecules comprising the nucleic acid sequences encoding these subunit polypeptides, to live recombinant carriers and to host cells comprising nucleic acid sequences encoding these subunit polypeptides. Also, the invention relates to the subunit polypeptides for use in vaccines and the use in the manufacturing thereof, to vaccines comprising said subunit polypeptides and to methods for the preparation of such vaccines. Furthermore, the invention relates to diagnostic methods for the detection of Helicobacter felis specific nucleic acid sequences, Helicobacter felis antigenic material and to antibodies against Helicobacter felis.
Description
Helicobacter fells vaccine.
The present invention relates to novel Helicobacter urease subunit polypeptides, nucleic acid sequences encoding these polypeptides, to the polypeptides for use in vaccines and to the use in the manufacturing thereof, to vaccines comprising said polypeptides and to methods for the preparation of such vaccines. Further, the invention relates to diagnostic methods for the detection of the nucleic acid sequences, the polypeptides and antibodies against the polypeptides.
Several Helicobacter species are the cause of pathogenesis of the gastric epithelium.
Helicobacter pylori, and to a lesser extent H. heilmannii are known to cause gastritis, a major factor in the development of peptic ulcers and gastric lymphoma in humans.
Helicobacter fells is most likely the cause of gastric infections in both cats and dogs.
In order to survive the highly acidic environment of the stomach, members of the Helicobacterfamily produce an urease that is capable of hydrolysing the urea present in gastric juice. This hydrolysation sets free an amount of NH40H that suffices to neutralise the environment of the bacterium. It is known, that the urease plays a role in the colonisation of the bacterium as well as in its pathogenesis.
Genes encoding urease have been described and sequenced for both Helicobacter pylori (Labigne et al., J. Bacteriol. 173: 1920-1931 (1991 )) and Helicobacter fells (Ferrero et al., Molec. Microbiol. 9, 323-333 (1993)). Of the seven genes involved in urease expression and secretion, only two genes encode the two structural subunits urease A en B of the urease enzyme; ureA and urea. These two polypeptides form a polypeptide complex having urease activity.
Vaccines against infections caused by both H. pylori and fells have been made and have been the subject of i.a. International Patent Applications WO 94/09823 and WO
96/34624. Several attempt have been made, to use H. pylori urease as a vaccine component for the protection of cats against H. fells infection. Although indeed a certain level of protection can be obtained, the results are far from the 100 %
protection that would be desirable. From animal experiments published so far it becomes clear that a significant number of animals vaccinated with H. pylori is not at all protected against subsequent challenge with H. fells. Protection of cats vaccinated with purified urease from either H. fells or pylori has not been described. Vaccinating cats with H. fells whole cell lysates might theoretically be feasible but is not a practical option.
This is because in spite of many attempts for improvement, H. fells is difficult to grow.
There clearly is a need for an efficacious vaccine, based upon homologous components, and it is clear that the known H. fells urease does not confer full protection.
It is i.a. an object of the present invention to provide a H. fells urease which is able to induce protection against Helicobacter fells infection in dogs and cats. It was surprisingly found now, that in H. fells a second urease exists, of which the genes encoding the structural subunits share only low homology with the known H. fells ureA and B
genes.
The novel urease is named ureaseXY, in order to discriminate it from the known urease AB. The newly found urease has been discovered in H. fells, and is not present in H.
pylori.
The overall genetic structure of the genes encoding the two structural urease subunits, UreX and UreY is comparable to that of the known UreA and B in H. fells and H.
pylori.
The sequence homology is however surprisingly low. It was even more surprisingly found, that the homology between the ureA and B genes and the novel ureX and Y
genes in one single H. fells strain is even strikingly lower than the homology between the various ureA and B genes from the various Helicobacter species.
The present invention relates to novel Helicobacter urease subunit polypeptides, nucleic acid sequences encoding these polypeptides, to the polypeptides for use in vaccines and to the use in the manufacturing thereof, to vaccines comprising said polypeptides and to methods for the preparation of such vaccines. Further, the invention relates to diagnostic methods for the detection of the nucleic acid sequences, the polypeptides and antibodies against the polypeptides.
Several Helicobacter species are the cause of pathogenesis of the gastric epithelium.
Helicobacter pylori, and to a lesser extent H. heilmannii are known to cause gastritis, a major factor in the development of peptic ulcers and gastric lymphoma in humans.
Helicobacter fells is most likely the cause of gastric infections in both cats and dogs.
In order to survive the highly acidic environment of the stomach, members of the Helicobacterfamily produce an urease that is capable of hydrolysing the urea present in gastric juice. This hydrolysation sets free an amount of NH40H that suffices to neutralise the environment of the bacterium. It is known, that the urease plays a role in the colonisation of the bacterium as well as in its pathogenesis.
Genes encoding urease have been described and sequenced for both Helicobacter pylori (Labigne et al., J. Bacteriol. 173: 1920-1931 (1991 )) and Helicobacter fells (Ferrero et al., Molec. Microbiol. 9, 323-333 (1993)). Of the seven genes involved in urease expression and secretion, only two genes encode the two structural subunits urease A en B of the urease enzyme; ureA and urea. These two polypeptides form a polypeptide complex having urease activity.
Vaccines against infections caused by both H. pylori and fells have been made and have been the subject of i.a. International Patent Applications WO 94/09823 and WO
96/34624. Several attempt have been made, to use H. pylori urease as a vaccine component for the protection of cats against H. fells infection. Although indeed a certain level of protection can be obtained, the results are far from the 100 %
protection that would be desirable. From animal experiments published so far it becomes clear that a significant number of animals vaccinated with H. pylori is not at all protected against subsequent challenge with H. fells. Protection of cats vaccinated with purified urease from either H. fells or pylori has not been described. Vaccinating cats with H. fells whole cell lysates might theoretically be feasible but is not a practical option.
This is because in spite of many attempts for improvement, H. fells is difficult to grow.
There clearly is a need for an efficacious vaccine, based upon homologous components, and it is clear that the known H. fells urease does not confer full protection.
It is i.a. an object of the present invention to provide a H. fells urease which is able to induce protection against Helicobacter fells infection in dogs and cats. It was surprisingly found now, that in H. fells a second urease exists, of which the genes encoding the structural subunits share only low homology with the known H. fells ureA and B
genes.
The novel urease is named ureaseXY, in order to discriminate it from the known urease AB. The newly found urease has been discovered in H. fells, and is not present in H.
pylori.
The overall genetic structure of the genes encoding the two structural urease subunits, UreX and UreY is comparable to that of the known UreA and B in H. fells and H.
pylori.
The sequence homology is however surprisingly low. It was even more surprisingly found, that the homology between the ureA and B genes and the novel ureX and Y
genes in one single H. fells strain is even strikingly lower than the homology between the various ureA and B genes from the various Helicobacter species.
Table 1 a, 1 b and 1 c show the comparison of the ureX and Y gene and the polypeptides they encode form five different Helicobacter fells species, with the ureA and B genes and polypeptides from Helicobacfer fells, pylori and heilmannii.
The level of homology of the genes encoding the novel structural urease subunits X and Y and the polypeptides they encode as compared to that of known ureA and B
genes and polypeptide subunits is presented in table 1 a, b and c.
Reference molecule : H. fellsa.a. n.a.
ureX CS1 H. fells ureA 50 % 57 _ 52 % 60 H. pylori ureA
H. heilmannii ureA 54 % 62 H. fells strain Kukka ureX 100 % 91 H. fells strain Ds4 ureX 99 % 91 H. fells strain 2301 ureX 99 % 91 H. fells strain 390 ureX 99 % 91 Table 1a: amino acid and nucleic acid homology between the H. fells ureX and various ureA subunits.
Reference molecule : H. fellsa.a. n.a.
ureY CS1 H. fells urea _ 73 % 71 H. pylori urea 73 % 70 H. heilmannii urea 74 % 71 H. fells strain Kukka ureY 99 % 95 H. fells strain Ds4 ureY 98 % 94 H. fells strain 2301 ureY 99 % 95 Table 1 b: amino acid and nucleic acid homology between the H. fells ureY and various urea subunits.
Reference molecule: H. fells n.a. _ ureXY CSI
H. fe 67 lis ureAB _ _ 67 H. pylori ureAB
H. heilmannii ureAB 68 H. fells strain Kukka ureXY 94 H. fells strain Ds4 ureXY 94 H. fells strain 2301 ureXY 94 Table 1 c: nucleic acid homology between H, fells ureXY and various ureAB genes.
The level of homology of the genes encoding the novel structural urease subunits X and Y and the polypeptides they encode as compared to that of known ureA and B
genes and polypeptide subunits is presented in table 1 a, b and c.
Reference molecule : H. fellsa.a. n.a.
ureX CS1 H. fells ureA 50 % 57 _ 52 % 60 H. pylori ureA
H. heilmannii ureA 54 % 62 H. fells strain Kukka ureX 100 % 91 H. fells strain Ds4 ureX 99 % 91 H. fells strain 2301 ureX 99 % 91 H. fells strain 390 ureX 99 % 91 Table 1a: amino acid and nucleic acid homology between the H. fells ureX and various ureA subunits.
Reference molecule : H. fellsa.a. n.a.
ureY CS1 H. fells urea _ 73 % 71 H. pylori urea 73 % 70 H. heilmannii urea 74 % 71 H. fells strain Kukka ureY 99 % 95 H. fells strain Ds4 ureY 98 % 94 H. fells strain 2301 ureY 99 % 95 Table 1 b: amino acid and nucleic acid homology between the H. fells ureY and various urea subunits.
Reference molecule: H. fells n.a. _ ureXY CSI
H. fe 67 lis ureAB _ _ 67 H. pylori ureAB
H. heilmannii ureAB 68 H. fells strain Kukka ureXY 94 H. fells strain Ds4 ureXY 94 H. fells strain 2301 ureXY 94 Table 1 c: nucleic acid homology between H, fells ureXY and various ureAB genes.
One embodiment of the invention thus relates to nucleic acid sequences encoding the novel urease X and Y subunits.
First of all, this embodiment of the invention relates to nucleic acid sequences encoding two subunits of a urease complex such as expressed by Helicobacter fells, that have at least 85 % homology with SEQ ID NO: 1, or parts thereof with a length of at least 40, preferably 45, more preferably 50 nucleotides encoding at least an immunogenic fragment of one of the subunits. Still even longer fragments, with a length of at least 55, 60 or 70 nucleic acids are in that order even more preferred.
A preferred form of this embodiment relates to nucleic acid sequences that encode the urease X subunit polypeptide or the urease Y subunit polypeptide and that have at least 85 % homology with SEQ ID NO: 1, or parts thereof with a length of at least 40, preferably 45, more preferably 50 nucleotides encoding at least an immunogenic fragment of the urease X subunit polypeptide or the urease Y subunit polypeptide.
Merely as an example: the nucleic acid sequence encoding the unease X subunit of Helicobacter fells strain CS1 starts at position 206/207/208 (GTG) (See figure 1 a (1 )) and stops at position 884/885/886 (TAA). the nucleic acid sequence encoding the unease Y subunit of Helicobacter fells strain CS1 starts at position 897!898/899 (ATG) and stops at position 2601/260212603 (TAG).
Still even longer fragments, with a length of at least 55, 60 or 70 nucleic acids are in that order even more preferred.
A more preferred form of this embodiment relates to nucleic acid sequences having at least 90 %, preferably 94 %, more preferably 97 % homology with SEQ ID NO: 1.
The determination of the homology percentages was done with the computer program Align Plus for Windows, available from Scientific and Educational Software, P.O.Box 72045 Durham, NC 27722-2045, USA. Settings used for the nucleic acid comparisons are indicated in figures 1 a, 1 b and 1 c.
Since the present invention discloses nucleic acid sequences encoding novel structural Helicobacter fells unease subunits, it is now for the first time possible to obtain such polypeptides in sufFcient quantities. This can e.g. be done by using expression systems to express the genes encoding the UreX and UreY subunits.
Therefore, in a more preferred embodiment, the invention relates to DNA
fragments comprising a nucleic acid sequence according to the invention. Such DNA
fragments can e.g. be plasmids, into which a nucleic acid sequence according to the invention is cloned. Such DNA fragments are useful e.g. for enhancing the amount of DNA for use as a probe, as described below.
An essential requirement for the expression of the nucleic acid sequence is an adequate promoter operably linked to the nucleic acid sequence. It is obvious to those skilled in the art that the choice of a promoter extends to any eukaryotic, prokaryotic or viral promoter capable of directing gene transcription in cells used as host cells for protein expression.
Therefore, an even more preferred form of this embodiment relates to a recombinant DNA molecule comprising a DNA fragment or a nucleic acid sequence according to the invention that is placed under the control of a functionally linked promotor.
This can be obtained by means of e.g. standard molecular biology techniques.
(Maniatis/Sambrook (Sambrook, J. Molecular cloning: a laboratory manual, 1989. ISBN 0-87969-309-6).
First of all, this embodiment of the invention relates to nucleic acid sequences encoding two subunits of a urease complex such as expressed by Helicobacter fells, that have at least 85 % homology with SEQ ID NO: 1, or parts thereof with a length of at least 40, preferably 45, more preferably 50 nucleotides encoding at least an immunogenic fragment of one of the subunits. Still even longer fragments, with a length of at least 55, 60 or 70 nucleic acids are in that order even more preferred.
A preferred form of this embodiment relates to nucleic acid sequences that encode the urease X subunit polypeptide or the urease Y subunit polypeptide and that have at least 85 % homology with SEQ ID NO: 1, or parts thereof with a length of at least 40, preferably 45, more preferably 50 nucleotides encoding at least an immunogenic fragment of the urease X subunit polypeptide or the urease Y subunit polypeptide.
Merely as an example: the nucleic acid sequence encoding the unease X subunit of Helicobacter fells strain CS1 starts at position 206/207/208 (GTG) (See figure 1 a (1 )) and stops at position 884/885/886 (TAA). the nucleic acid sequence encoding the unease Y subunit of Helicobacter fells strain CS1 starts at position 897!898/899 (ATG) and stops at position 2601/260212603 (TAG).
Still even longer fragments, with a length of at least 55, 60 or 70 nucleic acids are in that order even more preferred.
A more preferred form of this embodiment relates to nucleic acid sequences having at least 90 %, preferably 94 %, more preferably 97 % homology with SEQ ID NO: 1.
The determination of the homology percentages was done with the computer program Align Plus for Windows, available from Scientific and Educational Software, P.O.Box 72045 Durham, NC 27722-2045, USA. Settings used for the nucleic acid comparisons are indicated in figures 1 a, 1 b and 1 c.
Since the present invention discloses nucleic acid sequences encoding novel structural Helicobacter fells unease subunits, it is now for the first time possible to obtain such polypeptides in sufFcient quantities. This can e.g. be done by using expression systems to express the genes encoding the UreX and UreY subunits.
Therefore, in a more preferred embodiment, the invention relates to DNA
fragments comprising a nucleic acid sequence according to the invention. Such DNA
fragments can e.g. be plasmids, into which a nucleic acid sequence according to the invention is cloned. Such DNA fragments are useful e.g. for enhancing the amount of DNA for use as a probe, as described below.
An essential requirement for the expression of the nucleic acid sequence is an adequate promoter operably linked to the nucleic acid sequence. It is obvious to those skilled in the art that the choice of a promoter extends to any eukaryotic, prokaryotic or viral promoter capable of directing gene transcription in cells used as host cells for protein expression.
Therefore, an even more preferred form of this embodiment relates to a recombinant DNA molecule comprising a DNA fragment or a nucleic acid sequence according to the invention that is placed under the control of a functionally linked promotor.
This can be obtained by means of e.g. standard molecular biology techniques.
(Maniatis/Sambrook (Sambrook, J. Molecular cloning: a laboratory manual, 1989. ISBN 0-87969-309-6).
Functionally linked promotors are promotors that are capable of controlling the transcription of the nucleic acid sequences to which they are linked.
When the host cells are bacteria, useful expression control sequences which may be used include the Trp promoter and operator (Goeddel, et al., Nucl. Acids Res., 8, 4057, 1980); the lac promoter and operator (Chang, et al., Nature, 275, 615, 1978);
the outer membrane protein promoter (Nakamura, K. and Inouge, M., EMBO J., 1, 771-775, 1982); the bacteriophage lambda promoters and operators (Remaut, E. et al., Nucl.
Acids Res., 11, 4677-4688, 1983); the a-amylase (B. subtilis) promoter and operator, termination sequences and other expression enhancement and control sequences compatible with the seled'ted host cell.
When the host cell is yeast, useful expression control sequences include, e.g., a-mating factor. For insect cells the polyhedrin or p10 promoters of baculoviruses can be used (Smith, G.E. et al., Mol. Cell. Biol. 3, 2156-65, 1983). When the host cell is of mammalian origin illustrative useful expression control sequences include the promoter (Berman, P.W. et al., Science, 222, 524-527, 1983) or the metallothionein promoter (Brinster, R.L., Nature, 296, 39-42, 1982) or a heat shock promoter (Voellmy et al., Proc. Natl. Acad. Sci. USA, 82, 4949-53, 1985).
Bacterial, yeast, fungal, insect and mammalian cell expression systems are very frequently used systems. Such systems are well-known in the art and generally available, e.g. commercially through Clontech Laboratories, Inc. 4030 Fabian Way, Palo Alto, California 94303-4607, USA. Next to these expression systems, parasite-based expression systems are very attractive expression systems. Such systems are e.g.
described in the French Patent Application with Publication number 2 714 074, and in US NTIS Publication No US 08/043109 (Hoffman, S. and Rogers, W.: Public. Date December 1993).
Thus a still even more preferred form of this embodiment of the invention relates to Live Recombinant Carrier micro-organisms (LRCs) comprising a gene encoding the UreX
or UreY polypeptide or an immunogenic fragment thereof according to the invention. Such micro-organisms are e.g. bacteria and viruses. These LRC micro-organisms are micro-organisms in which additional genetic information, in this case a gene encoding the UreX
or UreY polypeptide or an immunogenic fragment thereof according to the invention has been cloned. Animals infected with such LRCs will produce an immunogenic response not only against the immunogens of the vector, but also against the immunogenic parts of the polypeptide(s) for which the genetic code is additionally cloned into the LRC, e.g.
the ureX or Y gene.
As an example of bacterial LRCs, attenuated Salmonella strains known in the art can attractively be used.
Live recombinant carrier parasites have i.a. been described by Vermeulen, A.
N. (Int.
Journ. Parasitol. 28: 1121-1130 (1998)) Also, LRC viruses may be used as a way of transporting the nucleic acid sequence into a target cell. Live recombinant carrier viruses are also called vector viruses. The site of integration of the gene encoding a UreX or Y polypeptide may be a site in a viral gene that is not essential to the virus, or a site in an intergenic region. Viruses often used as vectors are Vaccinia viruses (Panicali et al; Proc. Natl. Acad. Sci. USA, 79:
4927 (1982), Herpesviruses (E.P.A. 0473210A2), and Retroviruses (Valerio, D. et al; in Baum, S.J., Dicke, K.A., Lotzova, E. and Pluznik, D.H. (Eds.), Experimental Haematology today -1988. Springer Verlag, New York: pp. 92-99 (1989)).
The technique of in vivo homologous recombination, well-known in the art, can be used to introduce a recombinant nucleic acid sequence into the genome of a bacterium, parasite or virus of choice, capable of inducing expression of the inserted nucleic acid sequence according to the invention in the host animal.
When the host cells are bacteria, useful expression control sequences which may be used include the Trp promoter and operator (Goeddel, et al., Nucl. Acids Res., 8, 4057, 1980); the lac promoter and operator (Chang, et al., Nature, 275, 615, 1978);
the outer membrane protein promoter (Nakamura, K. and Inouge, M., EMBO J., 1, 771-775, 1982); the bacteriophage lambda promoters and operators (Remaut, E. et al., Nucl.
Acids Res., 11, 4677-4688, 1983); the a-amylase (B. subtilis) promoter and operator, termination sequences and other expression enhancement and control sequences compatible with the seled'ted host cell.
When the host cell is yeast, useful expression control sequences include, e.g., a-mating factor. For insect cells the polyhedrin or p10 promoters of baculoviruses can be used (Smith, G.E. et al., Mol. Cell. Biol. 3, 2156-65, 1983). When the host cell is of mammalian origin illustrative useful expression control sequences include the promoter (Berman, P.W. et al., Science, 222, 524-527, 1983) or the metallothionein promoter (Brinster, R.L., Nature, 296, 39-42, 1982) or a heat shock promoter (Voellmy et al., Proc. Natl. Acad. Sci. USA, 82, 4949-53, 1985).
Bacterial, yeast, fungal, insect and mammalian cell expression systems are very frequently used systems. Such systems are well-known in the art and generally available, e.g. commercially through Clontech Laboratories, Inc. 4030 Fabian Way, Palo Alto, California 94303-4607, USA. Next to these expression systems, parasite-based expression systems are very attractive expression systems. Such systems are e.g.
described in the French Patent Application with Publication number 2 714 074, and in US NTIS Publication No US 08/043109 (Hoffman, S. and Rogers, W.: Public. Date December 1993).
Thus a still even more preferred form of this embodiment of the invention relates to Live Recombinant Carrier micro-organisms (LRCs) comprising a gene encoding the UreX
or UreY polypeptide or an immunogenic fragment thereof according to the invention. Such micro-organisms are e.g. bacteria and viruses. These LRC micro-organisms are micro-organisms in which additional genetic information, in this case a gene encoding the UreX
or UreY polypeptide or an immunogenic fragment thereof according to the invention has been cloned. Animals infected with such LRCs will produce an immunogenic response not only against the immunogens of the vector, but also against the immunogenic parts of the polypeptide(s) for which the genetic code is additionally cloned into the LRC, e.g.
the ureX or Y gene.
As an example of bacterial LRCs, attenuated Salmonella strains known in the art can attractively be used.
Live recombinant carrier parasites have i.a. been described by Vermeulen, A.
N. (Int.
Journ. Parasitol. 28: 1121-1130 (1998)) Also, LRC viruses may be used as a way of transporting the nucleic acid sequence into a target cell. Live recombinant carrier viruses are also called vector viruses. The site of integration of the gene encoding a UreX or Y polypeptide may be a site in a viral gene that is not essential to the virus, or a site in an intergenic region. Viruses often used as vectors are Vaccinia viruses (Panicali et al; Proc. Natl. Acad. Sci. USA, 79:
4927 (1982), Herpesviruses (E.P.A. 0473210A2), and Retroviruses (Valerio, D. et al; in Baum, S.J., Dicke, K.A., Lotzova, E. and Pluznik, D.H. (Eds.), Experimental Haematology today -1988. Springer Verlag, New York: pp. 92-99 (1989)).
The technique of in vivo homologous recombination, well-known in the art, can be used to introduce a recombinant nucleic acid sequence into the genome of a bacterium, parasite or virus of choice, capable of inducing expression of the inserted nucleic acid sequence according to the invention in the host animal.
Finally another form of this embodiment of the invention relates to a host cell comprising a nucleic acid sequence encoding a polypeptide according to the invention, a DNA
fragment comprising such a nucleic acid sequence or a recombinant DNA molecule comprising such a nucleic acid sequence under the control of a functionally linked promotor. This form alsor elates to a host cell containing a live recombinant carrier containing a nucleic acid molecule encoding a UreX or Y polypeptide or an immunogenic fragment thereof according to the invention.
A host cell may be a cell of bacterial origin, e.g. Escherichia coli, Bacillus subtilus and Lactobacillus species, in combination with bacteria-based plasmids as pBR322, or bacterial expression vectors as pGEX, or with bacteriophages. The host cell may also be of eukaryotic origin, e.g. yeast-cells in combination with yeast-specific vector molecules, or higher eukaryotic cells like insect cells (Luckow et al; Bio-technology 6:
47-55 (1988)) in combination with vectors or recombinant baculoviruses, plant cells in combination with e.g. Ti-plasmid based vectors or plant viral vectors (Barton, K.A. et al; Cell 32: 1033 (1983), mammalian cells like Hela cells, Chinese Hamster Ovary cells (CHO) or Crandell Feline Kidney-cells, also with appropriate vectors or recombinant viruses.
Another embodiment of the invention relates to the polypeptides encoded by the nucleic acid sequences, i.e. the urease X subunit and the urease Y subunit and to immunogenic fragments thereof according to the invention.
Therefore, this embodiment of the invention relates to the Helicobacter fells urease X
polypeptide, said polypeptide having an amino acid sequence that is at least homologous to SEQ ID NO: 2 or an immunogenic fragment of that polypeptide with a length of at least 40 amino acids that is capable of inducing an immune response against ureaseXY. Preferably, the length of that fragment is more than 40 amino acids, more preferably at least 45, 50, 55, 60 or 70 amino acids in that order or preference.
Preferably this embodiment relates to such polypeptides having a sequence homology of at least 90 %, more preferably 94 %, even more preferably 97 % homology to SEQ
ID
NO: 2, or an immunogenic fragment of that polypeptide with a length of at least 40 amino acids, more preferably at least 45, 50, 55, 60 or 70 amino acids in that order or preference that is capable of inducing an immune response against ureaseXY.
This embodiment of the invention also relates to the Helicobacfer fells urease Y
polypeptide, said polypeptide having an amino acid sequence that is at least homologous to SEQ ID NO: 3 or an immunogenic fragment of that polypeptide with a length of at least 40 amino acids that is capable of inducing an immune response against ureaseXY. Preferably, the length of that fragment is more than 40 amino acids, more preferably at least 45, 50, 55, 60 or 70 amino acids in that order or preference.
Preferably this embodiment relates to such polypeptides having a sequence homology of at least 90 %, more preferably 94 %, even more preferably 97 % homology to SEQ
ID
NO: 3, or an immunogenic fragment of that polypeptide with a length of at least 40 amino acids, more preferably at least 45, 50, 55, 60 or 70 amino acids in that order or preference that is capable of inducing an immune response against ureaseXY.
fragment comprising such a nucleic acid sequence or a recombinant DNA molecule comprising such a nucleic acid sequence under the control of a functionally linked promotor. This form alsor elates to a host cell containing a live recombinant carrier containing a nucleic acid molecule encoding a UreX or Y polypeptide or an immunogenic fragment thereof according to the invention.
A host cell may be a cell of bacterial origin, e.g. Escherichia coli, Bacillus subtilus and Lactobacillus species, in combination with bacteria-based plasmids as pBR322, or bacterial expression vectors as pGEX, or with bacteriophages. The host cell may also be of eukaryotic origin, e.g. yeast-cells in combination with yeast-specific vector molecules, or higher eukaryotic cells like insect cells (Luckow et al; Bio-technology 6:
47-55 (1988)) in combination with vectors or recombinant baculoviruses, plant cells in combination with e.g. Ti-plasmid based vectors or plant viral vectors (Barton, K.A. et al; Cell 32: 1033 (1983), mammalian cells like Hela cells, Chinese Hamster Ovary cells (CHO) or Crandell Feline Kidney-cells, also with appropriate vectors or recombinant viruses.
Another embodiment of the invention relates to the polypeptides encoded by the nucleic acid sequences, i.e. the urease X subunit and the urease Y subunit and to immunogenic fragments thereof according to the invention.
Therefore, this embodiment of the invention relates to the Helicobacter fells urease X
polypeptide, said polypeptide having an amino acid sequence that is at least homologous to SEQ ID NO: 2 or an immunogenic fragment of that polypeptide with a length of at least 40 amino acids that is capable of inducing an immune response against ureaseXY. Preferably, the length of that fragment is more than 40 amino acids, more preferably at least 45, 50, 55, 60 or 70 amino acids in that order or preference.
Preferably this embodiment relates to such polypeptides having a sequence homology of at least 90 %, more preferably 94 %, even more preferably 97 % homology to SEQ
ID
NO: 2, or an immunogenic fragment of that polypeptide with a length of at least 40 amino acids, more preferably at least 45, 50, 55, 60 or 70 amino acids in that order or preference that is capable of inducing an immune response against ureaseXY.
This embodiment of the invention also relates to the Helicobacfer fells urease Y
polypeptide, said polypeptide having an amino acid sequence that is at least homologous to SEQ ID NO: 3 or an immunogenic fragment of that polypeptide with a length of at least 40 amino acids that is capable of inducing an immune response against ureaseXY. Preferably, the length of that fragment is more than 40 amino acids, more preferably at least 45, 50, 55, 60 or 70 amino acids in that order or preference.
Preferably this embodiment relates to such polypeptides having a sequence homology of at least 90 %, more preferably 94 %, even more preferably 97 % homology to SEQ
ID
NO: 3, or an immunogenic fragment of that polypeptide with a length of at least 40 amino acids, more preferably at least 45, 50, 55, 60 or 70 amino acids in that order or preference that is capable of inducing an immune response against ureaseXY.
As for the nucleotide sequence comparison, the comparison between the various amino acid sequences was made using Align Plus for Windows, available from Scientific and Educational Software, P.O.Box 72045 Durham, NC 27722-2045, USA. Settings used for the amino acid comparisons are indicated in figures 1 a, 1 b and 1 c.
It will be understood that, for the particular polypeptides embraced herein, natural variations can exist between individual Helicobacter fells strains. These variations may be demonstrated by (an) amino acid differences) in the overall sequence or by deletions, substitutions, insertions, inversions or additions of (an) amino acids) in said sequence. Amino acid substitutions which do not essentially alter biological and immunological activities, have been described, e.g. by Neurath et al in "The Proteins"
Academic Press New York (1979). Amino acid replacements between related amino acids or replacements which have occurred frequently in evolution are, inter alia, Ser/Ala, SerIGly, Asp/Gly, Asp/Asn, IIe/Val (see Dayhof, M.D., Atlas of protein sequence and structure, Nat. Biomed. Res. Found., Washington D.C., 1978, vol. 5, suppl.
3). Other amino acid substitutions include Asp/Glu, Thr/Ser, Ala/Gly, AIa/Thr, Ser/Asn, AIa/Val, Thr/Phe, Ala/Pro, Lys/Arg, Leu/lle, Leu/Val and Ala/Glu. Based on this information, Lipman and Pearson developed a method for rapid and sensitive protein comparison (Science,227, 1435-1441, 1985) and determining the functional similarity between homologous proteins. Such amino acid substitutions of the exemplary embodiments of this invention, as well as variations having deletions and/or insertions are within the scope of the invention as long as the resulting polypeptides retain their immunoreactivity.
Thus, variations not essentially influencing the immunogenicity of the polypeptide compared to the wild type polypeptide as depicted in SEQ ID NO: 2 or 3 are considered to fall within the scope of the invention. Those variations in the amino acid sequence of a certain structural subunit X or Y according to the invention that still provide a polypeptide capable of inducing an immune response against infection with H. fells or at least against the clinical manifestations of the infection are considered as "not essentially influencing the immunogenicity".
When a polypeptide is used for e.g. vaccination purposes or for raising antibodies, it is however not necessary to use the whole polypeptide. It is also possible to use a fragment of that polypeptide that is capable, as such or coupled to a carrier such as e.g.
KLH, of inducing an immune response against that polypeptide, a so-called immunogenic fragment. An "immunogenic fragment" is understood to be a fragment of the full-length polypeptide of the structural subunit X or Y, that still has retained its capability to induce an immune response in the host, i.e. comprises a B- or T-cell epitope. At this moment, a variety of techniques is available to easily identify DNA
fragments encoding antigenic fragments (determinants). The method described by Geysen et al (Patent Application WO 84/03564, Patent Application WO 86/06487, US
Patent NR. 4,833,092, Proc. Natl Acad. Sci. 81: 3998-4002 (1984), J. Imm.
Meth. 102, 259-274 (1987), the so-called PEPSCAN method is an easy to perform, quick and well-established method for the detection of epitopes; the immunologically important regions of the polypeptide. The method is used world-wide and as such well-known to man skilled in the art. This (empirical) method is especially suitable for the detection of B-cell epitopes. Also, given the sequence of the gene encoding any protein, computer algorithms are able to designate specific polypeptide fragments as the immunologically important epitopes on the basis of their sequential and/or structural agreement with epitopes that are now known. The determination of these regions is based on a combination of the hydrophilicity criteria according to Hopp and Woods (Proc.
Natl.
It will be understood that, for the particular polypeptides embraced herein, natural variations can exist between individual Helicobacter fells strains. These variations may be demonstrated by (an) amino acid differences) in the overall sequence or by deletions, substitutions, insertions, inversions or additions of (an) amino acids) in said sequence. Amino acid substitutions which do not essentially alter biological and immunological activities, have been described, e.g. by Neurath et al in "The Proteins"
Academic Press New York (1979). Amino acid replacements between related amino acids or replacements which have occurred frequently in evolution are, inter alia, Ser/Ala, SerIGly, Asp/Gly, Asp/Asn, IIe/Val (see Dayhof, M.D., Atlas of protein sequence and structure, Nat. Biomed. Res. Found., Washington D.C., 1978, vol. 5, suppl.
3). Other amino acid substitutions include Asp/Glu, Thr/Ser, Ala/Gly, AIa/Thr, Ser/Asn, AIa/Val, Thr/Phe, Ala/Pro, Lys/Arg, Leu/lle, Leu/Val and Ala/Glu. Based on this information, Lipman and Pearson developed a method for rapid and sensitive protein comparison (Science,227, 1435-1441, 1985) and determining the functional similarity between homologous proteins. Such amino acid substitutions of the exemplary embodiments of this invention, as well as variations having deletions and/or insertions are within the scope of the invention as long as the resulting polypeptides retain their immunoreactivity.
Thus, variations not essentially influencing the immunogenicity of the polypeptide compared to the wild type polypeptide as depicted in SEQ ID NO: 2 or 3 are considered to fall within the scope of the invention. Those variations in the amino acid sequence of a certain structural subunit X or Y according to the invention that still provide a polypeptide capable of inducing an immune response against infection with H. fells or at least against the clinical manifestations of the infection are considered as "not essentially influencing the immunogenicity".
When a polypeptide is used for e.g. vaccination purposes or for raising antibodies, it is however not necessary to use the whole polypeptide. It is also possible to use a fragment of that polypeptide that is capable, as such or coupled to a carrier such as e.g.
KLH, of inducing an immune response against that polypeptide, a so-called immunogenic fragment. An "immunogenic fragment" is understood to be a fragment of the full-length polypeptide of the structural subunit X or Y, that still has retained its capability to induce an immune response in the host, i.e. comprises a B- or T-cell epitope. At this moment, a variety of techniques is available to easily identify DNA
fragments encoding antigenic fragments (determinants). The method described by Geysen et al (Patent Application WO 84/03564, Patent Application WO 86/06487, US
Patent NR. 4,833,092, Proc. Natl Acad. Sci. 81: 3998-4002 (1984), J. Imm.
Meth. 102, 259-274 (1987), the so-called PEPSCAN method is an easy to perform, quick and well-established method for the detection of epitopes; the immunologically important regions of the polypeptide. The method is used world-wide and as such well-known to man skilled in the art. This (empirical) method is especially suitable for the detection of B-cell epitopes. Also, given the sequence of the gene encoding any protein, computer algorithms are able to designate specific polypeptide fragments as the immunologically important epitopes on the basis of their sequential and/or structural agreement with epitopes that are now known. The determination of these regions is based on a combination of the hydrophilicity criteria according to Hopp and Woods (Proc.
Natl.
Acad. Sci. 78: 38248-3828 (1981 )), and the secondary structure aspects according to Chou and Fasman (Advances in Enzymology 47: 45-148 (1987) and US Patent 4,554,101). T-cell epitopes can likewise be predicted from the sequence by computer with the aid of Berzofsky's amphiphilicity criterion (Science 235, 1059-1062 (1987) and US Patent application NTIS US 07/005,885). A condensed overview is found in:
Shan Lu on common principles: Tibtech 9: 238-242 (1991 ), Good et al on Malaria epitopes;
Science 235: 1059-1062 (1987), Lu for a review; Vaccine 10: 3-7 (1992), Berzowsky for HIV-epitopes; The FASEB Journal 5:2412-2418 (1991 ).
Vaccines against e.g. He1icobacter pylori, which has only one urease, can be made on the basis of this urease, as was described above. In the specific case of Helicobacter fells however a vaccine based upon the known Helicobacter fells structural subunits ureA and B is not capable of providing sufficient protection against Helicobacter fells infection: immunity against structural subunits ureA and B allegedly does not neutralise the urease activity of the newly found heterologous structural subunits UreX
and Y.
Therefore, vaccines for the protection of animals against Helicobacter fells infection should at least be directed against the novel urease XY.
Therefore, one form of still another embodiment of the invention relates to vaccines capable of protecting mammals such as dogs and cats against Helicobacter fells infection, that comprise the structural subunit X or Y, preferably X and Y, more preferably X, Y, A and B, or an immunogenic fragment of X and/or Y according to the invention together with a pharmaceutically acceptable carrier.
Still another embodiment of the present invention relates to the polypeptides according to the invention for use in a vaccine.
Still another embodiment relates to the use of the polypeptide according to the invention in the manufacturing of a vaccine for combating Helicobacter fells infections.
One way of making a vaccine according to the invention is by biochemical purification of the ureaseXY polypeptide or its subunits from a bacterial culture. This can e.g. be done by centrifugation of the bacteria, and the use of gel-filtration columns for separation of the urease polypeptide or its subunits from other components. Further purification may e.g. be done by selective precipitation in ammonium-sulphate, followed by centrifugation, gel electrophoresis and, if desired, separation from the urease AB subunits and dissolving the pellet in a suitable buffer. This is however a time-consuming way of making the vaccine, especially where Helicobacter fells is difficult to grow.
It is therefore much more convenient to use the expression products of the genes encoding the urease X and Y subunits according to the invention in vaccines.
Such vaccines can easily be made by admixing ureaseXY or an UreX or Y subunit or an immunological fragment thereof according to the invention with a pharmaceutically acceptable carrier as described below.
Furthermore vaccines can comprise live recombinant carriers as described above, capable of expressing ureaseXY, an UreX or UreY subunit or immunogenic fragments thereof according to the invention. Such vaccines, e.g. based upon a Salmonella carrier or a viral carrier infecting the gastric epithelium have the advantage over subunit vaccines that they better mimic the natural way of infection of Helicobacfer fells.
Shan Lu on common principles: Tibtech 9: 238-242 (1991 ), Good et al on Malaria epitopes;
Science 235: 1059-1062 (1987), Lu for a review; Vaccine 10: 3-7 (1992), Berzowsky for HIV-epitopes; The FASEB Journal 5:2412-2418 (1991 ).
Vaccines against e.g. He1icobacter pylori, which has only one urease, can be made on the basis of this urease, as was described above. In the specific case of Helicobacter fells however a vaccine based upon the known Helicobacter fells structural subunits ureA and B is not capable of providing sufficient protection against Helicobacter fells infection: immunity against structural subunits ureA and B allegedly does not neutralise the urease activity of the newly found heterologous structural subunits UreX
and Y.
Therefore, vaccines for the protection of animals against Helicobacter fells infection should at least be directed against the novel urease XY.
Therefore, one form of still another embodiment of the invention relates to vaccines capable of protecting mammals such as dogs and cats against Helicobacter fells infection, that comprise the structural subunit X or Y, preferably X and Y, more preferably X, Y, A and B, or an immunogenic fragment of X and/or Y according to the invention together with a pharmaceutically acceptable carrier.
Still another embodiment of the present invention relates to the polypeptides according to the invention for use in a vaccine.
Still another embodiment relates to the use of the polypeptide according to the invention in the manufacturing of a vaccine for combating Helicobacter fells infections.
One way of making a vaccine according to the invention is by biochemical purification of the ureaseXY polypeptide or its subunits from a bacterial culture. This can e.g. be done by centrifugation of the bacteria, and the use of gel-filtration columns for separation of the urease polypeptide or its subunits from other components. Further purification may e.g. be done by selective precipitation in ammonium-sulphate, followed by centrifugation, gel electrophoresis and, if desired, separation from the urease AB subunits and dissolving the pellet in a suitable buffer. This is however a time-consuming way of making the vaccine, especially where Helicobacter fells is difficult to grow.
It is therefore much more convenient to use the expression products of the genes encoding the urease X and Y subunits according to the invention in vaccines.
Such vaccines can easily be made by admixing ureaseXY or an UreX or Y subunit or an immunological fragment thereof according to the invention with a pharmaceutically acceptable carrier as described below.
Furthermore vaccines can comprise live recombinant carriers as described above, capable of expressing ureaseXY, an UreX or UreY subunit or immunogenic fragments thereof according to the invention. Such vaccines, e.g. based upon a Salmonella carrier or a viral carrier infecting the gastric epithelium have the advantage over subunit vaccines that they better mimic the natural way of infection of Helicobacfer fells.
Moreover, their self propagation is an advantage since only low amounts of the recombinant carrier are necessary for immunisation.
Vaccines described above all contribute to active vaccination, i.e. the host's immune system is triggered by the UreX and/or Y polypeptide or immunogenic fragments thereof, to make antibodies against these polypeptides.
Alternatively, such antibodies can be raised in e.g. rabbits or can be obtained from antibody-producing cell lines as described below. Such antibodies can then be administered to the host animal. This method of vaccination, passive vaccination, is the vaccination of choice when an animal is already infected, and there is no time to allow the natural immune response to be triggered. It is also the preferred method for vaccinating immune-compromised animals. Administered antibodies against Helicobacter UreX or UreY can in these cases bind directly to the urease excreted by the bacteria. This has the advantage that the unease activity is directly eliminated, thus resulting in acidification of the environment and decreased or stopped Helicobacter growth.
Therefore, one other form of this embodiment of the invention relates to vaccines comprising antibodies against Helicobacter fells unease X polypeptides that have an amino acid sequence that is at least 85 % homologous to SEQ ID NO: 2 or immunogenic fragments of that polypeptide with a length of at least 40 amino acids that are capable of inducing an immune response against ureaseXY or antibodies against Helicobacter fells unease Y polypeptides that have an amino acid sequence that is at least 85 % homologous to SEQ ID NO: 3 or immunogenic fragments of that polypeptide with a length of at least 40 amino acids that are capable of inducing an immune response against ureaseXY.
Vaccines can also be based upon host cells as described above, that comprise ureaseXY, an UreX or UreY subunit or immunogenic fragments thereof according to the invention.
An alternative and efficient way of vaccination is direct vaccination with DNA
encoding the relevant antigen. Direct vaccination with DNA encoding polypeptides has been successful for many different polypeptides. (As reviewed in e.g. Donnelly et al., The Immunologist 2: 20-26 (1993)).
This way of vaccination is very attractive for the vaccination of both cats and dogs against Helicobacter fells infection.
Therefore, still other forms of this embodiment of the invention relate to vaccines comprising nucleic acid sequences encoding a polypeptide according to the invention or immunogenic fragments thereof according to the invention, and to vaccines comprising DNA fragments that comprise such nucleic acid sequences.
Still other forms of this embodiment relate to vaccines comprising recombinant DNA
molecules according to the invention.
DNA vaccines can easily be administered through intradermal application e.g.
using a needle-less injector. This way of administration delivers the DNA directly into the cells of the animal to be vaccinated. Amount of DNA in the microgram range between 1 and 100 ~g provide very good results.
In a further embodiment, the vaccine according to the present invention also comprises antigens from other dog or cat pathogenic organisms and viruses, or genetic information encoding such antigens. Such organisms and viruses are e.g. Feline Infectious Peritonitis virus, Feline Immune deficiency virus, Canine and Feline Parvovirus, Distemper virus, Adenovirus, Calicivirus, Bordetella bronchiseptica, Bon-elia burgdorferi, Leptospira interrogans, Chlamydia and Bartonella henseli.
Also, the present invention relates to polypeptides according to the invention for use in the manufacturing of a vaccine for combating Helicobacter fells infections.
All vaccines according to the present invention comprise a pharmaceutically acceptable carrier. A pharmaceutically acceptable carrier can be e.g. sterile water or a sterile physiological salt solution. In a more complex form the carrier can e.g. be a buffer.
Vaccines according to the present invention may in a preferred presentation also contain an adjuvant. Adjuvants in general comprise substances that boost the immune response of the host in a non-specific manner. A number of different adjuvants are known in the art. Examples of adjuvants are Freunds Complete and Incomplete adjuvant, vitamin E, non-ionic block polymers, muramyldipeptides, Quill A(R), mineral oil e.g.
Bayol(R) or Markol(R), vegetable oil, and Carbopol(R) (a homopolymer), or Diluvac(R) Forte.
The vaccine may also comprise a so-called "vehicle". A vehicle is a compound to which the polypeptide adheres, without being covalently bound to it. Often used vehicle compounds are e.g. aluminium hydroxide, -phosphate or -oxide, silica, Kaolin, and Bentonite.
A special form of such a vehicle, in which the antigen is partially embedded in the vehicle, is the so-called ISCOM (EP 109.942, EP 180.564, EP 242.380) In addition, the vaccine may comprise one or more suitable surface-active compounds or emulsifiers, e.g. Span or Tween.
Often, the vaccine is mixed with stabilisers, e.g. to protect degradation-prone polypeptides from being degraded, to enhance the shelf-life of the vaccine, or to improve freeze-drying efficiency. Useful stabilisers are i.a. SPGA (Bovarnik et al; J.
Bacteriology 59: 509 (1950)), carbohydrates e.g. sorbitol, mannitol, trehalose, starch, sucrose, dextran or glucose, proteins such as albumin or casein or degradation products thereof, and buffers, such as alkali metal phosphates.
In addition, the vaccine may be suspended in a physiologically acceptable diluent.
It goes without saying, that other ways of adjuvating, adding vehicle compounds or diluents, emulsifying or stabilising a polypeptide are also embodied in the present invention.
Vaccines according to the invention that comprise the UreX or UreY subunit polypeptide can very suitably be administered in amounts ranging between 1 and 100 micrograms, although smaller doses can in principle be used. A dose exceeding 100 micrograms will, although immunologically very suitable, be less attractive for commercial reasons.
Vaccines based upon live attenuated recombinant carriers, such as the LRC-viruses and bacteria described above can be administered in much lower doses, because they multiply themselves during the infection. Therefore, very suitable amounts would range between 103 and 109 CFU/PFU for respectively bacteria and viruses.
Many ways of administration can be applied. Intranasal application is a frequently used way of administrating a vaccine. Oral application is also an attractive way of administration, because the infection is often located in the upper digestive tract. A
preferred way of oral administration is the packaging of the vaccine in capsules, known and frequently used in the art, that only disintegrate in the highly acidic environment of the stomach. Also, the vaccine could be mixed with compounds known in the art for temporarily enhancing the pH of the stomach.
Systemic application is also suitable, e.g. by intramuscular application of the vaccine. If this route is followed, standard procedures known in the art for systemic application are 5 well-suited.
Another embodiment of the invention relates to diagnostic tests for the detection of H.
fells infection. It is known that several Helicobacter species such as H.
bizzozeronii, H.
fells and H. salomonis are capable of infecting both cats and dogs. Of these three, H.
Vaccines described above all contribute to active vaccination, i.e. the host's immune system is triggered by the UreX and/or Y polypeptide or immunogenic fragments thereof, to make antibodies against these polypeptides.
Alternatively, such antibodies can be raised in e.g. rabbits or can be obtained from antibody-producing cell lines as described below. Such antibodies can then be administered to the host animal. This method of vaccination, passive vaccination, is the vaccination of choice when an animal is already infected, and there is no time to allow the natural immune response to be triggered. It is also the preferred method for vaccinating immune-compromised animals. Administered antibodies against Helicobacter UreX or UreY can in these cases bind directly to the urease excreted by the bacteria. This has the advantage that the unease activity is directly eliminated, thus resulting in acidification of the environment and decreased or stopped Helicobacter growth.
Therefore, one other form of this embodiment of the invention relates to vaccines comprising antibodies against Helicobacter fells unease X polypeptides that have an amino acid sequence that is at least 85 % homologous to SEQ ID NO: 2 or immunogenic fragments of that polypeptide with a length of at least 40 amino acids that are capable of inducing an immune response against ureaseXY or antibodies against Helicobacter fells unease Y polypeptides that have an amino acid sequence that is at least 85 % homologous to SEQ ID NO: 3 or immunogenic fragments of that polypeptide with a length of at least 40 amino acids that are capable of inducing an immune response against ureaseXY.
Vaccines can also be based upon host cells as described above, that comprise ureaseXY, an UreX or UreY subunit or immunogenic fragments thereof according to the invention.
An alternative and efficient way of vaccination is direct vaccination with DNA
encoding the relevant antigen. Direct vaccination with DNA encoding polypeptides has been successful for many different polypeptides. (As reviewed in e.g. Donnelly et al., The Immunologist 2: 20-26 (1993)).
This way of vaccination is very attractive for the vaccination of both cats and dogs against Helicobacter fells infection.
Therefore, still other forms of this embodiment of the invention relate to vaccines comprising nucleic acid sequences encoding a polypeptide according to the invention or immunogenic fragments thereof according to the invention, and to vaccines comprising DNA fragments that comprise such nucleic acid sequences.
Still other forms of this embodiment relate to vaccines comprising recombinant DNA
molecules according to the invention.
DNA vaccines can easily be administered through intradermal application e.g.
using a needle-less injector. This way of administration delivers the DNA directly into the cells of the animal to be vaccinated. Amount of DNA in the microgram range between 1 and 100 ~g provide very good results.
In a further embodiment, the vaccine according to the present invention also comprises antigens from other dog or cat pathogenic organisms and viruses, or genetic information encoding such antigens. Such organisms and viruses are e.g. Feline Infectious Peritonitis virus, Feline Immune deficiency virus, Canine and Feline Parvovirus, Distemper virus, Adenovirus, Calicivirus, Bordetella bronchiseptica, Bon-elia burgdorferi, Leptospira interrogans, Chlamydia and Bartonella henseli.
Also, the present invention relates to polypeptides according to the invention for use in the manufacturing of a vaccine for combating Helicobacter fells infections.
All vaccines according to the present invention comprise a pharmaceutically acceptable carrier. A pharmaceutically acceptable carrier can be e.g. sterile water or a sterile physiological salt solution. In a more complex form the carrier can e.g. be a buffer.
Vaccines according to the present invention may in a preferred presentation also contain an adjuvant. Adjuvants in general comprise substances that boost the immune response of the host in a non-specific manner. A number of different adjuvants are known in the art. Examples of adjuvants are Freunds Complete and Incomplete adjuvant, vitamin E, non-ionic block polymers, muramyldipeptides, Quill A(R), mineral oil e.g.
Bayol(R) or Markol(R), vegetable oil, and Carbopol(R) (a homopolymer), or Diluvac(R) Forte.
The vaccine may also comprise a so-called "vehicle". A vehicle is a compound to which the polypeptide adheres, without being covalently bound to it. Often used vehicle compounds are e.g. aluminium hydroxide, -phosphate or -oxide, silica, Kaolin, and Bentonite.
A special form of such a vehicle, in which the antigen is partially embedded in the vehicle, is the so-called ISCOM (EP 109.942, EP 180.564, EP 242.380) In addition, the vaccine may comprise one or more suitable surface-active compounds or emulsifiers, e.g. Span or Tween.
Often, the vaccine is mixed with stabilisers, e.g. to protect degradation-prone polypeptides from being degraded, to enhance the shelf-life of the vaccine, or to improve freeze-drying efficiency. Useful stabilisers are i.a. SPGA (Bovarnik et al; J.
Bacteriology 59: 509 (1950)), carbohydrates e.g. sorbitol, mannitol, trehalose, starch, sucrose, dextran or glucose, proteins such as albumin or casein or degradation products thereof, and buffers, such as alkali metal phosphates.
In addition, the vaccine may be suspended in a physiologically acceptable diluent.
It goes without saying, that other ways of adjuvating, adding vehicle compounds or diluents, emulsifying or stabilising a polypeptide are also embodied in the present invention.
Vaccines according to the invention that comprise the UreX or UreY subunit polypeptide can very suitably be administered in amounts ranging between 1 and 100 micrograms, although smaller doses can in principle be used. A dose exceeding 100 micrograms will, although immunologically very suitable, be less attractive for commercial reasons.
Vaccines based upon live attenuated recombinant carriers, such as the LRC-viruses and bacteria described above can be administered in much lower doses, because they multiply themselves during the infection. Therefore, very suitable amounts would range between 103 and 109 CFU/PFU for respectively bacteria and viruses.
Many ways of administration can be applied. Intranasal application is a frequently used way of administrating a vaccine. Oral application is also an attractive way of administration, because the infection is often located in the upper digestive tract. A
preferred way of oral administration is the packaging of the vaccine in capsules, known and frequently used in the art, that only disintegrate in the highly acidic environment of the stomach. Also, the vaccine could be mixed with compounds known in the art for temporarily enhancing the pH of the stomach.
Systemic application is also suitable, e.g. by intramuscular application of the vaccine. If this route is followed, standard procedures known in the art for systemic application are 5 well-suited.
Another embodiment of the invention relates to diagnostic tests for the detection of H.
fells infection. It is known that several Helicobacter species such as H.
bizzozeronii, H.
fells and H. salomonis are capable of infecting both cats and dogs. Of these three, H.
10 fells is the species suspected to cause most of the pathology, although it is often outnumbered by H. bizzozeronii and H. salomonis. Thus, a quick and correct diagnosis of disease, in both cats and dogs, caused by Helicobacter fells is important.
It has however been very difficult to discriminate between these three species due to the fact that they are so very closely related.
Therefore it is another objective of this invention to provide such diagnostic tools suitable for discriminating H. fells from other Helicobacter species.
On the basis of the novel urease polypeptides and the genes encoding the urease polypeptides, at least three different diagnostic tests, specifically suitable for the discrimination of H. fells from other members of the Helicobacter family were developed:
1 ) a diagnostic test based upon the presence or absence of DNA encoding the specific UreX and UreY structural subunits 2) a diagnostic test based upon the detection of antibodies against the specific UreX and UreY structural subunits 3) a diagnostic test based upon the detection of antigenic material of the specific UreX
and UreY structural subunits A diagnostic test according to 1 ) is e.g. based upon the reaction of bacterial DNA
isolated from the animal to be tested, with specific probes or PCR-primers based upon the sequence of ureX or Y genes. If H. fells DNA is present in the animal, this will e.g.
specifically bind to ureX or Y specific PCR-primers and will subsequently become amplified in PCR-reaction. The PCR-reaction product can then easily be detected in DNA gel electrophoresis.
The DNA can most easily be isolated from the micro-organisms present in swabs of the upper digestive tract or in the saliva of the animal to be tested. Specific primers can easily be selected from the many regions of the ureX and ureY coding sequences and the non-coding intergenic sequence that differ in sequence from the comparable regions in the ureAB coding sequences. One of the many algorithms suitable for the determination of the level of nucleic acid homology and for comparison of nucleotide sequences in general is known as "Clustal W". It has been described by Thompson et al., in Nucleic Acid Research 22: 4673-4680 (1994). The program can be found at several sites on Internet. An more recent alternative for this program is e.g.
Align Plus for Windows, available from Scientific and Educational Software, P.O.Box 72045 Durham, NC 27722-2045, USA.
As follows from figure 1, a large number of possible PCR-primers can be found that are specific for ureX or ureY. An extremely specific pair of PCR-probes is e.g.
formed by the 5'-located sequence CATGCACTTTTTGAAAAAAGA (SEQ ID NO: 16) and the 3'-located sequence TATGGTGGTCTTCTCT (SEQ ID NO: 17). Of course many other sequences that are specific for ureX or Y or the intergenic region are suitable. Standard PCR-textbooks give methods for determining the suitability of the probes for selective PCR-reactions with ureX or ureY. PCR-techniques are extensively described in (Dieffenbach & Dreksler; PCR primers, a laboratory manual. ISBN 0-87969-447-5 ( 1995)).
Another DNA-based test is based upon growth of bacterial material obtained from the swab, followed by classical DNA purification followed by classical hybridisation with radioactively or colour-labelled ureXY-specific DNA-fragments. Given the very low homology between the ureXY-coding regions and the ureAB coding regions of both H.
fells and other Helicobacter species, hybridisation unambiguously indicates the presence or absence of H. fells. Both PCR-reactions and hybridisation reactions are well-known in the art and are i.a. described in Maniatis/Sambrook (Sambrook, J. et al.
Molecular cloning: a laboratory manual. ISBN 0-87969-309-6).
Selective detection with PCR-primers or with classical hybridisation with ureXY-specific DNA-fragments can be done with fragments that preferably are short, but for practical reasons preferably consist of a stretch of at least 10 contiguous nucleotides of SEQ ID
NO: 1. It is clear that for hybridisation experiments a probe needs to be selected that has a higher homology to SEQ ID NO: 1, than to sequences encoding the Helicobacter ureA
or urea subunit. Such a probe can very easily be selected with the help of the Align Plus for Windows program or the Clustal W program as discussed above. In a comparative hybridisation experiment the DNA to be diagnosed can be tested next to e.g. H.
pylori DNA. The probe according to the invention, having a higher homology to SEQ ID
NO: 1 than to a gene encoding ureAB, would bind better to H. fells DNA (if present in the sample) than to DNA of other Helicobacter species thus specifically revealing the presence of H. fells DNA in the sample to be tested. The sequences SEQ ID NO:
16 or 17 mentioned above are merely examples of probes very suitable for labelling and subsequent use in the H. fells-specific hybridisation assays as described.
Thus, one embodiment of the invention relates to a diagnostic test for the detection of DNA encoding the specific Helicobacter UreX and UreY subunit polypeptides.
Such a test comprises a nucleic acid sequence according to the invention or a fragment thereof that is specific for the DNA encoding UreX and UreY or the intergenic region between UreX and UreY. A fragment that is specific for that DNA is a fragment that binds better to the DNA encoding UreX and UreY or the intergenic region between UreX and UreY
than to the DNA encoding UreA and Urea or the intergenic region between UreA and Urea.
Methods for the detection of Helicobacter fells DNA comprise hybridisation of the DNA to be tested with UreX or Y DNA, or PCR-reaction of the DNA to be tested with UreX or Y
DNA specific probes.
A diagnostic test according to 2) for the detection of Helicobacter fells antibodies in sera can be e.g. a simple sandwich-ELISA-test in which purified UreX or UreY
subunit polypeptides or antigenic fragments thereof according to the invention are coated to the wall of the wells of an ELISA-plate. A method for the detection of such antibodies is e.g.
Incubation of purified UreX or Y polypeptide with serum from mammals to be tested, followed by e.g. incubation with a labelled antibody against the relevant mammalian antibody. A colour reaction can then reveal the presence or absence of antibodies against Helicobacter fells urease XY. Depending on the labelled antibodies used, the selectivity of this system can be improved by pre-incubation of the serum to be tested with urease AB followed by spinning down the precipitate, in order to avoid non-XY-specific reactions.
If antigenic fragments of the UreX or UreY structural subunits according to the invention are used for coating, this pre-incubation step can be skipped.
It has however been very difficult to discriminate between these three species due to the fact that they are so very closely related.
Therefore it is another objective of this invention to provide such diagnostic tools suitable for discriminating H. fells from other Helicobacter species.
On the basis of the novel urease polypeptides and the genes encoding the urease polypeptides, at least three different diagnostic tests, specifically suitable for the discrimination of H. fells from other members of the Helicobacter family were developed:
1 ) a diagnostic test based upon the presence or absence of DNA encoding the specific UreX and UreY structural subunits 2) a diagnostic test based upon the detection of antibodies against the specific UreX and UreY structural subunits 3) a diagnostic test based upon the detection of antigenic material of the specific UreX
and UreY structural subunits A diagnostic test according to 1 ) is e.g. based upon the reaction of bacterial DNA
isolated from the animal to be tested, with specific probes or PCR-primers based upon the sequence of ureX or Y genes. If H. fells DNA is present in the animal, this will e.g.
specifically bind to ureX or Y specific PCR-primers and will subsequently become amplified in PCR-reaction. The PCR-reaction product can then easily be detected in DNA gel electrophoresis.
The DNA can most easily be isolated from the micro-organisms present in swabs of the upper digestive tract or in the saliva of the animal to be tested. Specific primers can easily be selected from the many regions of the ureX and ureY coding sequences and the non-coding intergenic sequence that differ in sequence from the comparable regions in the ureAB coding sequences. One of the many algorithms suitable for the determination of the level of nucleic acid homology and for comparison of nucleotide sequences in general is known as "Clustal W". It has been described by Thompson et al., in Nucleic Acid Research 22: 4673-4680 (1994). The program can be found at several sites on Internet. An more recent alternative for this program is e.g.
Align Plus for Windows, available from Scientific and Educational Software, P.O.Box 72045 Durham, NC 27722-2045, USA.
As follows from figure 1, a large number of possible PCR-primers can be found that are specific for ureX or ureY. An extremely specific pair of PCR-probes is e.g.
formed by the 5'-located sequence CATGCACTTTTTGAAAAAAGA (SEQ ID NO: 16) and the 3'-located sequence TATGGTGGTCTTCTCT (SEQ ID NO: 17). Of course many other sequences that are specific for ureX or Y or the intergenic region are suitable. Standard PCR-textbooks give methods for determining the suitability of the probes for selective PCR-reactions with ureX or ureY. PCR-techniques are extensively described in (Dieffenbach & Dreksler; PCR primers, a laboratory manual. ISBN 0-87969-447-5 ( 1995)).
Another DNA-based test is based upon growth of bacterial material obtained from the swab, followed by classical DNA purification followed by classical hybridisation with radioactively or colour-labelled ureXY-specific DNA-fragments. Given the very low homology between the ureXY-coding regions and the ureAB coding regions of both H.
fells and other Helicobacter species, hybridisation unambiguously indicates the presence or absence of H. fells. Both PCR-reactions and hybridisation reactions are well-known in the art and are i.a. described in Maniatis/Sambrook (Sambrook, J. et al.
Molecular cloning: a laboratory manual. ISBN 0-87969-309-6).
Selective detection with PCR-primers or with classical hybridisation with ureXY-specific DNA-fragments can be done with fragments that preferably are short, but for practical reasons preferably consist of a stretch of at least 10 contiguous nucleotides of SEQ ID
NO: 1. It is clear that for hybridisation experiments a probe needs to be selected that has a higher homology to SEQ ID NO: 1, than to sequences encoding the Helicobacter ureA
or urea subunit. Such a probe can very easily be selected with the help of the Align Plus for Windows program or the Clustal W program as discussed above. In a comparative hybridisation experiment the DNA to be diagnosed can be tested next to e.g. H.
pylori DNA. The probe according to the invention, having a higher homology to SEQ ID
NO: 1 than to a gene encoding ureAB, would bind better to H. fells DNA (if present in the sample) than to DNA of other Helicobacter species thus specifically revealing the presence of H. fells DNA in the sample to be tested. The sequences SEQ ID NO:
16 or 17 mentioned above are merely examples of probes very suitable for labelling and subsequent use in the H. fells-specific hybridisation assays as described.
Thus, one embodiment of the invention relates to a diagnostic test for the detection of DNA encoding the specific Helicobacter UreX and UreY subunit polypeptides.
Such a test comprises a nucleic acid sequence according to the invention or a fragment thereof that is specific for the DNA encoding UreX and UreY or the intergenic region between UreX and UreY. A fragment that is specific for that DNA is a fragment that binds better to the DNA encoding UreX and UreY or the intergenic region between UreX and UreY
than to the DNA encoding UreA and Urea or the intergenic region between UreA and Urea.
Methods for the detection of Helicobacter fells DNA comprise hybridisation of the DNA to be tested with UreX or Y DNA, or PCR-reaction of the DNA to be tested with UreX or Y
DNA specific probes.
A diagnostic test according to 2) for the detection of Helicobacter fells antibodies in sera can be e.g. a simple sandwich-ELISA-test in which purified UreX or UreY
subunit polypeptides or antigenic fragments thereof according to the invention are coated to the wall of the wells of an ELISA-plate. A method for the detection of such antibodies is e.g.
Incubation of purified UreX or Y polypeptide with serum from mammals to be tested, followed by e.g. incubation with a labelled antibody against the relevant mammalian antibody. A colour reaction can then reveal the presence or absence of antibodies against Helicobacter fells urease XY. Depending on the labelled antibodies used, the selectivity of this system can be improved by pre-incubation of the serum to be tested with urease AB followed by spinning down the precipitate, in order to avoid non-XY-specific reactions.
If antigenic fragments of the UreX or UreY structural subunits according to the invention are used for coating, this pre-incubation step can be skipped.
Another example of a diagnostic test system is e.g. the incubation of a Western blot comprising UreX or UreY polypeptide or an antigenic fragment thereof according to the invention, with serum of mammals to be tested, followed by analysis of the blot.
The purified UreX and UreY structural subunits or antigenic fragments thereof according to the invention, suitable for the coating of ELISA plates or for Western blotting can easily be obtained by expression of the ureX and ureY gene as was described by Ferrero for ureA and B (Ferrero et al., Molec. Microbiol. 9, 323-333 (1993)).
Also, the invention relates to methods for the detection in serum of antibodies against Helicobacfer fells antibodies in which the method comprises the incubation of serum with UreX or UreY polypeptide or an antigenic fragment thereof according to the invention.
A diagnostic test according to 3) based upon the detection of antigenic material of the specific UreX and UreY structural subunits of Helicobacter fells antigens and therefore suitable for the detection of Helicobacter fells infection can e.g. also be a standard ELISA test. In one example of such a test the walls of the wells of an ELISA
plate are coated with antibodies directed against the specific UreX and UreY structural subunits of Helicobacter fells. The antigenic material to be tested can if necessary be pre-incubated with antibodies against UreA and B. This will leave the UreX and Y specific epitopes uncovered and therefore the pre-incubated Helicobacter species will bind to the ELISA
plate only if it comprises UreX or Y, i.e. if it specifically is Helicobacter fells.
The use of monoclonal antibodies specific for UreX or Y and not reacting with UreA or B
are the preferred antibodies in such tests, because they make the pre-incubation step superfluous. Such monoclonal antibodies can easily be obtained by immunising inbred mice with immunising fragments of UreX or Y according to the invention, by techniques also known in the art (See below: Kohler and Milstein).
The polypeptides or immunogenic fragments thereof according to the invention expressed as characterised above can be used to produce antibodies, which may be polyclonal, monospecific or monoclonal (or derivatives thereof). If polyclonal antibodies are desired, techniques for producing and processing polyclonal sera are well-known in the art (e.g. Mayer and Walter, eds. lmmunochemical Methods in Cell and Molecular Biology, Academic Press, London, 1987).
Monoclonal antibodies, reactive against the polypeptide according to the invention (or variants or fragments thereof) according to the present invention, can be prepared by immunising inbred mice by techniques also known in the art (Kohler and Milstein, Nature, 256, 495-497, 1975).
Finally, the invention relates to methods for the detection of antigenic material from Helicobacter fells in which the method comprises the incubation of serum, tissue of body fluids with antibodies against UreX or UreY polypeptide or an antigenic fragment thereof according to the invention.
The purified UreX and UreY structural subunits or antigenic fragments thereof according to the invention, suitable for the coating of ELISA plates or for Western blotting can easily be obtained by expression of the ureX and ureY gene as was described by Ferrero for ureA and B (Ferrero et al., Molec. Microbiol. 9, 323-333 (1993)).
Also, the invention relates to methods for the detection in serum of antibodies against Helicobacfer fells antibodies in which the method comprises the incubation of serum with UreX or UreY polypeptide or an antigenic fragment thereof according to the invention.
A diagnostic test according to 3) based upon the detection of antigenic material of the specific UreX and UreY structural subunits of Helicobacter fells antigens and therefore suitable for the detection of Helicobacter fells infection can e.g. also be a standard ELISA test. In one example of such a test the walls of the wells of an ELISA
plate are coated with antibodies directed against the specific UreX and UreY structural subunits of Helicobacter fells. The antigenic material to be tested can if necessary be pre-incubated with antibodies against UreA and B. This will leave the UreX and Y specific epitopes uncovered and therefore the pre-incubated Helicobacter species will bind to the ELISA
plate only if it comprises UreX or Y, i.e. if it specifically is Helicobacter fells.
The use of monoclonal antibodies specific for UreX or Y and not reacting with UreA or B
are the preferred antibodies in such tests, because they make the pre-incubation step superfluous. Such monoclonal antibodies can easily be obtained by immunising inbred mice with immunising fragments of UreX or Y according to the invention, by techniques also known in the art (See below: Kohler and Milstein).
The polypeptides or immunogenic fragments thereof according to the invention expressed as characterised above can be used to produce antibodies, which may be polyclonal, monospecific or monoclonal (or derivatives thereof). If polyclonal antibodies are desired, techniques for producing and processing polyclonal sera are well-known in the art (e.g. Mayer and Walter, eds. lmmunochemical Methods in Cell and Molecular Biology, Academic Press, London, 1987).
Monoclonal antibodies, reactive against the polypeptide according to the invention (or variants or fragments thereof) according to the present invention, can be prepared by immunising inbred mice by techniques also known in the art (Kohler and Milstein, Nature, 256, 495-497, 1975).
Finally, the invention relates to methods for the detection of antigenic material from Helicobacter fells in which the method comprises the incubation of serum, tissue of body fluids with antibodies against UreX or UreY polypeptide or an antigenic fragment thereof according to the invention.
Examale 1 The ureX and ureY genes of Helicobacter fells strain CS1: cloning and expression in Escherichia coli.
The ureX and ureY genes of H. fells strain CS1 were cloned as an operon into an E. coli T7 expression vector, pET3a, as follows:
For proper expression of the UreX and Y proteins in pET3a (Novagen, 601 Science Drive, Madison WI, USA) the genes were cloned as a Ndel-BamHl DNA fragment into the Ndel-BamHl sites of this vector. The ureaseXYoperon contains an internal Ndel site that was mutated by overlap-extension PCR of 2 PCR fragments. For that purpose two PCR fragments (the 5' and the 3' products) were amplified using chromosomal DNA of H. fells CS1 as the template. The 5' PCR product contained the complete ureX
gene and the first part of the ureY gene. The forward primer contained a Ndel restriction site and the start codon of ureX (GGAGTAACATATGAAACTCACA CCCAAAGAGC) (SEQ ID
NO: 18), and the reverse primer contains a point mutation (CACACCC
ACGACCATGTGAGGGCTTAC) (SEQ ID NO: 19). The second, 3' PCR product consisted of the 3' end of the ureY gene. This forward primer is complementary to the reverse primer of the first PCR product and also contained the same point mutation (GTAAGCC CTCACATGGTCGTGGGTGTG) (SEQ ID NO: 20), and the reverse primer contained a BamHl restriction site just downstream of the stopcodon of the ureY gene (CGAATT CGGATCCTAGAAGAAAGTGTAGCGCTGG) (SEQ ID NO: 21 ). The mutation in the complementary primers is made to delete the internal Ndel site in ureY, it replaces the CATATG (His- Met) by CACATG (His-Met).
After amplification of both PCR products, the complete operon was obtained by overlap-extension-PCR with the forward primer of the ureX and the reverse primer of the ureY
using both PCR products as templates. The resulting PCR product was cloned into PCR-bluntll-TOPO (Invitrogen, P.O.Box 2312, 9704 CH Groningen, The Netherlands) and transformed into E. coli TOP10F' cells (Invitrogen). Positive clones were isolated and the ureaseXY genes were sub-cloned into pET3a with Ndel-8amHl. The obtained plasmid was called pUreXY-1 and was transformed into the expression strain HMS174(DE3)/pLysS (Novagen).
The ureX and ureY genes of pUreXY-1 were expressed in HMS174(DE3)/pLysS as follows: an overnight culture was diluted 1/100 into TB Amp'°°
Cam25; this culture was incubated for 3 h at 37°C at 200 rpm; the culture was induced by adding 1 mM of IPTG
and incubated for another 3 h at 37°C at 200 rpm. The induction was done twice, once in a small scale and once in a large scale.
The induced samples were analysed on a SDS-PAGE gel (fig. 2). As can be clearly seen from lane 9, expression of UreX and UreY, when induced provides the two structural subunits as polypeptide bands with a molecular weight of 25 kDa for the UreX
subunit and 62 kDa for the UreY subunit.
The ureX and ureY genes of H. fells strain CS1 were cloned as an operon into an E. coli T7 expression vector, pET3a, as follows:
For proper expression of the UreX and Y proteins in pET3a (Novagen, 601 Science Drive, Madison WI, USA) the genes were cloned as a Ndel-BamHl DNA fragment into the Ndel-BamHl sites of this vector. The ureaseXYoperon contains an internal Ndel site that was mutated by overlap-extension PCR of 2 PCR fragments. For that purpose two PCR fragments (the 5' and the 3' products) were amplified using chromosomal DNA of H. fells CS1 as the template. The 5' PCR product contained the complete ureX
gene and the first part of the ureY gene. The forward primer contained a Ndel restriction site and the start codon of ureX (GGAGTAACATATGAAACTCACA CCCAAAGAGC) (SEQ ID
NO: 18), and the reverse primer contains a point mutation (CACACCC
ACGACCATGTGAGGGCTTAC) (SEQ ID NO: 19). The second, 3' PCR product consisted of the 3' end of the ureY gene. This forward primer is complementary to the reverse primer of the first PCR product and also contained the same point mutation (GTAAGCC CTCACATGGTCGTGGGTGTG) (SEQ ID NO: 20), and the reverse primer contained a BamHl restriction site just downstream of the stopcodon of the ureY gene (CGAATT CGGATCCTAGAAGAAAGTGTAGCGCTGG) (SEQ ID NO: 21 ). The mutation in the complementary primers is made to delete the internal Ndel site in ureY, it replaces the CATATG (His- Met) by CACATG (His-Met).
After amplification of both PCR products, the complete operon was obtained by overlap-extension-PCR with the forward primer of the ureX and the reverse primer of the ureY
using both PCR products as templates. The resulting PCR product was cloned into PCR-bluntll-TOPO (Invitrogen, P.O.Box 2312, 9704 CH Groningen, The Netherlands) and transformed into E. coli TOP10F' cells (Invitrogen). Positive clones were isolated and the ureaseXY genes were sub-cloned into pET3a with Ndel-8amHl. The obtained plasmid was called pUreXY-1 and was transformed into the expression strain HMS174(DE3)/pLysS (Novagen).
The ureX and ureY genes of pUreXY-1 were expressed in HMS174(DE3)/pLysS as follows: an overnight culture was diluted 1/100 into TB Amp'°°
Cam25; this culture was incubated for 3 h at 37°C at 200 rpm; the culture was induced by adding 1 mM of IPTG
and incubated for another 3 h at 37°C at 200 rpm. The induction was done twice, once in a small scale and once in a large scale.
The induced samples were analysed on a SDS-PAGE gel (fig. 2). As can be clearly seen from lane 9, expression of UreX and UreY, when induced provides the two structural subunits as polypeptide bands with a molecular weight of 25 kDa for the UreX
subunit and 62 kDa for the UreY subunit.
Legend to the figures Figure 1a: Comparison of the nucleic acid sequence encoding UreX and Y, including a short non-coding region bridging the two coding sequences, from Helicobacter fells species CS1, Kukka, Ds4, 2301 and 390 with the nucleic acid sequence encoding UreA
and B, including a short non-coding region bridging the two coding sequences, from Helicobacter fells, pylori and heilmannii Figure 1 b: Comparison of the amino acid sequence of UreX from Helicobacter fells species CS1, Kukka, Ds4, 2301 and 390 with the amino acid sequence encoding UreA
from Helicobacter fells, pylori and heilmannii Figure 1 c: Comparison of the amino acid sequence of UreY from Helicobacter fells species CS1, Kukka, Ds4, 2301 and 390 with the amino acid sequence encoding Urea from Helicobacter fells, pylori and heilmannii Figure 2: Polyacrylamide gel of the expression products UreX and UreY
Lane 7 : Biorad broad range marker Lane 8 : Complete cell culture before induction (small scale culture) Lane 9 : Complete cell culture after induction (small scale culture) Lane 10 : Complete cell culture after induction (large scale culture) Lane 11 : Supernatant after induction (large scale culture).
Lane 12 : Biorad pre-stained marker SEQUENCE LISTING
(1) GENERAL INFORMATION:
(i) APPLICANT: AKZO NOBEL N.V.
(ii} TITLE OF INVENTION: HELICOBACTER FELIS VACCINE
(iii} NUMBER OF SEQUENCES: 21 (iv} CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: FETHERSTONHAUGH & CO.
10 (B) STREET: P.O. BOX 2999, STATION D
(C) CITY: OTTAWA
(D) STATE: ONT
(E) COUNTRY: CANADA
(F) ZIP: K1P 5Y6 (v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk (B) COMPUTER: IBM PC compatible (C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: ASCII (text) (vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: CA
(B) FILING DATE: 03-JUL-2001 (C) CLASSIFICATION:
(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER:
(B) FILING DATE:
(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: FETHERSTONHAUGH & C0.
(B) REGISTRATION NUMBER:
(C) REFERENCE/DOCKET NUMBER: 23804-613 (ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (613)-235-4373 {B) TELEFAX: (613)-232-8440 (2} INFORMATION FOR SEQ ID NO.: 1:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 2883 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter felis (ix) FEATURE
(A) NAME/KEY: CDS
(B) LOCATION: (206) . . (886) (ix) FEATURE
(A) NAME/KEY: CDS
(B) LOCATION: (897}..(2603}
(xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 1:
ACTTGTTAAT RCTATTATTA ATTTTTTAAT AATTACTTAT TATCA'rATAT AATAATATTA 120 TTACTTATAT TAAAAAGTTA ATAAAAAGTA ACGAAATTAG GACTA'rAATC CCATTGCCTT 180 TAAAATTTAA CACAAGGAGT AATAG GTG AAA CTC ACA CCC Ai'-~A GAG CAA GAA 232 VaI Lys Leu Thr Pro Lys Glu Gln Glu AAG TTC TTG TTA TAT TAT GCG GGC GAA GTG GCT AGA Ai~G CGC AAA GCA 280 Lys Phe Leu Leu Tyr Tyr AIa Gly Glu Val Ala Arg L~~s Arg Lys Ala GAG GGC TTA AAG CTC AAC CAA CCC GAA GCC ATT GCT Ti~C ATT AGT GCC 328 Glu Gly Leu Lys Leu Asn Gln Pro Glu Ala Ile Ala T~~r Ile Ser Ala CAT ATT ATG GAC GAA GCG CGC CGT GGA AAA AAA ACC G'rT GCC CAG CTT 376 10 His Ile Met Asp Glu Ala Arg Arg Gly Lys Lys Thr Val Ala Gln Leu ATG GAA GAG TGC ATG CAC TTT TTG AAA AAA GAT GAA G'rA ATG CCC GGG 424 Met Glu Glu Cys Met His Phe Leu Lys Lys Asp Glu Val Met Pro Gly 60 65 '70 Val Gly Asn Met Val Pro Asp Leu Gly Val Glu Ala Tlar Phe Pro Asp Gly Thr Lys Leu Val Thr Val Asn Trp Pro I1e Glu P:ro Asp Glu His Phe Lys Ala Gly Glu Val Lys Phe Gly Cys Asp Lys Aap Ile Glu Leu AAT GCA GGC AAA GAA GTA ACC GAA CTT GAG GTT ACT Ai~T GAA GGG CCT 616 Asn Ala Gly Lys Glu Val Thr Glu Leu Glu Val Thr A:an Glu Gly Pro AAA TCC TTG CAT GTG GGT AGC CAT TTC CAC TTC TTT Gi~A GCT AAC AAG 664 Lys Ser Leu His Val Gly Ser His Phe His Phe Phe G.Lu Ala Asn Lys 140 145 1!50 GCA CTA AAA TTC GAT CGT GAA AAA GCC TAT GGC AAA C(3C CTA GAT ATT 712 Ala Leu Lys Phe Asp Arg Glu Lys Ala Tyr Gly Lys Arg Leu Asp Ile Pro Ser Gly Asn Thr Leu Arg Ile Gly Ala Gly Gln Thr Arg Lys Val CAG TTG ATT CCT CTT GGT GGC AGT AAA AAA GTG ATT G(zC ATG AAC GGG 808 Gln Leu Ile Pro Leu Gly Gly Ser Lys Lys Val Ile Gly Met Asn Gly CTT GTG AAT AAC ATC GCG GAT GAA CGC CAT AAA CAT AAP. GCG CTT GAC 856 Leu Val Asn Asn Ile Ala Asp Glu Arg His Lys His L~rs Ala Leu Asp Lys Ala Lys Ser His Gly Phe Ile Lys Met Lys Met AAA AAA CAA GAA TAT GTA AAT ACC TAC GGA CCC ACC A~~.A GGC GAT AAA 953 Lys Lys Gln Glu Tyr Val Asn Thr Tyr Gly Pro Thr Lys Gly Asp Lys s GTG CGC TTA GGA GAT ACC GAT CTT TGG GCA GAA GTA Gi~A CAT GAC TAT 1001 Val Arg Leu Gly Asp Thr Asp Leu Trp Ala Glu Val G:Lu His Asp Tyr Thr Thr Tyr Gly Glu Glu Leu Lys Phe Gly Ala Gly L~~rs Thr Ile Arg 265 270 2'75 Glu Gly Met Gly Gln Ser Asn Ser Pro Asp Glu Asn Tlzr Leu Asp Leu GTC ATC ACT AAC GCG ATG ATT ATC GAC TAC ACC GGG A'L'T TAC AAA GCC 1145 Val Ile Thr Asn Ala Met Ile Ile Asp Tyr Thr Gly I:Le Tyr Lys Ala GAC ATT GGG ATT AAA AAC GGC AAA ATC CAT GGC ATT G(3C AAG GCA GGA 1193 Asp IIe Gly Ile Lys Asn Gly Lys Ile His Gly Ile G:Ly Lys Ala Gly AAC AAG GAC ATG CAA GAT GGC GTA AGC CCT CAT ATG G'CC GTG GGT GTG 1241 Asn Lys Asp Met Gln Asp GIy Val Ser Pro His Met V<~1 Val Gly VaI
GIy Thr Glu Ala Leu Ala Gly Glu Gly Met Ile Ile Thr Ala Gly Gly ATC GAT TCA CAC ACC CAC TTC CTT TCT CCA CAA CAA T'CC CCT ACC GCT 1337 Ile Asp Ser His Thr His Phe Leu Ser Pro Gln Gln Phe Pro Thr Ala Leu Ala Asn Gly Val Thr Thr Met Phe Gly Gly Gly Thr Gly Pro Val Asp Gly Thr Asn Ala Thr Thr Ile Thr Pro Gly Lys Trp Asn Leu His CGC ATG TTG CGC GCA GCA GAA GAG TAT TCT ATG AAT G'.CG GGC TTT TTG 1481 Arg Met Leu Arg Ala Ala Glu Glu Tyr Ser Met Asn Val Gly Phe Leu Gly Lys Gly Asn Ser Ser Ser Lys Lys Gln Leu Val G7Lu Gln Val Glu GCG GGC GCG ATT GGT TTT AAA TTG CAT GAA GAC TGG GCiC ACA ACA CCA 1577 Ala Gly Ala IIe Gly Phe Lys Leu His GIu Asp Trp G7_y Thr Thr Pro Ser Ala Ile Asp His Cys Leu Ser Val Ala Asp Glu Tyr Asp VaI Gln Val Cys Ile His Thr Asp Thr Val Asn Glu Ala Gly Tyr Val Asp Asp ' F
ACC CTA AAT GCA ATG AAC GGG CGC GCC ATC CAT GCC T.AC CAC ATT GAG 1721 Thr Leu Asn Ala Met Asn Gly Arg Ala Ile His Ala Tyr His Ile Glu GGA GCG GGT GGA GGA CAC TCA CCT GAT GTT ATC ACC A'rG GCA GGC GAG 1769 Gly Ala Gly Gly Gly His Ser Pro Asp Val Ile Thr Met Ala Gly Glu Leu Asn Ile Leu Pro Ser Ser Thr Thr Pro Thr Ile Pro Tyr Thr Ile Asn Thr Val Ala Glu His Leu Asp Met Leu Met Thr Cys His His Leu Asp Lys Arg IIe Arg Glu Asp Leu Gln Phe Ser Gln S~er Arg Ile Arg CCC GGC TCT ATC GCG GCT GAA GAT GTG CTC CAT GAT A'rG GGT GTG ATC 1961 Pro Gly Ser Ile Ala Ala Glu Asp Val Leu His Asp Met Gly Val Ile GCG ATG ACA AGC TCG GAT TCG CAA GCA ATG GGG CGT G~~A GGC GAA GTG 2009 Ala Met Thr Ser Ser Asp Ser Gln Ala Met Gly Arg Ala Gly Glu Val ATT CCT CGA ACT TGG CAG ACT GCG GAT AAG AAT AAA A;?~A GAA TTT GGT 2057 Ile Pro Arg Thr Trp Gln Thr Ala Asp Lys Asn Lys Lys Glu Phe Gly Lys Leu Pro Glu Asp Gly Lys Asp Asn Asp Asn Phe A:rg Ile Lys Arg Tyr Ile Ser Lys Tyr Thr Ile Asn Pro Ala Leu Thr H.is Gly Val Ser GIu Tyr Ile Gly Ser Val Glu Glu Gly Lys Ile Ala Asp Leu Val Val TGG AAT CCT GCC TTT TTT GGC GTA AAA CCC AAA ATC G'rG ATC AAA GGC 2249 Trp Asn Pro Ala Phe Phe Gly Val Lys Pro Lys Ile Va 1 Ile Lys Gly 665 670 6'75 Gly Met Val Val Phe Ser Glu Met Gly Asp Ser Asn A:la Ser Val Pro ACT CCC CAA CCG GTT TAT TAC CGC GAA ATG TTT GGG Ci~T CAC GGC AAG 2345 Thr Pro Gln Pro Val Tyr Tyr Arg Glu Met Phe Gly H:is His GIy Lys GCG AAA TTT GAC ACC AGC ATC ACT TTT GTT TCC AAA G'CC GCC TAT GAA 2393 Ala Lys Phe Asp Thr Ser Ile Thr Phe Val Ser Lys Val Ala Tyr Glu a s t Asn Gly Val Lys Glu Lys Leu Gly Leu Glu Arg Gln Val Leu Pro Val Lys Asn Cys Arg Asn Ile Thr Lys Lys Asp Phe Lys Phe Asn Asp Lys Thr Ala Lys Ile Thr Val Asp Pro Lys Thr Phe Glu Val Phe Val Asp Gly Lys Leu Cys Thr Ser Lys Pro Thr Ser Gln Val Pro Leu Ala Gln Arg Tyr Thr Phe Phe (2) INFORMATION FOR SEQ ID NO.: 2:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 226 (B) TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter felis (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 2:
Val Lys Leu Thr Pro Lys Glu Gln Glu Lys Phe Leu Leu Tyr Tyr Ala Gly Glu Val Ala Arg Lys Arg Lys Ala Glu Gly Leu Lys Leu Asn Gln Pro Glu Ala Ile Ala Tyr Ile Ser Ala His Ile Met Asp Glu Ala Arg 35 40 ~45 Arg Gly Lys Lys Thr Val Ala Gln Leu Met Glu Glu Cys Met His Phe 50 Leu Lys Lys Asp Glu Val Met Pro Gly Val Gly Asn Met Val Pro Asp Leu Gly Val Glu Ala Thr Phe Pro Asp Gly Thr Lys Leu Val Thr Val Asn Trp Pro IIe Glu Pro Asp Glu His Phe Lys Ala G:Ly Glu Val Lys Phe Gly Cys Asp Lys Asp Ile Glu Leu Asn Ala Gly L~~rs Glu Val Thr Glu Leu Glu Val Thr Asn Glu Gly Pro Lys Ser Leu His Val Gly Ser His Phe His Phe Phe Glu Ala Asn Lys Ala Leu Lys P:he Asp Arg Glu Lys Ala Tyr Gly Lys Arg Leu Asp Ile Pro Ser Gly Asn Thr Leu Arg 10 Ile Gly Ala Gly Gln Thr Arg Lys Val Gln Leu Ile Pro Leu Gly Gly Ser Lys Lys Val Ile Gly Met Asn Gly Leu Val Asn Asn Ile Ala Asp Glu Arg His Lys His Lys Ala Leu Asp Lys Ala Lys Ser His Gly Phe Ile Lys (2) INFORMATION FOR SEQ ID NO.: 3:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 568 (B) TYPE: amino acid (C) STRANDEDNESS:
fD) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter felis (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 3:
Met Lys Met Lys Lys Gln Glu Tyr Val Asn Thr Tyr Gly Pro Thr Lys Gly Asp Lys Val Arg Leu Gly Asp Thr Asp Leu Trp Ala Glu Val Glu His Asp Tyr Thr Thr Tyr Gly Glu Glu Leu Lys Phe Gly Ala Gly Lys Thr Ile Arg Glu Gly Met Gly Gln Ser Asn Ser Pro Asp Glu Asn Thr Leu Asp Leu Val Ile Thr Asn Ala Met Ile Ile Asp T~yr Thr Gly Ile Tyr Lys Ala Asp Ile Gly Ile Lys Asn Gly Lys Ile H.is Gly Ile Gly Lys Ala Gly Asn Lys Asp Met Gln Asp Gly Val Ser P:ro His Met Val Val Gly Val Gly Thr Glu Ala Leu Ala Gly Glu Gly Me t Ile Ile Thr 115 12 0 l:Z 5 Ala Gly Gly Ile Asp Ser His Thr His Phe Leu Ser P:ro Gln Gln Phe Pro Thr Ala Leu Ala Asn Gly Val Thr Thr Met Phe G:ly Gly Gly Thr Gly Pro Val Asp Gly Thr Asn Ala Thr Thr Ile Thr P:ro Gly Lys Trp Asn Leu His Arg Met Leu Arg Ala Ala Glu Glu Tyr S~'r Met Asn Val Gly Phe Leu Gly Lys Gly Asn Ser Ser Ser Lys Lys G:ln Leu Val Glu 195 200 2~D5 Gln Val Glu Ala Gly Ala Ile Gly Phe Lys Leu His G:lu Asp Trp Gly Thr Thr Pro Ser Ala Ile Asp His Cys Leu Ser Val A:la Asp Glu Tyr Asp Val GIn Val Cys Ile His Thr Asp Thr Val Asn G:lu Ala Gly Tyr Val Asp Asp Thr Leu Asn Ala Met Asn Gly Arg Ala I:le His Ala Tyr His Ile GIu Gly Ala Gly Gly Gly His Ser Pro Asp V;al Ile Thr Met 275 280 2.85 Ala Gly Glu Leu Asn Ile Leu Pro Ser Ser Thr Thr P:ro Thr Ile Pro Tyr Thr Ile Asn Thr Val Ala Glu His Leu Asp Met L.°u Met Thr Cys His His Leu Asp Lys Arg Ile Arg Glu Asp Leu Gln Plhe Ser Gln Ser Arg Ile Arg Pro Gly Ser Ile Ala Ala Glu Asp Val Leu His Asp Met Gly Val Ile Ala Met Thr Ser Ser Asp Ser Gln Ala Me°t Gly Arg Ala Gly Glu Val Ile Pro Arg Thr Trp Gln Thr Ala Asp Lys Asn Lys Lys Glu Phe Gly Lys Leu Pro Glu Asp Gly Lys Asp Asn Aap Asn Phe Arg Ile Lys Arg Tyr Ile Ser Lys Tyr Thr Ile Asn Pro A:La Leu Thr His Gly Val Ser Glu Tyr Ile GIy Ser Val Glu Glu Gly Lys Ile Ala Asp Leu Val Val Trp Asn Pro Ala Phe Phe Gly VaI Lys Pogo Lys Ile Val Ile Lys Gly Gly Met Val Val Phe Ser Glu Met Gly Asp Ser Asn Ala Ser Val Pro Thr Pro Gln Pro Val Tyr Tyr Arg Glu Met Phe Gly His His Gly Lys Ala Lys Phe Asp Thr Ser Ile Thr Phe Val Ser Lys Val Ala Tyr Glu Asn Gly Val Lys Glu Lys Leu Gly Leu Glu Arg Gln Val Leu Pro VaI Lys Asn Cys Arg Asn Ile Thr Lys Lys Asp Phe Lys Phe Asn Asp Lys Thr Ala Lys Ile Thr Val Asp Pro Lys Thr Phe Glu Val Phe Val Asp Gly Lys Leu Cys Thr Ser Lys Pro Thr Ser Gln Val Pro Leu Ala Gln Arg Tyr Thr Phe Phe (2) INFORMATION FOR SEQ ID NO.: 4:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 2405 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter felis (ix) FEATURE
(A) NAME/KEY: CDS
(B) LOCATION: (1)..(681) (ix) FEATURE
(A) NAME/KEY: CDS
(B) LOCATION: (692)..(2398) (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 4:
Val Lys Leu Thr Pro Lys Glu Gln Glu Lys Phe Leu Leu Tyr Tyr Ala Gly Glu Val Ala Arg Lys Arg Lys Ala Glu Gly Leu Lys Leu Asn Gln CCC GAA GCC ATT GCC TAC ATT AGT GCC CAT ATT ATG G.AC GAG GCG CGC 144 Pro Glu Ala Ile Ala Tyr Ile Ser Ala His Ile Met Asp Glu Ala Arg Arg Gly Lys Lys Thr Val Ala Glu Leu Met Glu Glu Cys Met His Phe TTG AAA AAA GAT GAG GTG ATG CCC GGT GTG GGG AAT A'rG GTC CCT GAT 240 Leu Lys Lys Asp Glu Val Met Pro Gly Val Gly Asn Met Val Pro Asp TTG GGC GTA GAA GCC ACT TTC CCC GAT GGC ACC AAA C'TC GTA ACC GTG 288 Leu Gly Val Glu Ala Thr Phe Pro Asp Gly Thr Lys Leu Val Thr Val AAT TGG CCC ATT GAA CCT GAT GAA CAC TTT AAA GCC G'~GT GAA GTG AAA 336 Asn Trp Pro Ile Glu Pro Asp Glu His Phe Lys Ala Gly Glu Val Lys TTT GGC TGT GAT AAA GAC ATT GAG CTC AAC GCG GGT A,AG GAA GTT ACC 384 Phe Gly Cys Asp Lys Asp Ile Glu Leu Asn Ala Gly L~ys Glu Val Thr GAG CTT GAA GTT ACC AAC GAA GGA CCT AAA TCC TTG C'AT GTG GGT AGC 432 Glu Leu Glu Val Thr Asn Glu Gly Pro Lys Ser Leu F3:is Val Gly Ser His Phe His Phe Phe Glu Thr Asn Lys Ala Leu Lys F'he Asp Arg Glu AAA GCC TAT GGC AAA CGC CTA GAT ATT CCC TCT GGC pAC ACG CTA CGC 528 Lys Ala Tyr Gly Lys Arg Leu Asp Ile Pro Ser Gly A.sn Thr Leu Arg ATT GGG GCA GGA CAA ACC CGT AAA GTG CAG TTA ATC C'CT CTT GGC GGT 576 Ile Gly Ala GIy Gln Thr Arg Lys Val Gln Leu Ile Fro Leu Gly Gly AGT AAA AAA GTG ATT GGC ATG AAC GGG CTT GTG AAT A,AT ATT GCG GAC 624 Ser Lys Lys Val Ile Gly Met Asn Gly Leu Val Asn A.sn Ile Ala Asp Glu Arg His Lys His Lys Ala Leu Asp Lys Ala Lys Ser His Gly Phe ATC AAG TAA GGAGACTCCC ATG AAA ATG AAA AAA CAA GA.G TAT GTA AAC 721 Ile Lys Met Lys Met Lys Lys Gln Glu Tyr Val Asn ACC TAC GGA CCC ACC ACA GGC GAT AAA GTG CGC TTA G'~GA GAT ACC GAT 769 Thr Tyr Gly Pro Thr Thr Gly Asp Lys Val Arg Leu G!ly Asp Thr Asp Leu Trp Ala Glu Val Glu His Asp Tyr Thr Thr Tyr Gly Glu Glu Leu Lys Phe Gly Ala Gly Lys Thr Ile Arg Glu Gly Met Gly Gln Ser Asn Ser Pro Asp Glu Asn Thr Leu Asp Leu Val Ile Thr A.sn Ala Met Ile Ile Asp Tyr Thr Gly Ile Tyr Lys Ala Asp Ile Gly Ile Lys Asn Gly a a t P
Lys Ile His Gly Ile Gly Lys Ala Gly Asn Lys Asp Met Gln Asp Gly Val Ser Pro His Met Val Val Gly Val Gly Thr Glu Ala Leu Ala Gly Glu Gly Met Ile Ile Thr Ala Gly Gly Ile Asp Ser H:is Thr His Phe Leu Ser Pro Gln Gln Phe Pro Thr Ala Leu Ala Asn Gly Val Thr Thr Met Phe Gly Gly Gly Thr Gly Pro Val Asp Gly Thr A.sn Ala Thr Thr Ile Thr Pro Gly Lys Trp Asn Leu His Arg Met Leu A.rg Ala Ala Glu Glu Tyr Ser Met Asn Val Gly Phe Leu Gly Lys Gly A.sn Ser Ser Ser AAA AAA CAA CTC GTA GAA CAA GTA GAA GCG GGC GCG A.TT GGC TTT AAA 1345 Lys Lys Gln Leu Val Glu Gln Val Glu Ala Gly Ala Ile Gly Phe Lys Leu His Glu Asp Trp Gly Thr Thr Pro Ser Ala Ile A.sp His Cys Leu Ser Val Ala Asp Glu Tyr Asp Val Gln Val Cys Ile His Thr Asp Thr Val Asn Glu Ala Gly Tyr Val Asp Asp Thr Leu Asn Ala Met Asn Gly Arg Ala Ile His Ala Tyr His Ile Glu Gly Ala Gly Gly Gly His Ser Pro Asp Val Ile Thr Met Ala Gly Glu Leu Asn Ile Leu Pro Ser Ser Thr Thr Pro Thr Ile Pro Tyr Thr Ile Asn Thr Val Ala Glu His Leu Asp Met Leu Met Thr Cys His His Leu Asp Lys Arg Ile Arg Glu Asp CTC CAG TTT TCC CAA AGC CGT ATC CGC CCC GGC TCT A'L'T GCC GCT GAA 1729 Leu Gln Phe Ser Gln Ser Arg Ile Arg Pro Gly Ser I:Le Ala AIa Glu 560 565 5'70 GAT GTG CTC CAT GAT ATT GGC GTG ATC GCG ATG ACA A(3C TCG GAT TCG 1777 Asp Val Leu His Asp Ile Gly Val Ile Ala Met Thr Ser Ser Asp Ser 10 Gln Ala Met Gly Arg Ala Gly Glu Val Ile Pro Arg Tllr Trp Gln Thr GCA GAC AAG AAT AAA AAA GAA TTT GGT AAG CTT CCT Gi'~A GAT GGT GCA 1873 AIa Asp Lys Asn Lys Lys Glu Phe Gly Lys Leu Pro G:Lu Asp Gly Ala GAT AAT GAC AAC TTC CGC ATC AAA CGC TAT ATC TCC Ai'~A TAC ACC ATT 1921 Asp Asn Asp Asn Phe Arg Ile Lys Arg Tyr Ile Ser L~~s Tyr Thr Ile Asn Pro Ala Leu Thr His Gly Val Ser Glu Tyr Ile G.Ly Ser Val Glu Glu Gly Lys Ile Ala Asp Leu Val Val Trp Asn Pro A:La Phe Phe Gly GTA AAA CCC AAA ATC GTG ATC AAA GGC GGT ATG GTG G'rG TTC TCT GAA 2065 Val Lys Pro Lys Ile Val Ile Lys Gly Gly Met Val VaI Phe Ser Glu Met Gly Asp Ser Asn Ala Ser Val Pro Thr Pro Gln P:ro Val Tyr Tyr CGC GAA ATG TTT GGG CAT CAC GGC AAG GCG AAA TTT Gi3C ACC AGC ATC 2161 Arg Glu Met Phe Gly His His Gly Lys Ala Lys Phe Asp Thr Ser Ile Thr Phe Val Ser Lys Val Ala Tyr Glu Asn Gly Val Lys Glu Lys Leu 720 725 7:30 GGC TTA GAG CGC AAG GTG CTA CCC GTG AAA AAC TGC C(3C AAC ATC ACT 2257 Gly Leu Glu Arg Lys Val Leu Pro Val Lys Asn Cys Arg Asn Ile Thr AAG AAA GAC TTC AAA TTC AAC AAC AAG ACG GCG CAT A'.CC ACT GTC GAT 2305 Lys Lys Asp Phe Lys Phe Asn Asn Lys Thr Ala His I:Le Thr Val Asp CCT AAA ACC TTC GAG GTC TTT GTA GAT GGC AAA CTC T(3C ACC TCT AAA 2353 Pro Lys Thr Phe Glu Val Phe Val Asp Gly Lys Leu C;rs Thr Ser Lys Pro Ala Ser Glu Val Pro Leu Ala Gln Arg Tyr Thr Phe Phe (2) INFORMATION FOR SEQ ID NO.: 5:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 226 (B) TYPE: amino acid (C} STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter felis (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 5:
Val Lys Leu Thr Pro Lys Glu Gln Glu Lys Phe Leu Leu Tyr Tyr Ala Gly Glu Val Ala Arg Lys Arg Lys Ala Glu Gly Leu Lys Leu Asn Gln Pro Glu Ala Ile Ala Tyr Ile Ser Ala His Ile Met A.sp Glu Ala Arg Arg Gly Lys Lys Thr Val Ala Glu Leu Met Glu Glu Cys Met His Phe Leu Lys Lys Asp Glu Val Met Pro Gly Val Gly Asn Met Val Pro Asp Leu G1y Val Glu Ala Thr Phe Pro Asp Gly Thr Lys Leu Val Thr Val Asn Trp Pro Ile Glu Pro Asp Glu His Phe Lys Ala Gly Glu Val Lys Phe Gly Cys Asp Lys Asp Ile Glu Leu Asn Ala Gly Lys Glu Val Thr Glu Leu Glu Val Thr Asn Glu Gly Pro Lys Ser Leu His Val Gly Ser His Phe His Phe Phe Glu Thr Asn Lys Ala Leu Lys Phe Asp Arg Glu Lys Ala Tyr Gly Lys Arg Leu Asp Ile Pro Ser Gly Asn Thr Leu Arg Ile Gly Ala Gly Gln Thr Arg Lys Val Gln Leu Ile Pro Leu Gly Gly Ser Lys Lys Val Ile Gly Met Asn Gly Leu Val Asn Asn Ile Ala Asp Glu Arg His Lys His Lys Ala Leu Asp Lys Ala Lys Ser His Gly Phe Ile Lys (2) INFORMATION FOR SEQ ID NO.: 6:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 568 (B) TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter felis (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 6:
Met Lys Met Lys Lys Gln Glu Tyr Val Asn Thr Tyr Gly Pro Thr Thr GIy Asp Lys Val Arg Leu Gly Asp Thr Asp Leu Trp A.la Glu Val Glu His Asp Tyr Thr Thr Tyr Gly Glu Glu Leu Lys Phe Gly Ala Gly Lys Thr Ile Arg Glu Gly Met Gly Gln Ser Asn Ser Pro A.sp Glu Asn Thr Leu Asp Leu Val Ile Thr Asn Ala Met Ile Ile Asp T'yr Thr Gly Ile Tyr Lys Ala Asp Ile Gly Ile Lys Asn Gly Lys Ile H:is Gly Ile Gly Lys AIa Gly Asn Lys Asp Met Gln Asp Gly Val Ser Pro His Met Val Val Gly Val Gly Thr Glu Ala Leu Ala Gly Glu Gly Met Ile Ile Thr Ala Gly Gly Ile Asp Ser His Thr His Phe Leu Ser Pro Gln Gln Phe Pro Thr Ala Leu Ala Asn Gly Val Thr Thr Met Phe G'~ly Gly Gly Thr Gly Pro Val Asp Gly Thr Asn Ala Thr Thr Ile Thr Pro Gly Lys Trp Asn Leu His Arg Met Leu Arg Ala Ala Glu GIu Tyr Ser Met Asn Val Gly Phe Leu Gly Lys Gly Asn Ser Ser Ser Lys Lys Gln Leu Val Glu Gln Val Glu Ala Gly Ala Ile Gly Phe Lys Leu His Glu Asp Trp Gly Thr Thr Pro Ser Ala Ile Asp His Cys Leu Ser Val A.la Asp Glu Tyr Asp VaI Gln Val Cys Ile His Thr Asp Thr Val Asn Glu Ala Gly Tyr Val Asp Asp Thr Leu Asn Ala Met Asn Gly Arg Ala Ile His Ala Tyr 260 265 2?0 His Ile Glu Gly Ala Gly Gly Gly His Ser Pro Asp Val Ile Thr Met Ala Gly Glu Leu Asn Ile Leu Pro Ser Ser Thr Thr Pro Thr Ile Pro Tyr Thr IIe Asn Thr Val Ala Glu His Leu Asp Met Leu Met Thr Cys His His Leu Asp Lys Arg Ile Arg Glu Asp Leu Gln Phe Ser Gln Ser Arg Ile Arg Pro Gly Ser Ile Ala Ala Glu Asp Val Leu His Asp Ile Gly Val Ile Ala Met Thr Ser Ser Asp Ser Gln Ala Met Gly Arg Ala Gly Glu Val Ile Pro Arg Thr Trp Gln Thr Ala Asp Lys Asn Lys Lys Glu Phe Gly Lys Leu Pro Glu Asp Gly Ala Asp Asn A.sp Asn Phe Arg Ile Lys Arg Tyr Ile Ser Lys Tyr Thr Ile Asn Pro A.la Leu Thr His Gly Val Ser Glu Tyr IIe Gly Ser Val Glu Glu Gly Lays Ile Ala Asp Leu Val Val Trp Asn Pro Ala Phe Phe Gly Val Lys Pro Lys Ile Val Ile Lys Gly Gly Met Val Val Phe Ser Glu Met Gly A.sp Ser Asn Ala Ser Val Pro Thr Pro Gln Pro Val Tyr Tyr Arg Glu Miet Phe Gly His His Gly Lys Ala Lys Phe Asp Thr Ser Ile Thr Phe V'al Ser Lys Val Ala Tyr Glu Asn Gly Val Lys Glu Lys Leu Gly Leu Glu Arg Lys Val Leu Pro Val Lys Asn Cys Arg Asn Ile Thr Lys Lys A.sp Phe Lys Phe Asn Asn Lys Thr Ala His Ile Thr Val Asp Pro Lys Thr Phe Glu Val Phe Val Asp Gly Lys Leu Cys Thr Ser Lys Pro Ala Ser Glu Val Pro Leu Ala Gln Arg Tyr Thr Phe Phe (2) INFORMATION FOR SEQ ID NO.: 7:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 2183 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter felis (ix) FEATURE
(A) NAME/KEY: CDS
(B) LOCATION: (3)..(683) (ix} FEATURE
(A} NAME/KEY: CDS
(B) LOCATION: (694)..(2181) (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 7:
Val Lys Leu Thr Pro Lys Glu Gln Glu Lys Phe Leu Leu Tyr Tyr Ala Gly Glu Val Ala Arg Lys Arg Lys Ala Glu Gly Leu Lys Leu Asn Gln Pro Glu Ala Ile Ala Tyr Ile Ser Ala His Ile Met Asp Glu Ala CGC CGT GGC AAA AAA ACC GTT GCT GAA CTT ATG GAA G.AA TGT ATG CAC 191 Arg Arg Gly Lys Lys Thr Val AIa Glu Leu Met Glu Glu Cys Met His Phe Leu Lys Lys Asp Glu Val Met Pro Gly Val Gly Asn Met Val Pro Asp Leu Gly Val Glu Ala Thr Phe Pro Asp Gly Thr Lys Leu Val Thr Val Asn Trp Pro Ile Glu Pro Asp Glu His Phe Lys Ala Gly Glu Val 50 Lys Phe Gly Cys Asp Lys Asp IIe Glu Leu Asn Val Gly Lys Glu Val Thr Glu Leu Glu Val Thr Asn Glu Gly Pro Lys Ser Leu His Val Gly Ser His Phe His Phe Phe Glu Thr Asn Lys Ala Leu Lys Phe Asp Arg Glu Lys Ala Tyr Gly Lys Arg Leu Asp IIe Pro Ser G:Ly Asn Thr Leu CGC ATT GGG GCA GGA CAA ACC CGT AAA GTG CAG TTA A'rC CCT CTT GGC 575 Arg Ile Gly Ala Gly Gln Thr Arg Lys Val Gln Leu I:Le Pro Leu Gly 10 Gly Ser Lys Lys Val Ile Gly Met Asn Gly Leu Val Asn Asn IIe Ala GAC GAA CGC CAT AAA CAC AAA GCA CTA GAC AAG GCA Ai'~A TCT CAC GGA 671 Asp Glu Arg His Lys His Lys Ala Leu Asp Lys Ala Lys Ser His Gly 210 215 2:~0 TTC ATC AAG TAA GGAGACTCCC ATG AAA ATG AAA AAA CAi'~ GAG TAT GTA 720 Phe Ile Lys Met Lys Met Lys Lys Gln Glu Tyr Val AAC ACC TAC GGA CCC ACC ACA GGC GAT AAA GTG CGC T'rA GGA GAT ACC 768 Asn Thr Tyr Gly Pro Thr Thr Gly Asp Lys Val Arg Leu Gly Asp Thr Asp Leu Trp Ala Glu Val Glu His Asp Tyr Thr Thr Tyr Gly Glu Glu CTC AAA TTT GGC GCG GGT AAA ACT ATC CGT GAG GGT A'rG GGT CAG AGC 864 Leu Lys Phe Gly Ala Gly Lys Thr Ile Arg Glu Gly Met Gly Gln Ser Asn Ser Pro Asp Glu Asn Thr Leu Asp Leu Val Ile Tlhr Asn Ala Met Ile Ile Asp Tyr Thr Gly Ile Tyr Lys Ala Asp Ile G:ly Ile Lys Asn Gly Lys Ile His Gly Ile Gly Lys Ala Gly Asn Lys A;sp Met Gln Asp Gly Val Ser Pro His Met Val Val Gly Val Gly Thr G:Lu Ala Leu Ala Gly Glu Gly Met Ile Ile Thr Ala Gly Gly Ile Asp Se>r His Thr His TTC CTC TCT CCC CAA CAA TTC CCT ACC GCT CTA GCC Ai'~T GGT GTT ACA 1152 Phe Leu Ser Pro Gln Gln Phe Fro Thr Ala Leu Ala A:an Gly Val Thr Thr Met Phe Gly Gly Gly Thr GIy Pro Val Asp Gly Tlzr Asn Ala Thr Thr Ile Thr Pro Gly Lys Trp Asn Leu His Arg Met Leu Arg Ala Ala Glu GIu Tyr Ser Met Asn Val Gly Phe Leu Gly Lys Gly Asn Ser Ser Ser Lys Lys Gln Leu Val Glu Gln Val Glu Ala Gly Ala Ile Gly Phe Lys Leu His Glu Asp Trp Gly Thr Thr Pro Ser Ala Ile Asp His Cys Leu Ser Val Ala Asp Glu Tyr Asp Val Gln Val Cys Ile His Thr Asp Thr Val Asn Glu Ala Gly Tyr Val Asp Asp Thr Leu Asn Ala Met Asn GIy Arg Ala Ile His Ala Tyr His Ile Glu Gly Ala Gly Gly Gly His TCA CCT GAT GTT ATC ACC ATG GCA GGC GAG CTC AAT A.TT CTA CCC TCC 1584 Ser Pro Asp Val Ile Thr Met Ala Gly Glu Leu Asn Ile Leu Pro Ser Ser Thr Thr Pro Thr Ile Pro Tyr Thr Ile Asn Thr Val Ala Glu His Leu Asp Met Leu Met Thr Cys His His Leu Asp Lys A.rg Ile Arg Glu Asp Leu Gln Phe Ser Gln Ser Arg Ile Arg Pro Gly Ser Ile Ala Ala Glu Asp Val Leu His Asp Ile Gly Val Ile Ala Met Thr Ser Ser Asp Ser Gln Ala Met Gly Arg Ala Gly Glu Val Ile Pro Arg Thr Trp Gln Thr Ala Asp Lys Asn Lys Lys Glu Phe Gly Lys Leu Pro Glu Asp Gly Ala Asp Asn Asp Asn Phe Arg Ile Lys Arg Tyr Ile Ser Lys Tyr Thr ATT AAT CCC GCT TTG ACC CAT GGC GTG AGC GAG TAT A'TC GGC TCT GTG 1968 Ile Asn Pro Ala Leu Thr His Gly Val Ser Glu Tyr Ile Gly Ser Val Glu Glu Gly Lys Ile Ala Asp Leu Val Val Trp Asn Pro Ala Phe Phe GGC GTG AAA CCT AAG ATT GTG ATT AAA GGT GGC ATG G'TG GTC TTC TCT 2064 Gly Val Lys Pro Lys Ile Val Ile Lys Gly Gly Met Val Val Phe Ser GAA ATG GGC GAT TCT AAC GCG TCC GTG CCC ACG CCT C.AG CCG GTT TAT 2112 Glu Met Gly Asp Ser Asn Ala Ser Val Pro Thr Pro Gln Pro Val Tyr Tyr Arg Glu Met Phe Gly His His Gly Lys Ala Lys Phe Asp Thr Ser Ile Thr Phe Arg Val Ser Ser (2) INFORMATION FOR SEQ ID NO.: 8:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 226 (B) TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter felis (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 8:
Val Lys Leu Thr Pro Lys Glu Gln Glu Lys Phe Leu Leu Tyr Tyr Ala Gly Glu Val Ala Arg Lys Arg Lys Ala Glu Gly Leu Lys Leu Asn Gln Pro Glu Ala Ile Ala Tyr Ile Ser Ala His Ile Met Asp Glu Ala Arg Arg Gly Lys Lys Thr Val Ala Glu Leu Met Glu Glu Cys Met His Phe Leu Lys Lys Asp Glu Val Met Pro Gly Val Gly Asn Met Val Pro Asp Leu Gly Val Glu Ala Thr Phe Pro Asp Gly Thr Lys Leu Val Thr Val Asn Trp Pro Ile Glu Pro Asp Glu His Phe Lys Ala Gly Glu Val Lys Phe Gly Cys Asp Lys Asp Ile Glu Leu Asn Val Gly Lys Glu Val Thr Glu Leu Glu Val Thr Asn Glu Gly Pro Lys Ser Leu His Val Gly Ser His Phe His Phe Phe Glu Thr Asn Lys Ala Leu Lys Phe Asp Arg Glu Lys Ala Tyr Gly Lys Arg Leu Asp Ile Pro Ser Gly Asn Thr Leu Arg Ile Gly Ala Gly Gln Thr Arg Lys Val GIn Leu Ile Pro Leu Gly Gly Ser Lys Lys Val Ile Gly Met Asn Gly Leu Val Asn Asn Ile Ala Asp Glu Arg His Lys His Lys Ala Leu Asp Lys Ala Lys Se r His Gly Phe Ile Lys (2) INFORMATION FOR SEQ ID NO.: 9:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 496 (B) TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter fells (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 9:
Met Lys Met Lys Lys Gln Glu Tyr Val Asn Thr Tyr Gly Pro Thr Thr Gly Asp Lys Val Arg Leu Gly Asp Thr Asp Leu Trp A7_a Glu Val Glu His Asp Tyr Thr Thr Tyr Gly Glu Glu Leu Lys Phe Gly Ala Gly Lys 35 40 9':5 Thr Ile Arg Glu Gly Met Gly Gln Ser Asn Ser Pro A:;p Glu Asn Thr Leu Asp Leu Val Ile Thr Asn Ala Met Ile Ile Asp T~~r Thr Gly Ile Tyr Lys Ala Asp Ile Gly Ile Lys Asn Gly Lys Ile His Gly Ile Gly Lys Ala Gly Asn Lys Asp Met Gln Asp Gly Val Ser Pro His Met Val Val Gly Val Gly Thr Glu Ala Leu Ala Gly Glu Gly Met Ile Ile Thr Ala Gly Gly Ile Asp Ser His Thr His Phe Leu Ser Pro Gln Gln Phe Pro Thr Ala Leu Ala Asn Gly Val Thr Thr Met Phe Gly Gly Gly Thr Gly Pro Val Asp Gly Thr Asn Ala Thr Thr Ile Thr Pro Gly Lys Trp Asn Leu His Arg Met Leu Arg Ala Ala Glu Glu Tyr Ser Met Asn Val Gly Phe Leu Gly Lys Gly Asn Ser Ser Ser Lys Lys G.ln Leu Val Glu Gln Val Glu Ala Gly Ala Ile Gly Phe Lys Leu His G.lu Asp Trp Gly Thr Thr Pro Ser Ala Ile Asp His Cys Leu Ser Val A:La Asp Glu Tyr Asp Val Gln Val Cys Ile His Thr Asp Thr Val Asn G.Lu Ala Gly Tyr Val Asp Asp Thr Leu Asn Ala Met Asn Gly Arg Ala Ile His Ala Tyr His Ile Glu Gly Ala Gly Gly Gly His Ser Pro Asp VaI Ile Thr Met Ala Gly Glu Leu Asn Ile Leu Pro Ser Ser Thr Thr Pro Thr Ile Pro Tyr Thr Ile Asn Thr Val Ala Glu His Leu Asp Met Le:u Met Thr Cys His His Leu Asp Lys Arg Ile Arg Glu Asp Leu Gln Phe Ser Gln Ser Arg Ile Arg Pro Gly Ser Ile Ala Ala Glu Asp Val Le:u His Asp Ile Gly Val Ile Ala Met Thr Ser Ser Asp Ser Gln Ala Mea Gly Arg Ala 355 360 36.5 Gly Glu Val Ile Pro Arg Thr Trp Gln Thr Ala Asp Lys Asn Lys Lys Glu Phe Gly Lys Leu Pro Glu Asp Gly Ala Asp Asn Asp Asn Phe Arg Ile Lys Arg Tyr Ile Ser Lys Tyr Thr Ile Asn Pro Ala Leu Thr His Gly Val Ser Glu Tyr Ile Gly Ser Val Glu Glu Gly Lys Ile AIa Asp Leu Val Val Trp Asn Pro Ala Phe Phe Gly Val Lys Pro Lys Ile Val Ile Lys Gly Gly Met Val Val Phe Ser Glu Met Gly As:p Ser Asn Ala A
Ser Val Pro Thr Pro Gln Pro Val Tyr Tyr Arg Glu Met Phe Gly His His Gly Lys Ala Lys Phe Asp Thr Ser Ile Thr Phe Arg Val Ser Ser (2) INFORMATION FOR SEQ ID NO.: 10:
(i) SEQUENCE CHARACTERISTICS
10 (A) LENGTH: 2407 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter felis (ix) FEATURE
(A) NAME/KEY: CDS
(B) LOCATION: (2)..(682) 20 (ix) FEATURE
(A) NAME/KEY: CDS
(B) LOCATION: (693)..(2399) (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 10:
Val Lys Leu Thr Pro Lys Glu Gln Glu Lys Phe Leu Leu Tyr Tyr Ala Gly Glu Val Ala Arg Lys Arg Lys Ala Glu Gly Leu Lys Leu Asn Gln CCC GAA GCC ATT GCC TAC ATT AGT GCC CAT ATT ATG G.AC GAG GCG CGC 145 Pro Glu Ala Ile Ala Tyr IIe Ser Ala His Ile Met Asp Glu Ala Arg 35 40 ~45 Arg Gly Lys Lys Thr Val Ala Glu Leu Met Glu Glu Cys Met His Phe TTG AAA AAA GAC GAG GTG ATG CCC GGT GTG GGG AAT A'rG GTC CCT GAT 241 Leu Lys Lys Asp Glu Val Met Pro Gly Val Gly Asn Ms~t Val Pro Asp TTA GGC GTG GAA GCT ACT TTT CCC GAT GGC ACC AAA C'CC GTA ACC GTG 289 Leu Gly Val Glu Ala Thr Phe Pro Asp Gly Thr Lys Leu Val Thr Val AAT TGG CCC ATC GAA CCC GAT GAA CAC TTC AAA GCG GCiC GAA GTC AAA 337 Asn Trp Pro Ile Glu Pro Asp Glu His Phe Lys Ala G_Ly Glu Val Lys Phe Gly Cys Asp Lys Asp Ile Glu Leu Asn Ala Gly Lys Glu Val Thr 115 120 1~!5 Glu Leu Glu Val Thr Asn Glu Gly Pro Lys Ser Leu Hi.s Val Gly Ser His Phe His Phe Phe Glu Ala Asn Lys Ala Leu Lys Phe Asp Arg Glu AAA GCC TAT GGC AAA CGC CTA GAT ATT CCC TCT GGC P,AC ACG CTA CGC 529 Lys Ala Tyr Gly Lys Arg Leu Asp Ile Pro Ser Gly F,sn Thr Leu Arg ATT GGG GCA GGA CAA ACC CGT AAA GTG CAG TTA ATC C'CT CTT GGC GGC 577 Ile Gly Ala Gly Gln Thr Arg Lys Val Gln Leu Ile Fro Leu Gly Gly Ser Lys Lys Val Ile Gly Met Asn Gly Leu Val Asn A.sn Ile AIa Asp Glu Arg His Lys His Lys Ala Leu Glu Lys Ala Lys Ser His Gly Phe Ile Lys Met Lys Met Lys Lys Gln Glu Tyr Val Asn Thr Tyr Gly Pro Thr Thr Gly Asp Lys Val Arg Leu Gly Asp Thr Asp Leu Trp Ala Glu Val Glu His Asp Tyr Thr Thr Tyr Gly Glu Glu Leu Lys Phe Gly Ala Gly Lys Thr Ile Arg Glu Gly Met G:Iy Gln Ser Asn AGT CCA GAT GAA AAC ACC CTA GAT TTA GTC.' ATC ACC AAC GCG ATG ATT 914 Ser Pro Asp Glu Asn Thr Leu Asp Leu Val Ile Thr Aan Ala Met Ile ATT GAC TAC ACC GGG ATT TAC AAA GCC GAC ATT GGC A'CT AAA AAT GGC 962 Ile Asp Tyr Thr Gly Ile Tyr Lys Ala Asp Ile Gly IIe Lys Asn GIy AAA ATC CAT GGC ATT GGC AAG GCA GGA AAC AAG GAC A':L'G CAA GAT GGC 1010 Lys Ile His Gly Ile Gly Lys Ala Gly Asn Lys Asp Met Gln Asp Gly Val Ser Pro His Met Val Val Gly Val Gly Thr GIu Ala Leu Ala Gly Glu Gly Met Ile Ile Thr Ala Gly Gly Ile Asp Ser Hi.s Thr His Phe Leu Ser Pro Gln Gln Phe Pro Thr Ala Leu Ala Asn Gly Val Thr Thr m s ATG TTT GGC GGT GGC ACA GGT CCG GTA GAT GGC ACG A.AT GCG ACT ACC 1202 Met Phe Gly Gly Gly Thr GIy Pro Val Asp Gly Thr A.sn Ala Thr Thr Ile Thr Pro Gly Lys Trp Asn Leu His Arg Met Leu A.rg Ala Ala Glu Glu Tyr Ser Met Asn Val Gly Phe Leu Gly Lys Gly Asn Ser Ser Ser Lys Lys Gln Leu Val Glu Gln Ile G1u Ala Gly Ala Ile Gly Phe Lys TTG CAT GAA GAC TGG GGC ACA ACT CCA AGT GCA ATC G.AT CAC TGC TTG 1394 Leu His Glu Asp Trp Gly Thr Thr Pro Ser Ala Ile Asp His Cys Leu AGC GTA GCA GAT GAA TAC GAT GTG CAA GTT TGT ATC C:~C ACC GAT ACG 1442 Ser Val Ala Asp Glu Tyr Asp Val Gln Val Cys Ile H.is Thr Asp Thr Val Asn Glu Ala Gly Tyr Val Asp Asp Thr Leu Asn A:La Met Asn GIy 480 485 4'.a0 CGC GCC ATC CAT GCC TAC CAC ATT GAG GGA GCG GGC GczA GGA CAC TCA 1538 Arg Ala Ile His Ala Tyr His IIe Glu Gly Ala Gly GIy Gly His Ser CCT GAT GTT ATC ACC ATG GCA GGC GAG CTC AAT ATT C'.CA CCC TCC TCC 1586 Pro Asp Val Ile Thr Met Ala Gly Glu Leu Asn Ile Le:u Pro Ser Ser Thr Thr Pro Thr Ile Pro Tyr Thr Ile Asn Thr Val Ala Glu His Leu Asp Met Leu Met Thr Cys His His Leu Asp Lys Arg Il.e Arg Glu Asp Leu Gln Phe Ser Gln Ser Arg Ile Arg Pro Gly Ser ILe AIa Ala Glu Asp VaI Leu His Asp Ile Gly Val Ile Ala Met Thr Ser Ser Asp Ser Gln Ala Met GIy Arg Ala Gly Glu Val Ile Pro Arg Thr Trp Gln Thr Ala Asp Lys Asn Lys Lys Glu Phe Gly Lys Leu Pro Glu Asp Ser Ala Asp Asn Asp Asn Phe Arg Ile Lys Arg Tyr Ile Ser Lys Tyr Thr Ile Asn Pro Ala Leu Thr His Gly Val Ser Glu Tyr Ile Gly Ser Val Glu 640 645 6.50 Glu Gly Lys Ile Ala Asp Leu Val Val Trp Asn Pro A.la Phe Phe Gly Val Lys Pro Lys Ile Val Ile Lys Gly Gly Met Val Val Phe Ser Glu Met Gly Asp Ser Asn Ala Ser VaI Pro Thr Pro Gln Pro Val Tyr Tyr CGC GAA ATG TTT GGG CAT CAC GGC AAG GCG AAA TTT G.AC ACC AGC ATC 2162 Arg Glu Met Phe Gly His His Gly Lys Ala Lys Phe Asp Thr Ser Ile Thr Phe Val Ser Lys Val Ala Tyr Glu Asn Gly Val Lays Glu Lys Leu 720 725 7:30 Gly Leu Glu Arg Lys Val Leu Pro Val Lys Asn Cys A:rg Asn Ile Thr AAG AAA GAC TTC AAA TTC AAC AAC AAG ACG GCG CAT A'CC ACT GTC GAT 2306 Lys Lys Asp Phe Lys Phe Asn Asn Lys Thr Ala His Ile Thr Val Asp Pro Lys Thr Phe Glu Val Phe Val Asp Gly Lys Leu Cys Thr Ser Lys CCC GCC TCT GAA GTG CCT CTA GCC CAG CGC TAC ACT T7.'C TTC TAG 2399 Pro Ala Ser Glu Val Pro Leu Ala Gln Arg Tyr Thr Phe Phe (2) INFORMATION FOR SEQ ID NO.: 11:
(i) SEQUENCE CHARACTERISTICS
(A} LENGTH: 226 (B) TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi} ORIGINAL SOURCE:
(A} ORGANISM: Helicobacter fells (xi} SEQUENCE DESCRIPTION: SEQ ID NO.: 11:
Val Lys Leu Thr Pro Lys GIu Gln Glu Lys Phe Leu Leu Tyr Tyr Ala Gly Glu Val Ala Arg Lys Arg Lys Ala Glu Gly Leu Lys Leu Asn Gln Pro Glu Ala Ile Ala Tyr Ile Ser Ala His Ile Met Asp Glu Ala Arg Arg Gly Lys Lys Thr Val Ala Glu Leu Met Glu Glu C".ys Met His Phe Leu Lys Lys Asp Glu Val Met Pro Gly Val Gly Asn Met Val Pro Asp 65 70 ?5 80 Leu Gly Val Glu Ala Thr Phe Pro Asp Gly Thr Lys Leu Val Thr Val Asn Trp Pro Ile Glu Pro Asp Glu His Phe Lys Ala Gly Glu Val Lys Phe Gly Cys Asp Lys Asp Ile Glu Leu Asn Ala Gly L~ys Glu Val Thr Glu Leu Glu Val Thr Asn Glu Gly Pro Lys Ser Leu T3:is Val Gly Ser His Phe His Phe Phe Glu Ala Asn Lys Ala Leu Lys Phe Asp Arg Glu Lys Ala Tyr GIy Lys Arg Leu Asp Ile Pro Ser Gly Asn Thr Leu Arg 165 1?0 175 Ile Gly Ala Gly Gln Thr Arg Lys Val Gln Leu Ile Pro Leu Gly GIy Ser Lys Lys VaI Ile Gly Met Asn Gly Leu Val Asn Asn Ile AIa Asp Glu Arg His Lys His Lys Ala Leu Glu Lys Ala Lys Ser His Gly Phe Ile Lys (2) INFORMATION FOR SEQ ID NO.: 12:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 568 (B} TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi) ORIGINAL SOURCE:
(A} ORGANISM: Helicobacter felis (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 12:
Met Lys Met Lys Lys Gln Glu Tyr Val Asn Thr Tyr Gly Pro Thr Thr Gly Asp Lys Val Arg Leu Gly Asp Thr Asp Leu Trp AT.a Glu Val Glu m m His Asp Tyr Thr Thr Tyr Gly Glu GIu Leu Lys Phe Gly Ala Gly Lys Thr Ile Arg Glu Gly Met Gly Gln Ser Asn Ser Pro Asp Glu Asn Thr Leu Asp Leu Val Ile Thr Asn Ala Met Ile Ile Asp 'I'yr Thr Gly Ile 10 Tyr Lys Ala Asp Ile Gly Ile Lys Asn Gly Lys Ile His Gly Ile Gly Lys Ala Gly Asn Lys Asp Met Gln Asp Gly Val Ser Fro His Met Val Val Gly Val Gly Thr Glu Ala Leu Ala Gly Glu Gly M:et Ile Ile Thr Ala Gly Gly Ile Asp Ser His Thr His Phe Leu Ser Pro Gln Gln Phe Pro Thr Ala Leu Ala Asn Gly Val Thr Thr Met Phe Gly Gly Gly Thr Gly Pro Val Asp Gly Thr Asn Ala Thr Thr Ile Thr Pro Gly Lys Trp Asn Leu His Arg Met Leu Arg Ala Ala Glu Glu Tyr Ser Met Asn Val Gly Phe Leu Gly Lys Gly Asn Ser Ser Ser Lys Lys Gln Leu Val Glu Gln Ile Glu Ala Gly Ala Ile Gly Phe Lys Leu His G:lu Asp Trp Gly Thr Thr Pro Ser Ala Ile Asp His Cys Leu Ser Val A:La Asp Glu Tyr Asp Val Gln Val Cys Ile His Thr Asp Thr Val Asn G:Lu Ala Gly Tyr Val Asp Asp Thr Leu Asn Ala Met Asn Gly Arg Ala I:Le His Ala Tyr His Ile Glu Gly Ala Gly Gly Gly His Ser Pro Asp Val Ile Thr Met Ala Gly Glu Leu Asn Ile Leu Pro Ser Ser Thr Thr Pro Thr Ile Pro Tyr Thr Ile Asn Thr Val Ala Glu His Leu Asp Met Le:u Met Thr Cys His His Leu Asp Lys Arg Ile Arg Glu Asp Leu Gln Phe Ser Gln Ser Arg Ile Arg Pro Gly Ser Ile Ala Ala Glu Asp Val Le:u His Asp Ile a Gly Val Ile Ala Met Thr Ser Ser Asp Ser Gln Ala Met Gly Arg Ala 355 360 ~t65 Gly Glu Val Ile Pro Arg Thr Trp Gln Thr Ala Asp Lys Asn Lys Lys Glu Phe Gly Lys Leu Pro Glu Asp Ser Ala Asp Asn Asp Asn Phe Arg Ile Lys Arg Tyr Ile Ser Lys Tyr Thr Ile Asn Pro F~la Leu Thr His Gly Val Ser Glu Tyr Ile Gly Ser Val Glu Glu Gly L~ys Ile Ala Asp Leu Val Val Trp Asn Pro Ala Phe Phe Gly Val Lys Fro Lys Ile Val Ile Lys Gly Gly Met Val Val Phe Ser Glu Met Gly A.sp Ser Asn Ala Ser Val Pro Thr Pro Gln Pro Val Tyr Tyr Arg Glu Met Phe Gly His 465 470 475 4gp His Gly Lys Ala Lys Phe Asp Thr Ser Ile Thr Phe Val Ser Lys Val Ala Tyr Glu Asn Gly Val Lys Glu Lys Leu Gly Leu Glu Arg Lys Val Leu Pro Val Lys Asn Cys Arg Asn Ile Thr Lys Lys Asp Phe Lys Phe Asn Asn Lys Thr Ala His Ile Thr Val Asp Pro Lys T:hr Phe Glu Val Phe Val Asp Gly Lys Leu Cys Thr Ser Lys Pro Ala Ser Glu Val Pro Leu Ala Gln Arg Tyr Thr Phe Phe (2) INFORMATION FOR SEQ ID NO.: 13:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 2452 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter felis (ix) FEATURE
(A) NAME/KEY: CDS
(B) LOCATION: (48) . . (728) (ix) FEATURE
(A) NAME/KEY: CDS
(B} LOCATION: (739}..(2445) m (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 13:
Val Lys Leu ACA CCC AAA GAG CAA GAA AAG TTC TTG TTA TAT TAT G'CG GGC GAA GTG 104 Thr Pro Lys Glu Gln Glu Lys Phe Leu Leu Tyr Tyr A.la Gly Glu Val Ala Arg Lys Arg Lys Ala Glu Gly Leu Lys Leu Asn Gln Pro Glu Ala Ile Ala Tyr Ile Ser Ala His Ile Met Asp Glu Ala Arg Arg Gly Lys AAA ACC GTT GCG GAA CTT ATG GAA GAG TGT ATG CAC T'TT TTG AAA AAA 248 Lys Thr Val Ala Glu Leu Met Glu Glu Cys Met His P:he Leu Lys Lys GAC GAG GTG ATG CCC GGG GTG GGG AAT ATG GTC CCT Gi'~T TTG GGC GTG 296 Asp Glu Val Met Pro Gly Val Gly Asn Met Val Pro A:ap Leu Gly Val GAA GCC ACT TTC CCC GAT GGC ACC AAA CTC GTA ACT G':CG AAT TGG CCC 344 Glu Ala Thr Phe Pro Asp Gly Thr Lys Leu Val Thr Val Asn Trp Pro Ile Glu Pro Asp Glu His Phe Lys Ala Gly Glu Val Lys Phe Gly Cys GAT AAA GAC ATT GAA CTC AAC GCA GGT AAG GAA GTT AC'.C GAA CTA GAA 440 Asp Lys Asp Ile Glu Leu Asn Ala Gly Lys Glu Val Thr Glu Leu Glu Val Thr Asn Glu Gly Pro Lys Ser Leu His Val Gly Se:r His Phe His Phe Phe Glu Ala Asn Lys Ala Leu Lys Phe Asp Arg Glu Lys Ala Tyr Gly Lys Arg Leu Asp Ile Pro Ser Gly Asn Thr Leu Arg Ile Gly Ala GGA CAA ACC CGT AAA GTG CAG TTA ATC CCT CTT GGC GG'T AGT AAA AAA 632 Gly Gln Thr Arg Lys Val Gln Leu Ile Pro Leu Gly Gly Ser Lys Lys Val Ile Gly Met Asn Gly Leu Val Asn Asn Ile Ala Asp Glu Arg His AAA CAC AAA GCG CTA GAC AAA GCA AAA TCT CAC GGA TT'.C ATC AAG TAA 728 Lys His Lys Ala Leu Asp Lys Ala Lys Ser His Gly Phe~ Ile Lys a c Met Lys Met Lys Lys Gln Glu Tyr Val Asn Thr Tyr Gly Pro Thr Thr Gly Asp Lys Val Arg Leu Gly Asp Thr Asp Leu Trp Ala GAA GTA GAA CAT GAC TAT ACC ACC TAT GGC GAA GAA C;TC AAA TTC GGT 873 Glu Val Glu His Asp Tyr Thr Thr Tyr Gly Glu Glu Leu Lys Phe Gly Ala Gly Lys Thr Ile Arg Glu Gly Met Gly Gln Ser P,sn Ser Pro Asp GAA AAC ACC TTA GAT TTA GTG ATC ACC AAC GCG ATG F,TT ATT GAC TAC 969 Glu Asn Thr Leu Asp Leu Val Ile Thr Asn Ala Met Ile Ile Asp Tyr Thr Gly Ile Tyr Lys Ala Asp Ile Gly Ile Lys Asn Gly Lys Ile His Gly Ile Gly Lys Ala Gly Asn Lys Asp Met Gln Asp Gly Val Ser Pro His Met Val Val Gly Val Gly Thr Glu Ala Leu Ala Gly Glu Gly Met Ile Ile Thr Ala Gly GIy Ile Asp Ser His Thr His Phe Leu Ser Pro Gln Gln Phe Pro Thr Ala Leu Ala Asn Gly Val Thr T:hr Met Phe Gly 370 375 3g0 Gly Gly Thr Gly Pro Val Asp Gly Thr Asn Ala Thr T~hr Ile Thr Pro GGC AAA TGG AAC TTG CAC CGC ATG TTG CGC GCA GCA GA.A GAG TAT TCT 1305 Gly Lys Trp Asn Leu His Arg Met Leu Arg Ala Ala G:Lu Glu Tyr Ser ATG AAT GTG GGC TTT TTG GGC AAA GGC AAT AGC TCT A<sT AAA AAA CAA 1353 Met Asn Val Gly Phe Leu Gly Lys Gly Asn Ser Ser Seer Lys Lys Gln Leu Val Glu Gln Val Glu Ala Gly Ala Ile Gly Phe Lys Leu His Glu 435 440 4~E5 Asp Trp Gly Thr Thr Pro Ser Ala Ile Asp His Cys Le:u Ser Val Ala *' s b Asp Glu Tyr Asp Val Gln Val Cys IIe His Thr Asp Thr Val Asn Glu GCA GGT TAT GTA GAT GAC ACC CTA AAT GCA ATG AAC CiGG CGC GCC ATC 1545 Ala Gly Tyr Val Asp Asp Thr Leu Asn Ala Met Asn G:ly Arg Ala Ile His Ala Tyr His Ile Glu Gly Ala Gly Gly Gly His ~~er Pro Asp Val ATC ACC ATG GCA GGC GAA GTG AAT ATT CTA CCC TCC T'CC ACA ACC CCT 1641 Ile Thr Met Ala Gly Glu Val Asn Ile Leu Pro Ser S'er Thr Thr Pro ACT ATC CCC TAT ACC ATT AAT ACG GTT GCA GAA CAC T'TA GAC ATG CTT 1689 Thr Ile Pro Tyr Thr Ile Asn Thr Val Ala Glu His Leu Asp Met Leu Met Thr Cys His His Leu Asp Lys Arg Ile Arg Glu A.sp Leu Gln Phe Ser Gln Ser Arg Ile Arg Pro Gly Ser Ile Ala Ala Glu Asp Val Leu His Asp Ile Gly Val Ile Ala Met Thr Ser Ser Asp Ser Gln Ala Met Gly Arg Ala Gly Glu Val Ile Pro Arg Thr Trp Gln T:hr Ala Asp Lys Asn Lys Lys Glu Phe Gly Lys Leu Pro Glu Asp Gly A:la Asp Asn Asp AAC TTC CGC ATC AAA CGC TAT ATC TCC AAA TAC ACC A'rT AAT CCC GCT 1977 Asn Phe Arg Ile Lys Arg Tyr Ile Ser Lys Tyr Thr I:Le Asn Pro Ala Leu Thr His Gly Val Ser Glu Tyr Ile Gly Ser Val Glu Glu Gly Lys Ile Ala Asp Leu Val Val Trp Asn Pro Ala Phe Phe Gly VaI Lys Pro Lys Ile Val Ile Lys Gly Gly Met Val Val Phe Ser G7.u Met Gly Asp Ser Asn Ala Ser Val Pro Thr Pro Gln Pro Val Tyr Tyr Arg Glu Met s Phe GIy His His Gly Lys Ala Lys Phe Asp Thr Ser I:le Thr Phe Val TCC AAA GTC GCC TAT GAA AAT GGT GTG AAA GAA AAA C.'TA GGT TTA GAG 2265 Ser Lys Val Ala Tyr Glu Asn Gly Val Lys Glu Lys I~eu Gly Leu Glu CGC AAG GTG CTC CCC GTG AAA AAC TGC CGT AAC ATC A.CC AAG AAG GAC 2313 10 Arg Lys Val Leu Pro Val Lys Asn Cys Arg Asn Ile Thr Lys Lys Asp Phe Lys Phe Asn Asp Lys Thr Ala Lys Ile Thr Val A.sp Pro Lys Thr Phe Glu Val Plie Val Asp Gly Lys Leu Cys Thr Ser Lys Pro Thr Ser Glu Val Pro Leu Ala GIn Arg Tyr Thr Phe Phe (2) INFORMATION FOR SEQ ID NO.: 14:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 226 (B) TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter felis (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 14:
Val Lys Leu Thr Pro Lys Glu Gln Glu Lys Phe Leu Lf°u Tyr Tyr AIa Gly Glu Val Ala Arg Lys Arg Lys Ala Glu GIy Leu L~,rs Leu Asn Gln 20 25 ~ 30 Pro Glu Ala Ile Ala Tyr Ile Ser Ala His Ile Met Asp Glu Ala Arg Arg Gly Lys Lys Thr Val Ala Glu Leu Met Glu Glu Cys Met His Phe Leu Lys Lys Asp Glu Val Met Pro Gly Val Gly Asn Meat Val Pro Asp Leu Gly Val GIu AIa Thr Phe Pro Asp Gly Thr Lys Le;u Val Thr Val Asn Trp Pro Ile Glu Pro Asp Glu His Fhe Lys Ala Gl.y Glu Val Lys Phe GIy Cys Asp Lys Asp IIe Glu Leu Asn Ala Gly Lys Glu VaI Thr ', at r Glu Leu Glu Val Thr Asn Glu GIy Pro Lys Ser Leu hfis Val Gly Ser His Phe His Phe Phe Glu Ala Asn Lys Ala Leu Lys F~he Asp Arg Glu Lys Ala Tyr Gly Lys Arg Leu Asp Ile Pro Ser Gly A.sn Thr Leu Arg Ile Gly Ala Gly Gln Thr Arg Lys Val Gln Leu Ile Pro Leu Gly Gly Ser Lys Lys VaI Ile Gly Met Asn Gly Leu Val Asn Asn Ile Ala Asp Glu Arg His Lys His Lys AIa Leu Asp Lys Ala Lys S~er His GIy Phe Ile Lys (2) INFORMATION FOR SEQ ID NO.: 15:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 568 (B} TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter felis (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 15:
Met Lys Met Lys Lys Gln Glu Tyr Val Asn Thr Tyr Gl.y Pro Thr Thr Gly Asp Lys Val Arg Leu Gly Asp Thr Asp Leu Trp Ala Glu Val Glu His Asp Tyr Thr Thr Tyr Gly Glu Glu Leu Lys Phe Gly Ala Gly Lys Thr Ile Arg Glu Gly Met GIy Gln Ser Asn Ser Pro Asp Glu Asn Thr Leu Asp Leu Val Ile Thr Asn Ala Met IIe Ile Asp Tyr Thr Gly Ile Tyr Lys AIa Asp Ile Gly Ile Lys Asn Gly Lys Ile Hiss Gly Ile Gly Lys Ala Gly Asn Lys Asp Met Gln Asp Gly Va1 Ser Pro His Met Val Val Gly Val Gly Thr Glu Ala Leu Ala Gly Glu Gly Mei: Ile Ile Thr ~ I5 12 0 12 Ei Ala Gly Gly Ile Asp Ser His Thr His Phe Leu Ser Pro Gln Gln Phe a a a.
Pro Thr Ala Leu Ala Asn Gly Val Thr Thr Met Phe Gly Gly Gly Thr Gly Pro Val Asp Gly Thr Asn Ala Thr Thr Ile Thr Pro Gly Lys Trp Asn Leu His Arg Met Leu Arg Ala Ala Glu Glu Tyr ~~er Met Asn Val Gly Phe Leu Gly Lys Gly Asn Ser Ser Ser Lys Lys Gln Leu Val Glu Gln Val Glu Ala GIy Ala Ile Gly Phe Lys Leu His Glu Asp Trp Gly Thr Thr Pro Ser Ala Ile Asp His Cys Leu Ser Val A.la Asp Glu Tyr Asp Val GIn Val Cys Ile His Thr Asp Thr VaI Asn Glu Ala Gly Tyr Val Asp Asp Thr Leu Asn AIa Met Asn Gly Arg Ala IIe His Ala Tyr His Ile Glu Gly Ala Gly Gly Gly His Ser Pro Asp Val Ile Thr Met Ala Gly Glu Val Asn Ile Leu Pro Ser Ser Thr Thr Pro Thr Ile Pro Tyr Thr Ile Asn Thr Val Ala Glu His Leu Asp Met L~eu Met Thr Cys His His Leu Asp Lys Arg Ile Arg GIu Asp Leu Gln Phe Ser Gln Ser Arg Ile Arg Pro Gly Ser Ile Ala Ala Glu Asp Val Leu His Asp Ile Gly Val Ile Ala Met Thr Ser Ser Asp Ser Gln Ala Most GIy Arg Ala 355 360 3Ei5 Gly Glu VaI Ile Pro Arg Thr Trp Gln Thr Ala Asp Lys Asn Lys Lys Glu Phe Gly Lys Leu Pro Glu Asp Gly Ala Asp Asn A:ap Asn Phe Arg Ile Lys Arg Tyr Ile Ser Lys Tyr Thr Ile Asn Pro Al.a Leu Thr His Gly Val Ser Glu Tyr Ile Gly Ser Val Glu Glu Gly L~~s Ile Ala Asp Leu Val Val Trp Asn Pro Ala Phe Phe Gly Val Lys Pro Lys Ile Val Ile Lys Gly Gly Met Val Val Phe Ser Glu Met Gly Asp Ser Asn Ala Ser Val Pro Thr Pro Gln Pro Val Tyr Tyr Arg Glu Nfet Phe Gly His His Gly Lys Ala Lys Phe Asp Thr Ser Ile Thr Phe Val Ser Lys Val Ala Tyr Glu Asn Gly Val Lys Glu Lys Leu Gly Leu Glu Arg Lys Val Leu Pro Val Lys Asn Cys Arg Asn Ile Thr Lys Lys A.sp Phe Lys Phe Asn Asp Lys Thr Ala Lys Ile Thr Val Asp Pro Lys Thr Phe Glu Val Phe Val Asp GIy Lys Leu Cys Thr Ser Lys Pro Thr Ser GIu Val Pro Leu Ala Gln Arg Tyr Thr Phe Fhe (2) INFORMATION FOR SEQ ID NO.: 16:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 21 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter felis (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 16:
(2) INFORMATION FOR SEQ ID NO.: 17:
{i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 16 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter felis (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 17:
(2) INFORMATION FOR SEQ ID NO.: 18:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 32 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter felis (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 18:
.__ _._..-._._.___m. .._..~"~..,~~,~,,~~", ~~~, ,~",. _ .~~_._-__..
(2) INFORMATION FOR SEQ ID NO.: 19:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 27 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter felis (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 19:
(2) INFORMATION FOR SEQ ID NO.: 20:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 27 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter felis (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 20:
(2) INFORMATION FOR SEQ ID NO.: 21:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 34 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter felis (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 21:
w
and B, including a short non-coding region bridging the two coding sequences, from Helicobacter fells, pylori and heilmannii Figure 1 b: Comparison of the amino acid sequence of UreX from Helicobacter fells species CS1, Kukka, Ds4, 2301 and 390 with the amino acid sequence encoding UreA
from Helicobacter fells, pylori and heilmannii Figure 1 c: Comparison of the amino acid sequence of UreY from Helicobacter fells species CS1, Kukka, Ds4, 2301 and 390 with the amino acid sequence encoding Urea from Helicobacter fells, pylori and heilmannii Figure 2: Polyacrylamide gel of the expression products UreX and UreY
Lane 7 : Biorad broad range marker Lane 8 : Complete cell culture before induction (small scale culture) Lane 9 : Complete cell culture after induction (small scale culture) Lane 10 : Complete cell culture after induction (large scale culture) Lane 11 : Supernatant after induction (large scale culture).
Lane 12 : Biorad pre-stained marker SEQUENCE LISTING
(1) GENERAL INFORMATION:
(i) APPLICANT: AKZO NOBEL N.V.
(ii} TITLE OF INVENTION: HELICOBACTER FELIS VACCINE
(iii} NUMBER OF SEQUENCES: 21 (iv} CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: FETHERSTONHAUGH & CO.
10 (B) STREET: P.O. BOX 2999, STATION D
(C) CITY: OTTAWA
(D) STATE: ONT
(E) COUNTRY: CANADA
(F) ZIP: K1P 5Y6 (v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk (B) COMPUTER: IBM PC compatible (C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: ASCII (text) (vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: CA
(B) FILING DATE: 03-JUL-2001 (C) CLASSIFICATION:
(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER:
(B) FILING DATE:
(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: FETHERSTONHAUGH & C0.
(B) REGISTRATION NUMBER:
(C) REFERENCE/DOCKET NUMBER: 23804-613 (ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (613)-235-4373 {B) TELEFAX: (613)-232-8440 (2} INFORMATION FOR SEQ ID NO.: 1:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 2883 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter felis (ix) FEATURE
(A) NAME/KEY: CDS
(B) LOCATION: (206) . . (886) (ix) FEATURE
(A) NAME/KEY: CDS
(B) LOCATION: (897}..(2603}
(xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 1:
ACTTGTTAAT RCTATTATTA ATTTTTTAAT AATTACTTAT TATCA'rATAT AATAATATTA 120 TTACTTATAT TAAAAAGTTA ATAAAAAGTA ACGAAATTAG GACTA'rAATC CCATTGCCTT 180 TAAAATTTAA CACAAGGAGT AATAG GTG AAA CTC ACA CCC Ai'-~A GAG CAA GAA 232 VaI Lys Leu Thr Pro Lys Glu Gln Glu AAG TTC TTG TTA TAT TAT GCG GGC GAA GTG GCT AGA Ai~G CGC AAA GCA 280 Lys Phe Leu Leu Tyr Tyr AIa Gly Glu Val Ala Arg L~~s Arg Lys Ala GAG GGC TTA AAG CTC AAC CAA CCC GAA GCC ATT GCT Ti~C ATT AGT GCC 328 Glu Gly Leu Lys Leu Asn Gln Pro Glu Ala Ile Ala T~~r Ile Ser Ala CAT ATT ATG GAC GAA GCG CGC CGT GGA AAA AAA ACC G'rT GCC CAG CTT 376 10 His Ile Met Asp Glu Ala Arg Arg Gly Lys Lys Thr Val Ala Gln Leu ATG GAA GAG TGC ATG CAC TTT TTG AAA AAA GAT GAA G'rA ATG CCC GGG 424 Met Glu Glu Cys Met His Phe Leu Lys Lys Asp Glu Val Met Pro Gly 60 65 '70 Val Gly Asn Met Val Pro Asp Leu Gly Val Glu Ala Tlar Phe Pro Asp Gly Thr Lys Leu Val Thr Val Asn Trp Pro I1e Glu P:ro Asp Glu His Phe Lys Ala Gly Glu Val Lys Phe Gly Cys Asp Lys Aap Ile Glu Leu AAT GCA GGC AAA GAA GTA ACC GAA CTT GAG GTT ACT Ai~T GAA GGG CCT 616 Asn Ala Gly Lys Glu Val Thr Glu Leu Glu Val Thr A:an Glu Gly Pro AAA TCC TTG CAT GTG GGT AGC CAT TTC CAC TTC TTT Gi~A GCT AAC AAG 664 Lys Ser Leu His Val Gly Ser His Phe His Phe Phe G.Lu Ala Asn Lys 140 145 1!50 GCA CTA AAA TTC GAT CGT GAA AAA GCC TAT GGC AAA C(3C CTA GAT ATT 712 Ala Leu Lys Phe Asp Arg Glu Lys Ala Tyr Gly Lys Arg Leu Asp Ile Pro Ser Gly Asn Thr Leu Arg Ile Gly Ala Gly Gln Thr Arg Lys Val CAG TTG ATT CCT CTT GGT GGC AGT AAA AAA GTG ATT G(zC ATG AAC GGG 808 Gln Leu Ile Pro Leu Gly Gly Ser Lys Lys Val Ile Gly Met Asn Gly CTT GTG AAT AAC ATC GCG GAT GAA CGC CAT AAA CAT AAP. GCG CTT GAC 856 Leu Val Asn Asn Ile Ala Asp Glu Arg His Lys His L~rs Ala Leu Asp Lys Ala Lys Ser His Gly Phe Ile Lys Met Lys Met AAA AAA CAA GAA TAT GTA AAT ACC TAC GGA CCC ACC A~~.A GGC GAT AAA 953 Lys Lys Gln Glu Tyr Val Asn Thr Tyr Gly Pro Thr Lys Gly Asp Lys s GTG CGC TTA GGA GAT ACC GAT CTT TGG GCA GAA GTA Gi~A CAT GAC TAT 1001 Val Arg Leu Gly Asp Thr Asp Leu Trp Ala Glu Val G:Lu His Asp Tyr Thr Thr Tyr Gly Glu Glu Leu Lys Phe Gly Ala Gly L~~rs Thr Ile Arg 265 270 2'75 Glu Gly Met Gly Gln Ser Asn Ser Pro Asp Glu Asn Tlzr Leu Asp Leu GTC ATC ACT AAC GCG ATG ATT ATC GAC TAC ACC GGG A'L'T TAC AAA GCC 1145 Val Ile Thr Asn Ala Met Ile Ile Asp Tyr Thr Gly I:Le Tyr Lys Ala GAC ATT GGG ATT AAA AAC GGC AAA ATC CAT GGC ATT G(3C AAG GCA GGA 1193 Asp IIe Gly Ile Lys Asn Gly Lys Ile His Gly Ile G:Ly Lys Ala Gly AAC AAG GAC ATG CAA GAT GGC GTA AGC CCT CAT ATG G'CC GTG GGT GTG 1241 Asn Lys Asp Met Gln Asp GIy Val Ser Pro His Met V<~1 Val Gly VaI
GIy Thr Glu Ala Leu Ala Gly Glu Gly Met Ile Ile Thr Ala Gly Gly ATC GAT TCA CAC ACC CAC TTC CTT TCT CCA CAA CAA T'CC CCT ACC GCT 1337 Ile Asp Ser His Thr His Phe Leu Ser Pro Gln Gln Phe Pro Thr Ala Leu Ala Asn Gly Val Thr Thr Met Phe Gly Gly Gly Thr Gly Pro Val Asp Gly Thr Asn Ala Thr Thr Ile Thr Pro Gly Lys Trp Asn Leu His CGC ATG TTG CGC GCA GCA GAA GAG TAT TCT ATG AAT G'.CG GGC TTT TTG 1481 Arg Met Leu Arg Ala Ala Glu Glu Tyr Ser Met Asn Val Gly Phe Leu Gly Lys Gly Asn Ser Ser Ser Lys Lys Gln Leu Val G7Lu Gln Val Glu GCG GGC GCG ATT GGT TTT AAA TTG CAT GAA GAC TGG GCiC ACA ACA CCA 1577 Ala Gly Ala IIe Gly Phe Lys Leu His GIu Asp Trp G7_y Thr Thr Pro Ser Ala Ile Asp His Cys Leu Ser Val Ala Asp Glu Tyr Asp VaI Gln Val Cys Ile His Thr Asp Thr Val Asn Glu Ala Gly Tyr Val Asp Asp ' F
ACC CTA AAT GCA ATG AAC GGG CGC GCC ATC CAT GCC T.AC CAC ATT GAG 1721 Thr Leu Asn Ala Met Asn Gly Arg Ala Ile His Ala Tyr His Ile Glu GGA GCG GGT GGA GGA CAC TCA CCT GAT GTT ATC ACC A'rG GCA GGC GAG 1769 Gly Ala Gly Gly Gly His Ser Pro Asp Val Ile Thr Met Ala Gly Glu Leu Asn Ile Leu Pro Ser Ser Thr Thr Pro Thr Ile Pro Tyr Thr Ile Asn Thr Val Ala Glu His Leu Asp Met Leu Met Thr Cys His His Leu Asp Lys Arg IIe Arg Glu Asp Leu Gln Phe Ser Gln S~er Arg Ile Arg CCC GGC TCT ATC GCG GCT GAA GAT GTG CTC CAT GAT A'rG GGT GTG ATC 1961 Pro Gly Ser Ile Ala Ala Glu Asp Val Leu His Asp Met Gly Val Ile GCG ATG ACA AGC TCG GAT TCG CAA GCA ATG GGG CGT G~~A GGC GAA GTG 2009 Ala Met Thr Ser Ser Asp Ser Gln Ala Met Gly Arg Ala Gly Glu Val ATT CCT CGA ACT TGG CAG ACT GCG GAT AAG AAT AAA A;?~A GAA TTT GGT 2057 Ile Pro Arg Thr Trp Gln Thr Ala Asp Lys Asn Lys Lys Glu Phe Gly Lys Leu Pro Glu Asp Gly Lys Asp Asn Asp Asn Phe A:rg Ile Lys Arg Tyr Ile Ser Lys Tyr Thr Ile Asn Pro Ala Leu Thr H.is Gly Val Ser GIu Tyr Ile Gly Ser Val Glu Glu Gly Lys Ile Ala Asp Leu Val Val TGG AAT CCT GCC TTT TTT GGC GTA AAA CCC AAA ATC G'rG ATC AAA GGC 2249 Trp Asn Pro Ala Phe Phe Gly Val Lys Pro Lys Ile Va 1 Ile Lys Gly 665 670 6'75 Gly Met Val Val Phe Ser Glu Met Gly Asp Ser Asn A:la Ser Val Pro ACT CCC CAA CCG GTT TAT TAC CGC GAA ATG TTT GGG Ci~T CAC GGC AAG 2345 Thr Pro Gln Pro Val Tyr Tyr Arg Glu Met Phe Gly H:is His GIy Lys GCG AAA TTT GAC ACC AGC ATC ACT TTT GTT TCC AAA G'CC GCC TAT GAA 2393 Ala Lys Phe Asp Thr Ser Ile Thr Phe Val Ser Lys Val Ala Tyr Glu a s t Asn Gly Val Lys Glu Lys Leu Gly Leu Glu Arg Gln Val Leu Pro Val Lys Asn Cys Arg Asn Ile Thr Lys Lys Asp Phe Lys Phe Asn Asp Lys Thr Ala Lys Ile Thr Val Asp Pro Lys Thr Phe Glu Val Phe Val Asp Gly Lys Leu Cys Thr Ser Lys Pro Thr Ser Gln Val Pro Leu Ala Gln Arg Tyr Thr Phe Phe (2) INFORMATION FOR SEQ ID NO.: 2:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 226 (B) TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter felis (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 2:
Val Lys Leu Thr Pro Lys Glu Gln Glu Lys Phe Leu Leu Tyr Tyr Ala Gly Glu Val Ala Arg Lys Arg Lys Ala Glu Gly Leu Lys Leu Asn Gln Pro Glu Ala Ile Ala Tyr Ile Ser Ala His Ile Met Asp Glu Ala Arg 35 40 ~45 Arg Gly Lys Lys Thr Val Ala Gln Leu Met Glu Glu Cys Met His Phe 50 Leu Lys Lys Asp Glu Val Met Pro Gly Val Gly Asn Met Val Pro Asp Leu Gly Val Glu Ala Thr Phe Pro Asp Gly Thr Lys Leu Val Thr Val Asn Trp Pro IIe Glu Pro Asp Glu His Phe Lys Ala G:Ly Glu Val Lys Phe Gly Cys Asp Lys Asp Ile Glu Leu Asn Ala Gly L~~rs Glu Val Thr Glu Leu Glu Val Thr Asn Glu Gly Pro Lys Ser Leu His Val Gly Ser His Phe His Phe Phe Glu Ala Asn Lys Ala Leu Lys P:he Asp Arg Glu Lys Ala Tyr Gly Lys Arg Leu Asp Ile Pro Ser Gly Asn Thr Leu Arg 10 Ile Gly Ala Gly Gln Thr Arg Lys Val Gln Leu Ile Pro Leu Gly Gly Ser Lys Lys Val Ile Gly Met Asn Gly Leu Val Asn Asn Ile Ala Asp Glu Arg His Lys His Lys Ala Leu Asp Lys Ala Lys Ser His Gly Phe Ile Lys (2) INFORMATION FOR SEQ ID NO.: 3:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 568 (B) TYPE: amino acid (C) STRANDEDNESS:
fD) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter felis (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 3:
Met Lys Met Lys Lys Gln Glu Tyr Val Asn Thr Tyr Gly Pro Thr Lys Gly Asp Lys Val Arg Leu Gly Asp Thr Asp Leu Trp Ala Glu Val Glu His Asp Tyr Thr Thr Tyr Gly Glu Glu Leu Lys Phe Gly Ala Gly Lys Thr Ile Arg Glu Gly Met Gly Gln Ser Asn Ser Pro Asp Glu Asn Thr Leu Asp Leu Val Ile Thr Asn Ala Met Ile Ile Asp T~yr Thr Gly Ile Tyr Lys Ala Asp Ile Gly Ile Lys Asn Gly Lys Ile H.is Gly Ile Gly Lys Ala Gly Asn Lys Asp Met Gln Asp Gly Val Ser P:ro His Met Val Val Gly Val Gly Thr Glu Ala Leu Ala Gly Glu Gly Me t Ile Ile Thr 115 12 0 l:Z 5 Ala Gly Gly Ile Asp Ser His Thr His Phe Leu Ser P:ro Gln Gln Phe Pro Thr Ala Leu Ala Asn Gly Val Thr Thr Met Phe G:ly Gly Gly Thr Gly Pro Val Asp Gly Thr Asn Ala Thr Thr Ile Thr P:ro Gly Lys Trp Asn Leu His Arg Met Leu Arg Ala Ala Glu Glu Tyr S~'r Met Asn Val Gly Phe Leu Gly Lys Gly Asn Ser Ser Ser Lys Lys G:ln Leu Val Glu 195 200 2~D5 Gln Val Glu Ala Gly Ala Ile Gly Phe Lys Leu His G:lu Asp Trp Gly Thr Thr Pro Ser Ala Ile Asp His Cys Leu Ser Val A:la Asp Glu Tyr Asp Val GIn Val Cys Ile His Thr Asp Thr Val Asn G:lu Ala Gly Tyr Val Asp Asp Thr Leu Asn Ala Met Asn Gly Arg Ala I:le His Ala Tyr His Ile GIu Gly Ala Gly Gly Gly His Ser Pro Asp V;al Ile Thr Met 275 280 2.85 Ala Gly Glu Leu Asn Ile Leu Pro Ser Ser Thr Thr P:ro Thr Ile Pro Tyr Thr Ile Asn Thr Val Ala Glu His Leu Asp Met L.°u Met Thr Cys His His Leu Asp Lys Arg Ile Arg Glu Asp Leu Gln Plhe Ser Gln Ser Arg Ile Arg Pro Gly Ser Ile Ala Ala Glu Asp Val Leu His Asp Met Gly Val Ile Ala Met Thr Ser Ser Asp Ser Gln Ala Me°t Gly Arg Ala Gly Glu Val Ile Pro Arg Thr Trp Gln Thr Ala Asp Lys Asn Lys Lys Glu Phe Gly Lys Leu Pro Glu Asp Gly Lys Asp Asn Aap Asn Phe Arg Ile Lys Arg Tyr Ile Ser Lys Tyr Thr Ile Asn Pro A:La Leu Thr His Gly Val Ser Glu Tyr Ile GIy Ser Val Glu Glu Gly Lys Ile Ala Asp Leu Val Val Trp Asn Pro Ala Phe Phe Gly VaI Lys Pogo Lys Ile Val Ile Lys Gly Gly Met Val Val Phe Ser Glu Met Gly Asp Ser Asn Ala Ser Val Pro Thr Pro Gln Pro Val Tyr Tyr Arg Glu Met Phe Gly His His Gly Lys Ala Lys Phe Asp Thr Ser Ile Thr Phe Val Ser Lys Val Ala Tyr Glu Asn Gly Val Lys Glu Lys Leu Gly Leu Glu Arg Gln Val Leu Pro VaI Lys Asn Cys Arg Asn Ile Thr Lys Lys Asp Phe Lys Phe Asn Asp Lys Thr Ala Lys Ile Thr Val Asp Pro Lys Thr Phe Glu Val Phe Val Asp Gly Lys Leu Cys Thr Ser Lys Pro Thr Ser Gln Val Pro Leu Ala Gln Arg Tyr Thr Phe Phe (2) INFORMATION FOR SEQ ID NO.: 4:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 2405 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter felis (ix) FEATURE
(A) NAME/KEY: CDS
(B) LOCATION: (1)..(681) (ix) FEATURE
(A) NAME/KEY: CDS
(B) LOCATION: (692)..(2398) (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 4:
Val Lys Leu Thr Pro Lys Glu Gln Glu Lys Phe Leu Leu Tyr Tyr Ala Gly Glu Val Ala Arg Lys Arg Lys Ala Glu Gly Leu Lys Leu Asn Gln CCC GAA GCC ATT GCC TAC ATT AGT GCC CAT ATT ATG G.AC GAG GCG CGC 144 Pro Glu Ala Ile Ala Tyr Ile Ser Ala His Ile Met Asp Glu Ala Arg Arg Gly Lys Lys Thr Val Ala Glu Leu Met Glu Glu Cys Met His Phe TTG AAA AAA GAT GAG GTG ATG CCC GGT GTG GGG AAT A'rG GTC CCT GAT 240 Leu Lys Lys Asp Glu Val Met Pro Gly Val Gly Asn Met Val Pro Asp TTG GGC GTA GAA GCC ACT TTC CCC GAT GGC ACC AAA C'TC GTA ACC GTG 288 Leu Gly Val Glu Ala Thr Phe Pro Asp Gly Thr Lys Leu Val Thr Val AAT TGG CCC ATT GAA CCT GAT GAA CAC TTT AAA GCC G'~GT GAA GTG AAA 336 Asn Trp Pro Ile Glu Pro Asp Glu His Phe Lys Ala Gly Glu Val Lys TTT GGC TGT GAT AAA GAC ATT GAG CTC AAC GCG GGT A,AG GAA GTT ACC 384 Phe Gly Cys Asp Lys Asp Ile Glu Leu Asn Ala Gly L~ys Glu Val Thr GAG CTT GAA GTT ACC AAC GAA GGA CCT AAA TCC TTG C'AT GTG GGT AGC 432 Glu Leu Glu Val Thr Asn Glu Gly Pro Lys Ser Leu F3:is Val Gly Ser His Phe His Phe Phe Glu Thr Asn Lys Ala Leu Lys F'he Asp Arg Glu AAA GCC TAT GGC AAA CGC CTA GAT ATT CCC TCT GGC pAC ACG CTA CGC 528 Lys Ala Tyr Gly Lys Arg Leu Asp Ile Pro Ser Gly A.sn Thr Leu Arg ATT GGG GCA GGA CAA ACC CGT AAA GTG CAG TTA ATC C'CT CTT GGC GGT 576 Ile Gly Ala GIy Gln Thr Arg Lys Val Gln Leu Ile Fro Leu Gly Gly AGT AAA AAA GTG ATT GGC ATG AAC GGG CTT GTG AAT A,AT ATT GCG GAC 624 Ser Lys Lys Val Ile Gly Met Asn Gly Leu Val Asn A.sn Ile Ala Asp Glu Arg His Lys His Lys Ala Leu Asp Lys Ala Lys Ser His Gly Phe ATC AAG TAA GGAGACTCCC ATG AAA ATG AAA AAA CAA GA.G TAT GTA AAC 721 Ile Lys Met Lys Met Lys Lys Gln Glu Tyr Val Asn ACC TAC GGA CCC ACC ACA GGC GAT AAA GTG CGC TTA G'~GA GAT ACC GAT 769 Thr Tyr Gly Pro Thr Thr Gly Asp Lys Val Arg Leu G!ly Asp Thr Asp Leu Trp Ala Glu Val Glu His Asp Tyr Thr Thr Tyr Gly Glu Glu Leu Lys Phe Gly Ala Gly Lys Thr Ile Arg Glu Gly Met Gly Gln Ser Asn Ser Pro Asp Glu Asn Thr Leu Asp Leu Val Ile Thr A.sn Ala Met Ile Ile Asp Tyr Thr Gly Ile Tyr Lys Ala Asp Ile Gly Ile Lys Asn Gly a a t P
Lys Ile His Gly Ile Gly Lys Ala Gly Asn Lys Asp Met Gln Asp Gly Val Ser Pro His Met Val Val Gly Val Gly Thr Glu Ala Leu Ala Gly Glu Gly Met Ile Ile Thr Ala Gly Gly Ile Asp Ser H:is Thr His Phe Leu Ser Pro Gln Gln Phe Pro Thr Ala Leu Ala Asn Gly Val Thr Thr Met Phe Gly Gly Gly Thr Gly Pro Val Asp Gly Thr A.sn Ala Thr Thr Ile Thr Pro Gly Lys Trp Asn Leu His Arg Met Leu A.rg Ala Ala Glu Glu Tyr Ser Met Asn Val Gly Phe Leu Gly Lys Gly A.sn Ser Ser Ser AAA AAA CAA CTC GTA GAA CAA GTA GAA GCG GGC GCG A.TT GGC TTT AAA 1345 Lys Lys Gln Leu Val Glu Gln Val Glu Ala Gly Ala Ile Gly Phe Lys Leu His Glu Asp Trp Gly Thr Thr Pro Ser Ala Ile A.sp His Cys Leu Ser Val Ala Asp Glu Tyr Asp Val Gln Val Cys Ile His Thr Asp Thr Val Asn Glu Ala Gly Tyr Val Asp Asp Thr Leu Asn Ala Met Asn Gly Arg Ala Ile His Ala Tyr His Ile Glu Gly Ala Gly Gly Gly His Ser Pro Asp Val Ile Thr Met Ala Gly Glu Leu Asn Ile Leu Pro Ser Ser Thr Thr Pro Thr Ile Pro Tyr Thr Ile Asn Thr Val Ala Glu His Leu Asp Met Leu Met Thr Cys His His Leu Asp Lys Arg Ile Arg Glu Asp CTC CAG TTT TCC CAA AGC CGT ATC CGC CCC GGC TCT A'L'T GCC GCT GAA 1729 Leu Gln Phe Ser Gln Ser Arg Ile Arg Pro Gly Ser I:Le Ala AIa Glu 560 565 5'70 GAT GTG CTC CAT GAT ATT GGC GTG ATC GCG ATG ACA A(3C TCG GAT TCG 1777 Asp Val Leu His Asp Ile Gly Val Ile Ala Met Thr Ser Ser Asp Ser 10 Gln Ala Met Gly Arg Ala Gly Glu Val Ile Pro Arg Tllr Trp Gln Thr GCA GAC AAG AAT AAA AAA GAA TTT GGT AAG CTT CCT Gi'~A GAT GGT GCA 1873 AIa Asp Lys Asn Lys Lys Glu Phe Gly Lys Leu Pro G:Lu Asp Gly Ala GAT AAT GAC AAC TTC CGC ATC AAA CGC TAT ATC TCC Ai'~A TAC ACC ATT 1921 Asp Asn Asp Asn Phe Arg Ile Lys Arg Tyr Ile Ser L~~s Tyr Thr Ile Asn Pro Ala Leu Thr His Gly Val Ser Glu Tyr Ile G.Ly Ser Val Glu Glu Gly Lys Ile Ala Asp Leu Val Val Trp Asn Pro A:La Phe Phe Gly GTA AAA CCC AAA ATC GTG ATC AAA GGC GGT ATG GTG G'rG TTC TCT GAA 2065 Val Lys Pro Lys Ile Val Ile Lys Gly Gly Met Val VaI Phe Ser Glu Met Gly Asp Ser Asn Ala Ser Val Pro Thr Pro Gln P:ro Val Tyr Tyr CGC GAA ATG TTT GGG CAT CAC GGC AAG GCG AAA TTT Gi3C ACC AGC ATC 2161 Arg Glu Met Phe Gly His His Gly Lys Ala Lys Phe Asp Thr Ser Ile Thr Phe Val Ser Lys Val Ala Tyr Glu Asn Gly Val Lys Glu Lys Leu 720 725 7:30 GGC TTA GAG CGC AAG GTG CTA CCC GTG AAA AAC TGC C(3C AAC ATC ACT 2257 Gly Leu Glu Arg Lys Val Leu Pro Val Lys Asn Cys Arg Asn Ile Thr AAG AAA GAC TTC AAA TTC AAC AAC AAG ACG GCG CAT A'.CC ACT GTC GAT 2305 Lys Lys Asp Phe Lys Phe Asn Asn Lys Thr Ala His I:Le Thr Val Asp CCT AAA ACC TTC GAG GTC TTT GTA GAT GGC AAA CTC T(3C ACC TCT AAA 2353 Pro Lys Thr Phe Glu Val Phe Val Asp Gly Lys Leu C;rs Thr Ser Lys Pro Ala Ser Glu Val Pro Leu Ala Gln Arg Tyr Thr Phe Phe (2) INFORMATION FOR SEQ ID NO.: 5:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 226 (B) TYPE: amino acid (C} STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter felis (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 5:
Val Lys Leu Thr Pro Lys Glu Gln Glu Lys Phe Leu Leu Tyr Tyr Ala Gly Glu Val Ala Arg Lys Arg Lys Ala Glu Gly Leu Lys Leu Asn Gln Pro Glu Ala Ile Ala Tyr Ile Ser Ala His Ile Met A.sp Glu Ala Arg Arg Gly Lys Lys Thr Val Ala Glu Leu Met Glu Glu Cys Met His Phe Leu Lys Lys Asp Glu Val Met Pro Gly Val Gly Asn Met Val Pro Asp Leu G1y Val Glu Ala Thr Phe Pro Asp Gly Thr Lys Leu Val Thr Val Asn Trp Pro Ile Glu Pro Asp Glu His Phe Lys Ala Gly Glu Val Lys Phe Gly Cys Asp Lys Asp Ile Glu Leu Asn Ala Gly Lys Glu Val Thr Glu Leu Glu Val Thr Asn Glu Gly Pro Lys Ser Leu His Val Gly Ser His Phe His Phe Phe Glu Thr Asn Lys Ala Leu Lys Phe Asp Arg Glu Lys Ala Tyr Gly Lys Arg Leu Asp Ile Pro Ser Gly Asn Thr Leu Arg Ile Gly Ala Gly Gln Thr Arg Lys Val Gln Leu Ile Pro Leu Gly Gly Ser Lys Lys Val Ile Gly Met Asn Gly Leu Val Asn Asn Ile Ala Asp Glu Arg His Lys His Lys Ala Leu Asp Lys Ala Lys Ser His Gly Phe Ile Lys (2) INFORMATION FOR SEQ ID NO.: 6:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 568 (B) TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter felis (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 6:
Met Lys Met Lys Lys Gln Glu Tyr Val Asn Thr Tyr Gly Pro Thr Thr GIy Asp Lys Val Arg Leu Gly Asp Thr Asp Leu Trp A.la Glu Val Glu His Asp Tyr Thr Thr Tyr Gly Glu Glu Leu Lys Phe Gly Ala Gly Lys Thr Ile Arg Glu Gly Met Gly Gln Ser Asn Ser Pro A.sp Glu Asn Thr Leu Asp Leu Val Ile Thr Asn Ala Met Ile Ile Asp T'yr Thr Gly Ile Tyr Lys Ala Asp Ile Gly Ile Lys Asn Gly Lys Ile H:is Gly Ile Gly Lys AIa Gly Asn Lys Asp Met Gln Asp Gly Val Ser Pro His Met Val Val Gly Val Gly Thr Glu Ala Leu Ala Gly Glu Gly Met Ile Ile Thr Ala Gly Gly Ile Asp Ser His Thr His Phe Leu Ser Pro Gln Gln Phe Pro Thr Ala Leu Ala Asn Gly Val Thr Thr Met Phe G'~ly Gly Gly Thr Gly Pro Val Asp Gly Thr Asn Ala Thr Thr Ile Thr Pro Gly Lys Trp Asn Leu His Arg Met Leu Arg Ala Ala Glu GIu Tyr Ser Met Asn Val Gly Phe Leu Gly Lys Gly Asn Ser Ser Ser Lys Lys Gln Leu Val Glu Gln Val Glu Ala Gly Ala Ile Gly Phe Lys Leu His Glu Asp Trp Gly Thr Thr Pro Ser Ala Ile Asp His Cys Leu Ser Val A.la Asp Glu Tyr Asp VaI Gln Val Cys Ile His Thr Asp Thr Val Asn Glu Ala Gly Tyr Val Asp Asp Thr Leu Asn Ala Met Asn Gly Arg Ala Ile His Ala Tyr 260 265 2?0 His Ile Glu Gly Ala Gly Gly Gly His Ser Pro Asp Val Ile Thr Met Ala Gly Glu Leu Asn Ile Leu Pro Ser Ser Thr Thr Pro Thr Ile Pro Tyr Thr IIe Asn Thr Val Ala Glu His Leu Asp Met Leu Met Thr Cys His His Leu Asp Lys Arg Ile Arg Glu Asp Leu Gln Phe Ser Gln Ser Arg Ile Arg Pro Gly Ser Ile Ala Ala Glu Asp Val Leu His Asp Ile Gly Val Ile Ala Met Thr Ser Ser Asp Ser Gln Ala Met Gly Arg Ala Gly Glu Val Ile Pro Arg Thr Trp Gln Thr Ala Asp Lys Asn Lys Lys Glu Phe Gly Lys Leu Pro Glu Asp Gly Ala Asp Asn A.sp Asn Phe Arg Ile Lys Arg Tyr Ile Ser Lys Tyr Thr Ile Asn Pro A.la Leu Thr His Gly Val Ser Glu Tyr IIe Gly Ser Val Glu Glu Gly Lays Ile Ala Asp Leu Val Val Trp Asn Pro Ala Phe Phe Gly Val Lys Pro Lys Ile Val Ile Lys Gly Gly Met Val Val Phe Ser Glu Met Gly A.sp Ser Asn Ala Ser Val Pro Thr Pro Gln Pro Val Tyr Tyr Arg Glu Miet Phe Gly His His Gly Lys Ala Lys Phe Asp Thr Ser Ile Thr Phe V'al Ser Lys Val Ala Tyr Glu Asn Gly Val Lys Glu Lys Leu Gly Leu Glu Arg Lys Val Leu Pro Val Lys Asn Cys Arg Asn Ile Thr Lys Lys A.sp Phe Lys Phe Asn Asn Lys Thr Ala His Ile Thr Val Asp Pro Lys Thr Phe Glu Val Phe Val Asp Gly Lys Leu Cys Thr Ser Lys Pro Ala Ser Glu Val Pro Leu Ala Gln Arg Tyr Thr Phe Phe (2) INFORMATION FOR SEQ ID NO.: 7:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 2183 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter felis (ix) FEATURE
(A) NAME/KEY: CDS
(B) LOCATION: (3)..(683) (ix} FEATURE
(A} NAME/KEY: CDS
(B) LOCATION: (694)..(2181) (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 7:
Val Lys Leu Thr Pro Lys Glu Gln Glu Lys Phe Leu Leu Tyr Tyr Ala Gly Glu Val Ala Arg Lys Arg Lys Ala Glu Gly Leu Lys Leu Asn Gln Pro Glu Ala Ile Ala Tyr Ile Ser Ala His Ile Met Asp Glu Ala CGC CGT GGC AAA AAA ACC GTT GCT GAA CTT ATG GAA G.AA TGT ATG CAC 191 Arg Arg Gly Lys Lys Thr Val AIa Glu Leu Met Glu Glu Cys Met His Phe Leu Lys Lys Asp Glu Val Met Pro Gly Val Gly Asn Met Val Pro Asp Leu Gly Val Glu Ala Thr Phe Pro Asp Gly Thr Lys Leu Val Thr Val Asn Trp Pro Ile Glu Pro Asp Glu His Phe Lys Ala Gly Glu Val 50 Lys Phe Gly Cys Asp Lys Asp IIe Glu Leu Asn Val Gly Lys Glu Val Thr Glu Leu Glu Val Thr Asn Glu Gly Pro Lys Ser Leu His Val Gly Ser His Phe His Phe Phe Glu Thr Asn Lys Ala Leu Lys Phe Asp Arg Glu Lys Ala Tyr Gly Lys Arg Leu Asp IIe Pro Ser G:Ly Asn Thr Leu CGC ATT GGG GCA GGA CAA ACC CGT AAA GTG CAG TTA A'rC CCT CTT GGC 575 Arg Ile Gly Ala Gly Gln Thr Arg Lys Val Gln Leu I:Le Pro Leu Gly 10 Gly Ser Lys Lys Val Ile Gly Met Asn Gly Leu Val Asn Asn IIe Ala GAC GAA CGC CAT AAA CAC AAA GCA CTA GAC AAG GCA Ai'~A TCT CAC GGA 671 Asp Glu Arg His Lys His Lys Ala Leu Asp Lys Ala Lys Ser His Gly 210 215 2:~0 TTC ATC AAG TAA GGAGACTCCC ATG AAA ATG AAA AAA CAi'~ GAG TAT GTA 720 Phe Ile Lys Met Lys Met Lys Lys Gln Glu Tyr Val AAC ACC TAC GGA CCC ACC ACA GGC GAT AAA GTG CGC T'rA GGA GAT ACC 768 Asn Thr Tyr Gly Pro Thr Thr Gly Asp Lys Val Arg Leu Gly Asp Thr Asp Leu Trp Ala Glu Val Glu His Asp Tyr Thr Thr Tyr Gly Glu Glu CTC AAA TTT GGC GCG GGT AAA ACT ATC CGT GAG GGT A'rG GGT CAG AGC 864 Leu Lys Phe Gly Ala Gly Lys Thr Ile Arg Glu Gly Met Gly Gln Ser Asn Ser Pro Asp Glu Asn Thr Leu Asp Leu Val Ile Tlhr Asn Ala Met Ile Ile Asp Tyr Thr Gly Ile Tyr Lys Ala Asp Ile G:ly Ile Lys Asn Gly Lys Ile His Gly Ile Gly Lys Ala Gly Asn Lys A;sp Met Gln Asp Gly Val Ser Pro His Met Val Val Gly Val Gly Thr G:Lu Ala Leu Ala Gly Glu Gly Met Ile Ile Thr Ala Gly Gly Ile Asp Se>r His Thr His TTC CTC TCT CCC CAA CAA TTC CCT ACC GCT CTA GCC Ai'~T GGT GTT ACA 1152 Phe Leu Ser Pro Gln Gln Phe Fro Thr Ala Leu Ala A:an Gly Val Thr Thr Met Phe Gly Gly Gly Thr GIy Pro Val Asp Gly Tlzr Asn Ala Thr Thr Ile Thr Pro Gly Lys Trp Asn Leu His Arg Met Leu Arg Ala Ala Glu GIu Tyr Ser Met Asn Val Gly Phe Leu Gly Lys Gly Asn Ser Ser Ser Lys Lys Gln Leu Val Glu Gln Val Glu Ala Gly Ala Ile Gly Phe Lys Leu His Glu Asp Trp Gly Thr Thr Pro Ser Ala Ile Asp His Cys Leu Ser Val Ala Asp Glu Tyr Asp Val Gln Val Cys Ile His Thr Asp Thr Val Asn Glu Ala Gly Tyr Val Asp Asp Thr Leu Asn Ala Met Asn GIy Arg Ala Ile His Ala Tyr His Ile Glu Gly Ala Gly Gly Gly His TCA CCT GAT GTT ATC ACC ATG GCA GGC GAG CTC AAT A.TT CTA CCC TCC 1584 Ser Pro Asp Val Ile Thr Met Ala Gly Glu Leu Asn Ile Leu Pro Ser Ser Thr Thr Pro Thr Ile Pro Tyr Thr Ile Asn Thr Val Ala Glu His Leu Asp Met Leu Met Thr Cys His His Leu Asp Lys A.rg Ile Arg Glu Asp Leu Gln Phe Ser Gln Ser Arg Ile Arg Pro Gly Ser Ile Ala Ala Glu Asp Val Leu His Asp Ile Gly Val Ile Ala Met Thr Ser Ser Asp Ser Gln Ala Met Gly Arg Ala Gly Glu Val Ile Pro Arg Thr Trp Gln Thr Ala Asp Lys Asn Lys Lys Glu Phe Gly Lys Leu Pro Glu Asp Gly Ala Asp Asn Asp Asn Phe Arg Ile Lys Arg Tyr Ile Ser Lys Tyr Thr ATT AAT CCC GCT TTG ACC CAT GGC GTG AGC GAG TAT A'TC GGC TCT GTG 1968 Ile Asn Pro Ala Leu Thr His Gly Val Ser Glu Tyr Ile Gly Ser Val Glu Glu Gly Lys Ile Ala Asp Leu Val Val Trp Asn Pro Ala Phe Phe GGC GTG AAA CCT AAG ATT GTG ATT AAA GGT GGC ATG G'TG GTC TTC TCT 2064 Gly Val Lys Pro Lys Ile Val Ile Lys Gly Gly Met Val Val Phe Ser GAA ATG GGC GAT TCT AAC GCG TCC GTG CCC ACG CCT C.AG CCG GTT TAT 2112 Glu Met Gly Asp Ser Asn Ala Ser Val Pro Thr Pro Gln Pro Val Tyr Tyr Arg Glu Met Phe Gly His His Gly Lys Ala Lys Phe Asp Thr Ser Ile Thr Phe Arg Val Ser Ser (2) INFORMATION FOR SEQ ID NO.: 8:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 226 (B) TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter felis (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 8:
Val Lys Leu Thr Pro Lys Glu Gln Glu Lys Phe Leu Leu Tyr Tyr Ala Gly Glu Val Ala Arg Lys Arg Lys Ala Glu Gly Leu Lys Leu Asn Gln Pro Glu Ala Ile Ala Tyr Ile Ser Ala His Ile Met Asp Glu Ala Arg Arg Gly Lys Lys Thr Val Ala Glu Leu Met Glu Glu Cys Met His Phe Leu Lys Lys Asp Glu Val Met Pro Gly Val Gly Asn Met Val Pro Asp Leu Gly Val Glu Ala Thr Phe Pro Asp Gly Thr Lys Leu Val Thr Val Asn Trp Pro Ile Glu Pro Asp Glu His Phe Lys Ala Gly Glu Val Lys Phe Gly Cys Asp Lys Asp Ile Glu Leu Asn Val Gly Lys Glu Val Thr Glu Leu Glu Val Thr Asn Glu Gly Pro Lys Ser Leu His Val Gly Ser His Phe His Phe Phe Glu Thr Asn Lys Ala Leu Lys Phe Asp Arg Glu Lys Ala Tyr Gly Lys Arg Leu Asp Ile Pro Ser Gly Asn Thr Leu Arg Ile Gly Ala Gly Gln Thr Arg Lys Val GIn Leu Ile Pro Leu Gly Gly Ser Lys Lys Val Ile Gly Met Asn Gly Leu Val Asn Asn Ile Ala Asp Glu Arg His Lys His Lys Ala Leu Asp Lys Ala Lys Se r His Gly Phe Ile Lys (2) INFORMATION FOR SEQ ID NO.: 9:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 496 (B) TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter fells (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 9:
Met Lys Met Lys Lys Gln Glu Tyr Val Asn Thr Tyr Gly Pro Thr Thr Gly Asp Lys Val Arg Leu Gly Asp Thr Asp Leu Trp A7_a Glu Val Glu His Asp Tyr Thr Thr Tyr Gly Glu Glu Leu Lys Phe Gly Ala Gly Lys 35 40 9':5 Thr Ile Arg Glu Gly Met Gly Gln Ser Asn Ser Pro A:;p Glu Asn Thr Leu Asp Leu Val Ile Thr Asn Ala Met Ile Ile Asp T~~r Thr Gly Ile Tyr Lys Ala Asp Ile Gly Ile Lys Asn Gly Lys Ile His Gly Ile Gly Lys Ala Gly Asn Lys Asp Met Gln Asp Gly Val Ser Pro His Met Val Val Gly Val Gly Thr Glu Ala Leu Ala Gly Glu Gly Met Ile Ile Thr Ala Gly Gly Ile Asp Ser His Thr His Phe Leu Ser Pro Gln Gln Phe Pro Thr Ala Leu Ala Asn Gly Val Thr Thr Met Phe Gly Gly Gly Thr Gly Pro Val Asp Gly Thr Asn Ala Thr Thr Ile Thr Pro Gly Lys Trp Asn Leu His Arg Met Leu Arg Ala Ala Glu Glu Tyr Ser Met Asn Val Gly Phe Leu Gly Lys Gly Asn Ser Ser Ser Lys Lys G.ln Leu Val Glu Gln Val Glu Ala Gly Ala Ile Gly Phe Lys Leu His G.lu Asp Trp Gly Thr Thr Pro Ser Ala Ile Asp His Cys Leu Ser Val A:La Asp Glu Tyr Asp Val Gln Val Cys Ile His Thr Asp Thr Val Asn G.Lu Ala Gly Tyr Val Asp Asp Thr Leu Asn Ala Met Asn Gly Arg Ala Ile His Ala Tyr His Ile Glu Gly Ala Gly Gly Gly His Ser Pro Asp VaI Ile Thr Met Ala Gly Glu Leu Asn Ile Leu Pro Ser Ser Thr Thr Pro Thr Ile Pro Tyr Thr Ile Asn Thr Val Ala Glu His Leu Asp Met Le:u Met Thr Cys His His Leu Asp Lys Arg Ile Arg Glu Asp Leu Gln Phe Ser Gln Ser Arg Ile Arg Pro Gly Ser Ile Ala Ala Glu Asp Val Le:u His Asp Ile Gly Val Ile Ala Met Thr Ser Ser Asp Ser Gln Ala Mea Gly Arg Ala 355 360 36.5 Gly Glu Val Ile Pro Arg Thr Trp Gln Thr Ala Asp Lys Asn Lys Lys Glu Phe Gly Lys Leu Pro Glu Asp Gly Ala Asp Asn Asp Asn Phe Arg Ile Lys Arg Tyr Ile Ser Lys Tyr Thr Ile Asn Pro Ala Leu Thr His Gly Val Ser Glu Tyr Ile Gly Ser Val Glu Glu Gly Lys Ile AIa Asp Leu Val Val Trp Asn Pro Ala Phe Phe Gly Val Lys Pro Lys Ile Val Ile Lys Gly Gly Met Val Val Phe Ser Glu Met Gly As:p Ser Asn Ala A
Ser Val Pro Thr Pro Gln Pro Val Tyr Tyr Arg Glu Met Phe Gly His His Gly Lys Ala Lys Phe Asp Thr Ser Ile Thr Phe Arg Val Ser Ser (2) INFORMATION FOR SEQ ID NO.: 10:
(i) SEQUENCE CHARACTERISTICS
10 (A) LENGTH: 2407 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter felis (ix) FEATURE
(A) NAME/KEY: CDS
(B) LOCATION: (2)..(682) 20 (ix) FEATURE
(A) NAME/KEY: CDS
(B) LOCATION: (693)..(2399) (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 10:
Val Lys Leu Thr Pro Lys Glu Gln Glu Lys Phe Leu Leu Tyr Tyr Ala Gly Glu Val Ala Arg Lys Arg Lys Ala Glu Gly Leu Lys Leu Asn Gln CCC GAA GCC ATT GCC TAC ATT AGT GCC CAT ATT ATG G.AC GAG GCG CGC 145 Pro Glu Ala Ile Ala Tyr IIe Ser Ala His Ile Met Asp Glu Ala Arg 35 40 ~45 Arg Gly Lys Lys Thr Val Ala Glu Leu Met Glu Glu Cys Met His Phe TTG AAA AAA GAC GAG GTG ATG CCC GGT GTG GGG AAT A'rG GTC CCT GAT 241 Leu Lys Lys Asp Glu Val Met Pro Gly Val Gly Asn Ms~t Val Pro Asp TTA GGC GTG GAA GCT ACT TTT CCC GAT GGC ACC AAA C'CC GTA ACC GTG 289 Leu Gly Val Glu Ala Thr Phe Pro Asp Gly Thr Lys Leu Val Thr Val AAT TGG CCC ATC GAA CCC GAT GAA CAC TTC AAA GCG GCiC GAA GTC AAA 337 Asn Trp Pro Ile Glu Pro Asp Glu His Phe Lys Ala G_Ly Glu Val Lys Phe Gly Cys Asp Lys Asp Ile Glu Leu Asn Ala Gly Lys Glu Val Thr 115 120 1~!5 Glu Leu Glu Val Thr Asn Glu Gly Pro Lys Ser Leu Hi.s Val Gly Ser His Phe His Phe Phe Glu Ala Asn Lys Ala Leu Lys Phe Asp Arg Glu AAA GCC TAT GGC AAA CGC CTA GAT ATT CCC TCT GGC P,AC ACG CTA CGC 529 Lys Ala Tyr Gly Lys Arg Leu Asp Ile Pro Ser Gly F,sn Thr Leu Arg ATT GGG GCA GGA CAA ACC CGT AAA GTG CAG TTA ATC C'CT CTT GGC GGC 577 Ile Gly Ala Gly Gln Thr Arg Lys Val Gln Leu Ile Fro Leu Gly Gly Ser Lys Lys Val Ile Gly Met Asn Gly Leu Val Asn A.sn Ile AIa Asp Glu Arg His Lys His Lys Ala Leu Glu Lys Ala Lys Ser His Gly Phe Ile Lys Met Lys Met Lys Lys Gln Glu Tyr Val Asn Thr Tyr Gly Pro Thr Thr Gly Asp Lys Val Arg Leu Gly Asp Thr Asp Leu Trp Ala Glu Val Glu His Asp Tyr Thr Thr Tyr Gly Glu Glu Leu Lys Phe Gly Ala Gly Lys Thr Ile Arg Glu Gly Met G:Iy Gln Ser Asn AGT CCA GAT GAA AAC ACC CTA GAT TTA GTC.' ATC ACC AAC GCG ATG ATT 914 Ser Pro Asp Glu Asn Thr Leu Asp Leu Val Ile Thr Aan Ala Met Ile ATT GAC TAC ACC GGG ATT TAC AAA GCC GAC ATT GGC A'CT AAA AAT GGC 962 Ile Asp Tyr Thr Gly Ile Tyr Lys Ala Asp Ile Gly IIe Lys Asn GIy AAA ATC CAT GGC ATT GGC AAG GCA GGA AAC AAG GAC A':L'G CAA GAT GGC 1010 Lys Ile His Gly Ile Gly Lys Ala Gly Asn Lys Asp Met Gln Asp Gly Val Ser Pro His Met Val Val Gly Val Gly Thr GIu Ala Leu Ala Gly Glu Gly Met Ile Ile Thr Ala Gly Gly Ile Asp Ser Hi.s Thr His Phe Leu Ser Pro Gln Gln Phe Pro Thr Ala Leu Ala Asn Gly Val Thr Thr m s ATG TTT GGC GGT GGC ACA GGT CCG GTA GAT GGC ACG A.AT GCG ACT ACC 1202 Met Phe Gly Gly Gly Thr GIy Pro Val Asp Gly Thr A.sn Ala Thr Thr Ile Thr Pro Gly Lys Trp Asn Leu His Arg Met Leu A.rg Ala Ala Glu Glu Tyr Ser Met Asn Val Gly Phe Leu Gly Lys Gly Asn Ser Ser Ser Lys Lys Gln Leu Val Glu Gln Ile G1u Ala Gly Ala Ile Gly Phe Lys TTG CAT GAA GAC TGG GGC ACA ACT CCA AGT GCA ATC G.AT CAC TGC TTG 1394 Leu His Glu Asp Trp Gly Thr Thr Pro Ser Ala Ile Asp His Cys Leu AGC GTA GCA GAT GAA TAC GAT GTG CAA GTT TGT ATC C:~C ACC GAT ACG 1442 Ser Val Ala Asp Glu Tyr Asp Val Gln Val Cys Ile H.is Thr Asp Thr Val Asn Glu Ala Gly Tyr Val Asp Asp Thr Leu Asn A:La Met Asn GIy 480 485 4'.a0 CGC GCC ATC CAT GCC TAC CAC ATT GAG GGA GCG GGC GczA GGA CAC TCA 1538 Arg Ala Ile His Ala Tyr His IIe Glu Gly Ala Gly GIy Gly His Ser CCT GAT GTT ATC ACC ATG GCA GGC GAG CTC AAT ATT C'.CA CCC TCC TCC 1586 Pro Asp Val Ile Thr Met Ala Gly Glu Leu Asn Ile Le:u Pro Ser Ser Thr Thr Pro Thr Ile Pro Tyr Thr Ile Asn Thr Val Ala Glu His Leu Asp Met Leu Met Thr Cys His His Leu Asp Lys Arg Il.e Arg Glu Asp Leu Gln Phe Ser Gln Ser Arg Ile Arg Pro Gly Ser ILe AIa Ala Glu Asp VaI Leu His Asp Ile Gly Val Ile Ala Met Thr Ser Ser Asp Ser Gln Ala Met GIy Arg Ala Gly Glu Val Ile Pro Arg Thr Trp Gln Thr Ala Asp Lys Asn Lys Lys Glu Phe Gly Lys Leu Pro Glu Asp Ser Ala Asp Asn Asp Asn Phe Arg Ile Lys Arg Tyr Ile Ser Lys Tyr Thr Ile Asn Pro Ala Leu Thr His Gly Val Ser Glu Tyr Ile Gly Ser Val Glu 640 645 6.50 Glu Gly Lys Ile Ala Asp Leu Val Val Trp Asn Pro A.la Phe Phe Gly Val Lys Pro Lys Ile Val Ile Lys Gly Gly Met Val Val Phe Ser Glu Met Gly Asp Ser Asn Ala Ser VaI Pro Thr Pro Gln Pro Val Tyr Tyr CGC GAA ATG TTT GGG CAT CAC GGC AAG GCG AAA TTT G.AC ACC AGC ATC 2162 Arg Glu Met Phe Gly His His Gly Lys Ala Lys Phe Asp Thr Ser Ile Thr Phe Val Ser Lys Val Ala Tyr Glu Asn Gly Val Lays Glu Lys Leu 720 725 7:30 Gly Leu Glu Arg Lys Val Leu Pro Val Lys Asn Cys A:rg Asn Ile Thr AAG AAA GAC TTC AAA TTC AAC AAC AAG ACG GCG CAT A'CC ACT GTC GAT 2306 Lys Lys Asp Phe Lys Phe Asn Asn Lys Thr Ala His Ile Thr Val Asp Pro Lys Thr Phe Glu Val Phe Val Asp Gly Lys Leu Cys Thr Ser Lys CCC GCC TCT GAA GTG CCT CTA GCC CAG CGC TAC ACT T7.'C TTC TAG 2399 Pro Ala Ser Glu Val Pro Leu Ala Gln Arg Tyr Thr Phe Phe (2) INFORMATION FOR SEQ ID NO.: 11:
(i) SEQUENCE CHARACTERISTICS
(A} LENGTH: 226 (B) TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi} ORIGINAL SOURCE:
(A} ORGANISM: Helicobacter fells (xi} SEQUENCE DESCRIPTION: SEQ ID NO.: 11:
Val Lys Leu Thr Pro Lys GIu Gln Glu Lys Phe Leu Leu Tyr Tyr Ala Gly Glu Val Ala Arg Lys Arg Lys Ala Glu Gly Leu Lys Leu Asn Gln Pro Glu Ala Ile Ala Tyr Ile Ser Ala His Ile Met Asp Glu Ala Arg Arg Gly Lys Lys Thr Val Ala Glu Leu Met Glu Glu C".ys Met His Phe Leu Lys Lys Asp Glu Val Met Pro Gly Val Gly Asn Met Val Pro Asp 65 70 ?5 80 Leu Gly Val Glu Ala Thr Phe Pro Asp Gly Thr Lys Leu Val Thr Val Asn Trp Pro Ile Glu Pro Asp Glu His Phe Lys Ala Gly Glu Val Lys Phe Gly Cys Asp Lys Asp Ile Glu Leu Asn Ala Gly L~ys Glu Val Thr Glu Leu Glu Val Thr Asn Glu Gly Pro Lys Ser Leu T3:is Val Gly Ser His Phe His Phe Phe Glu Ala Asn Lys Ala Leu Lys Phe Asp Arg Glu Lys Ala Tyr GIy Lys Arg Leu Asp Ile Pro Ser Gly Asn Thr Leu Arg 165 1?0 175 Ile Gly Ala Gly Gln Thr Arg Lys Val Gln Leu Ile Pro Leu Gly GIy Ser Lys Lys VaI Ile Gly Met Asn Gly Leu Val Asn Asn Ile AIa Asp Glu Arg His Lys His Lys Ala Leu Glu Lys Ala Lys Ser His Gly Phe Ile Lys (2) INFORMATION FOR SEQ ID NO.: 12:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 568 (B} TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi) ORIGINAL SOURCE:
(A} ORGANISM: Helicobacter felis (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 12:
Met Lys Met Lys Lys Gln Glu Tyr Val Asn Thr Tyr Gly Pro Thr Thr Gly Asp Lys Val Arg Leu Gly Asp Thr Asp Leu Trp AT.a Glu Val Glu m m His Asp Tyr Thr Thr Tyr Gly Glu GIu Leu Lys Phe Gly Ala Gly Lys Thr Ile Arg Glu Gly Met Gly Gln Ser Asn Ser Pro Asp Glu Asn Thr Leu Asp Leu Val Ile Thr Asn Ala Met Ile Ile Asp 'I'yr Thr Gly Ile 10 Tyr Lys Ala Asp Ile Gly Ile Lys Asn Gly Lys Ile His Gly Ile Gly Lys Ala Gly Asn Lys Asp Met Gln Asp Gly Val Ser Fro His Met Val Val Gly Val Gly Thr Glu Ala Leu Ala Gly Glu Gly M:et Ile Ile Thr Ala Gly Gly Ile Asp Ser His Thr His Phe Leu Ser Pro Gln Gln Phe Pro Thr Ala Leu Ala Asn Gly Val Thr Thr Met Phe Gly Gly Gly Thr Gly Pro Val Asp Gly Thr Asn Ala Thr Thr Ile Thr Pro Gly Lys Trp Asn Leu His Arg Met Leu Arg Ala Ala Glu Glu Tyr Ser Met Asn Val Gly Phe Leu Gly Lys Gly Asn Ser Ser Ser Lys Lys Gln Leu Val Glu Gln Ile Glu Ala Gly Ala Ile Gly Phe Lys Leu His G:lu Asp Trp Gly Thr Thr Pro Ser Ala Ile Asp His Cys Leu Ser Val A:La Asp Glu Tyr Asp Val Gln Val Cys Ile His Thr Asp Thr Val Asn G:Lu Ala Gly Tyr Val Asp Asp Thr Leu Asn Ala Met Asn Gly Arg Ala I:Le His Ala Tyr His Ile Glu Gly Ala Gly Gly Gly His Ser Pro Asp Val Ile Thr Met Ala Gly Glu Leu Asn Ile Leu Pro Ser Ser Thr Thr Pro Thr Ile Pro Tyr Thr Ile Asn Thr Val Ala Glu His Leu Asp Met Le:u Met Thr Cys His His Leu Asp Lys Arg Ile Arg Glu Asp Leu Gln Phe Ser Gln Ser Arg Ile Arg Pro Gly Ser Ile Ala Ala Glu Asp Val Le:u His Asp Ile a Gly Val Ile Ala Met Thr Ser Ser Asp Ser Gln Ala Met Gly Arg Ala 355 360 ~t65 Gly Glu Val Ile Pro Arg Thr Trp Gln Thr Ala Asp Lys Asn Lys Lys Glu Phe Gly Lys Leu Pro Glu Asp Ser Ala Asp Asn Asp Asn Phe Arg Ile Lys Arg Tyr Ile Ser Lys Tyr Thr Ile Asn Pro F~la Leu Thr His Gly Val Ser Glu Tyr Ile Gly Ser Val Glu Glu Gly L~ys Ile Ala Asp Leu Val Val Trp Asn Pro Ala Phe Phe Gly Val Lys Fro Lys Ile Val Ile Lys Gly Gly Met Val Val Phe Ser Glu Met Gly A.sp Ser Asn Ala Ser Val Pro Thr Pro Gln Pro Val Tyr Tyr Arg Glu Met Phe Gly His 465 470 475 4gp His Gly Lys Ala Lys Phe Asp Thr Ser Ile Thr Phe Val Ser Lys Val Ala Tyr Glu Asn Gly Val Lys Glu Lys Leu Gly Leu Glu Arg Lys Val Leu Pro Val Lys Asn Cys Arg Asn Ile Thr Lys Lys Asp Phe Lys Phe Asn Asn Lys Thr Ala His Ile Thr Val Asp Pro Lys T:hr Phe Glu Val Phe Val Asp Gly Lys Leu Cys Thr Ser Lys Pro Ala Ser Glu Val Pro Leu Ala Gln Arg Tyr Thr Phe Phe (2) INFORMATION FOR SEQ ID NO.: 13:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 2452 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter felis (ix) FEATURE
(A) NAME/KEY: CDS
(B) LOCATION: (48) . . (728) (ix) FEATURE
(A) NAME/KEY: CDS
(B} LOCATION: (739}..(2445) m (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 13:
Val Lys Leu ACA CCC AAA GAG CAA GAA AAG TTC TTG TTA TAT TAT G'CG GGC GAA GTG 104 Thr Pro Lys Glu Gln Glu Lys Phe Leu Leu Tyr Tyr A.la Gly Glu Val Ala Arg Lys Arg Lys Ala Glu Gly Leu Lys Leu Asn Gln Pro Glu Ala Ile Ala Tyr Ile Ser Ala His Ile Met Asp Glu Ala Arg Arg Gly Lys AAA ACC GTT GCG GAA CTT ATG GAA GAG TGT ATG CAC T'TT TTG AAA AAA 248 Lys Thr Val Ala Glu Leu Met Glu Glu Cys Met His P:he Leu Lys Lys GAC GAG GTG ATG CCC GGG GTG GGG AAT ATG GTC CCT Gi'~T TTG GGC GTG 296 Asp Glu Val Met Pro Gly Val Gly Asn Met Val Pro A:ap Leu Gly Val GAA GCC ACT TTC CCC GAT GGC ACC AAA CTC GTA ACT G':CG AAT TGG CCC 344 Glu Ala Thr Phe Pro Asp Gly Thr Lys Leu Val Thr Val Asn Trp Pro Ile Glu Pro Asp Glu His Phe Lys Ala Gly Glu Val Lys Phe Gly Cys GAT AAA GAC ATT GAA CTC AAC GCA GGT AAG GAA GTT AC'.C GAA CTA GAA 440 Asp Lys Asp Ile Glu Leu Asn Ala Gly Lys Glu Val Thr Glu Leu Glu Val Thr Asn Glu Gly Pro Lys Ser Leu His Val Gly Se:r His Phe His Phe Phe Glu Ala Asn Lys Ala Leu Lys Phe Asp Arg Glu Lys Ala Tyr Gly Lys Arg Leu Asp Ile Pro Ser Gly Asn Thr Leu Arg Ile Gly Ala GGA CAA ACC CGT AAA GTG CAG TTA ATC CCT CTT GGC GG'T AGT AAA AAA 632 Gly Gln Thr Arg Lys Val Gln Leu Ile Pro Leu Gly Gly Ser Lys Lys Val Ile Gly Met Asn Gly Leu Val Asn Asn Ile Ala Asp Glu Arg His AAA CAC AAA GCG CTA GAC AAA GCA AAA TCT CAC GGA TT'.C ATC AAG TAA 728 Lys His Lys Ala Leu Asp Lys Ala Lys Ser His Gly Phe~ Ile Lys a c Met Lys Met Lys Lys Gln Glu Tyr Val Asn Thr Tyr Gly Pro Thr Thr Gly Asp Lys Val Arg Leu Gly Asp Thr Asp Leu Trp Ala GAA GTA GAA CAT GAC TAT ACC ACC TAT GGC GAA GAA C;TC AAA TTC GGT 873 Glu Val Glu His Asp Tyr Thr Thr Tyr Gly Glu Glu Leu Lys Phe Gly Ala Gly Lys Thr Ile Arg Glu Gly Met Gly Gln Ser P,sn Ser Pro Asp GAA AAC ACC TTA GAT TTA GTG ATC ACC AAC GCG ATG F,TT ATT GAC TAC 969 Glu Asn Thr Leu Asp Leu Val Ile Thr Asn Ala Met Ile Ile Asp Tyr Thr Gly Ile Tyr Lys Ala Asp Ile Gly Ile Lys Asn Gly Lys Ile His Gly Ile Gly Lys Ala Gly Asn Lys Asp Met Gln Asp Gly Val Ser Pro His Met Val Val Gly Val Gly Thr Glu Ala Leu Ala Gly Glu Gly Met Ile Ile Thr Ala Gly GIy Ile Asp Ser His Thr His Phe Leu Ser Pro Gln Gln Phe Pro Thr Ala Leu Ala Asn Gly Val Thr T:hr Met Phe Gly 370 375 3g0 Gly Gly Thr Gly Pro Val Asp Gly Thr Asn Ala Thr T~hr Ile Thr Pro GGC AAA TGG AAC TTG CAC CGC ATG TTG CGC GCA GCA GA.A GAG TAT TCT 1305 Gly Lys Trp Asn Leu His Arg Met Leu Arg Ala Ala G:Lu Glu Tyr Ser ATG AAT GTG GGC TTT TTG GGC AAA GGC AAT AGC TCT A<sT AAA AAA CAA 1353 Met Asn Val Gly Phe Leu Gly Lys Gly Asn Ser Ser Seer Lys Lys Gln Leu Val Glu Gln Val Glu Ala Gly Ala Ile Gly Phe Lys Leu His Glu 435 440 4~E5 Asp Trp Gly Thr Thr Pro Ser Ala Ile Asp His Cys Le:u Ser Val Ala *' s b Asp Glu Tyr Asp Val Gln Val Cys IIe His Thr Asp Thr Val Asn Glu GCA GGT TAT GTA GAT GAC ACC CTA AAT GCA ATG AAC CiGG CGC GCC ATC 1545 Ala Gly Tyr Val Asp Asp Thr Leu Asn Ala Met Asn G:ly Arg Ala Ile His Ala Tyr His Ile Glu Gly Ala Gly Gly Gly His ~~er Pro Asp Val ATC ACC ATG GCA GGC GAA GTG AAT ATT CTA CCC TCC T'CC ACA ACC CCT 1641 Ile Thr Met Ala Gly Glu Val Asn Ile Leu Pro Ser S'er Thr Thr Pro ACT ATC CCC TAT ACC ATT AAT ACG GTT GCA GAA CAC T'TA GAC ATG CTT 1689 Thr Ile Pro Tyr Thr Ile Asn Thr Val Ala Glu His Leu Asp Met Leu Met Thr Cys His His Leu Asp Lys Arg Ile Arg Glu A.sp Leu Gln Phe Ser Gln Ser Arg Ile Arg Pro Gly Ser Ile Ala Ala Glu Asp Val Leu His Asp Ile Gly Val Ile Ala Met Thr Ser Ser Asp Ser Gln Ala Met Gly Arg Ala Gly Glu Val Ile Pro Arg Thr Trp Gln T:hr Ala Asp Lys Asn Lys Lys Glu Phe Gly Lys Leu Pro Glu Asp Gly A:la Asp Asn Asp AAC TTC CGC ATC AAA CGC TAT ATC TCC AAA TAC ACC A'rT AAT CCC GCT 1977 Asn Phe Arg Ile Lys Arg Tyr Ile Ser Lys Tyr Thr I:Le Asn Pro Ala Leu Thr His Gly Val Ser Glu Tyr Ile Gly Ser Val Glu Glu Gly Lys Ile Ala Asp Leu Val Val Trp Asn Pro Ala Phe Phe Gly VaI Lys Pro Lys Ile Val Ile Lys Gly Gly Met Val Val Phe Ser G7.u Met Gly Asp Ser Asn Ala Ser Val Pro Thr Pro Gln Pro Val Tyr Tyr Arg Glu Met s Phe GIy His His Gly Lys Ala Lys Phe Asp Thr Ser I:le Thr Phe Val TCC AAA GTC GCC TAT GAA AAT GGT GTG AAA GAA AAA C.'TA GGT TTA GAG 2265 Ser Lys Val Ala Tyr Glu Asn Gly Val Lys Glu Lys I~eu Gly Leu Glu CGC AAG GTG CTC CCC GTG AAA AAC TGC CGT AAC ATC A.CC AAG AAG GAC 2313 10 Arg Lys Val Leu Pro Val Lys Asn Cys Arg Asn Ile Thr Lys Lys Asp Phe Lys Phe Asn Asp Lys Thr Ala Lys Ile Thr Val A.sp Pro Lys Thr Phe Glu Val Plie Val Asp Gly Lys Leu Cys Thr Ser Lys Pro Thr Ser Glu Val Pro Leu Ala GIn Arg Tyr Thr Phe Phe (2) INFORMATION FOR SEQ ID NO.: 14:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 226 (B) TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter felis (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 14:
Val Lys Leu Thr Pro Lys Glu Gln Glu Lys Phe Leu Lf°u Tyr Tyr AIa Gly Glu Val Ala Arg Lys Arg Lys Ala Glu GIy Leu L~,rs Leu Asn Gln 20 25 ~ 30 Pro Glu Ala Ile Ala Tyr Ile Ser Ala His Ile Met Asp Glu Ala Arg Arg Gly Lys Lys Thr Val Ala Glu Leu Met Glu Glu Cys Met His Phe Leu Lys Lys Asp Glu Val Met Pro Gly Val Gly Asn Meat Val Pro Asp Leu Gly Val GIu AIa Thr Phe Pro Asp Gly Thr Lys Le;u Val Thr Val Asn Trp Pro Ile Glu Pro Asp Glu His Fhe Lys Ala Gl.y Glu Val Lys Phe GIy Cys Asp Lys Asp IIe Glu Leu Asn Ala Gly Lys Glu VaI Thr ', at r Glu Leu Glu Val Thr Asn Glu GIy Pro Lys Ser Leu hfis Val Gly Ser His Phe His Phe Phe Glu Ala Asn Lys Ala Leu Lys F~he Asp Arg Glu Lys Ala Tyr Gly Lys Arg Leu Asp Ile Pro Ser Gly A.sn Thr Leu Arg Ile Gly Ala Gly Gln Thr Arg Lys Val Gln Leu Ile Pro Leu Gly Gly Ser Lys Lys VaI Ile Gly Met Asn Gly Leu Val Asn Asn Ile Ala Asp Glu Arg His Lys His Lys AIa Leu Asp Lys Ala Lys S~er His GIy Phe Ile Lys (2) INFORMATION FOR SEQ ID NO.: 15:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 568 (B} TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: polypeptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter felis (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 15:
Met Lys Met Lys Lys Gln Glu Tyr Val Asn Thr Tyr Gl.y Pro Thr Thr Gly Asp Lys Val Arg Leu Gly Asp Thr Asp Leu Trp Ala Glu Val Glu His Asp Tyr Thr Thr Tyr Gly Glu Glu Leu Lys Phe Gly Ala Gly Lys Thr Ile Arg Glu Gly Met GIy Gln Ser Asn Ser Pro Asp Glu Asn Thr Leu Asp Leu Val Ile Thr Asn Ala Met IIe Ile Asp Tyr Thr Gly Ile Tyr Lys AIa Asp Ile Gly Ile Lys Asn Gly Lys Ile Hiss Gly Ile Gly Lys Ala Gly Asn Lys Asp Met Gln Asp Gly Va1 Ser Pro His Met Val Val Gly Val Gly Thr Glu Ala Leu Ala Gly Glu Gly Mei: Ile Ile Thr ~ I5 12 0 12 Ei Ala Gly Gly Ile Asp Ser His Thr His Phe Leu Ser Pro Gln Gln Phe a a a.
Pro Thr Ala Leu Ala Asn Gly Val Thr Thr Met Phe Gly Gly Gly Thr Gly Pro Val Asp Gly Thr Asn Ala Thr Thr Ile Thr Pro Gly Lys Trp Asn Leu His Arg Met Leu Arg Ala Ala Glu Glu Tyr ~~er Met Asn Val Gly Phe Leu Gly Lys Gly Asn Ser Ser Ser Lys Lys Gln Leu Val Glu Gln Val Glu Ala GIy Ala Ile Gly Phe Lys Leu His Glu Asp Trp Gly Thr Thr Pro Ser Ala Ile Asp His Cys Leu Ser Val A.la Asp Glu Tyr Asp Val GIn Val Cys Ile His Thr Asp Thr VaI Asn Glu Ala Gly Tyr Val Asp Asp Thr Leu Asn AIa Met Asn Gly Arg Ala IIe His Ala Tyr His Ile Glu Gly Ala Gly Gly Gly His Ser Pro Asp Val Ile Thr Met Ala Gly Glu Val Asn Ile Leu Pro Ser Ser Thr Thr Pro Thr Ile Pro Tyr Thr Ile Asn Thr Val Ala Glu His Leu Asp Met L~eu Met Thr Cys His His Leu Asp Lys Arg Ile Arg GIu Asp Leu Gln Phe Ser Gln Ser Arg Ile Arg Pro Gly Ser Ile Ala Ala Glu Asp Val Leu His Asp Ile Gly Val Ile Ala Met Thr Ser Ser Asp Ser Gln Ala Most GIy Arg Ala 355 360 3Ei5 Gly Glu VaI Ile Pro Arg Thr Trp Gln Thr Ala Asp Lys Asn Lys Lys Glu Phe Gly Lys Leu Pro Glu Asp Gly Ala Asp Asn A:ap Asn Phe Arg Ile Lys Arg Tyr Ile Ser Lys Tyr Thr Ile Asn Pro Al.a Leu Thr His Gly Val Ser Glu Tyr Ile Gly Ser Val Glu Glu Gly L~~s Ile Ala Asp Leu Val Val Trp Asn Pro Ala Phe Phe Gly Val Lys Pro Lys Ile Val Ile Lys Gly Gly Met Val Val Phe Ser Glu Met Gly Asp Ser Asn Ala Ser Val Pro Thr Pro Gln Pro Val Tyr Tyr Arg Glu Nfet Phe Gly His His Gly Lys Ala Lys Phe Asp Thr Ser Ile Thr Phe Val Ser Lys Val Ala Tyr Glu Asn Gly Val Lys Glu Lys Leu Gly Leu Glu Arg Lys Val Leu Pro Val Lys Asn Cys Arg Asn Ile Thr Lys Lys A.sp Phe Lys Phe Asn Asp Lys Thr Ala Lys Ile Thr Val Asp Pro Lys Thr Phe Glu Val Phe Val Asp GIy Lys Leu Cys Thr Ser Lys Pro Thr Ser GIu Val Pro Leu Ala Gln Arg Tyr Thr Phe Fhe (2) INFORMATION FOR SEQ ID NO.: 16:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 21 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter felis (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 16:
(2) INFORMATION FOR SEQ ID NO.: 17:
{i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 16 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter felis (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 17:
(2) INFORMATION FOR SEQ ID NO.: 18:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 32 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter felis (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 18:
.__ _._..-._._.___m. .._..~"~..,~~,~,,~~", ~~~, ,~",. _ .~~_._-__..
(2) INFORMATION FOR SEQ ID NO.: 19:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 27 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter felis (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 19:
(2) INFORMATION FOR SEQ ID NO.: 20:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 27 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter felis (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 20:
(2) INFORMATION FOR SEQ ID NO.: 21:
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 34 (B) TYPE: nucleic acid (C) STRANDEDNESS:
(D) TOPOLOGY:
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter felis (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 21:
w
Claims (22)
1) Nucleic acid sequence encoding two subunit polypeptides of a urease complex such as expressed by Helicobacter felis, said nucleic acid sequence having at least 85%
homology with SEQ ID NO: 1, or a part thereof encoding at feast an immunogenic fragment of one of said subunits, said part having a length of at least 40, preferably 45, more preferably 50 nucleotides.
homology with SEQ ID NO: 1, or a part thereof encoding at feast an immunogenic fragment of one of said subunits, said part having a length of at least 40, preferably 45, more preferably 50 nucleotides.
2) Nucleic acid sequence according to claim 1, characterised in that it encodes the urease X subunit polypeptide or the urease Y subunit polypeptide.
3) Nucleic acid sequence according to claim 1 or 2, characterised in that the sequence has at least 90 %, preferably 94 %, more preferably 97 % homology with SEQ ID
NO: 1.
NO: 1.
4) DNA fragment comprising a nucleic acid sequence according to claims 1-3
5) Recombinant DNA molecule comprising a nucleic acid sequence according to claims 1-3 or a DNA fragment according to claim 4, under the control of a functionally linked promoter.
6) Live recombinant carrier comprising a recombinant DNA molecule according to claim
7) Host cell comprising a nucleic acid sequence according to claims 1-3, a DNA
fragment according to claim 4, a recombinant DNA molecule according to claim 5 or a live recombinant carrier according to claim 6.
fragment according to claim 4, a recombinant DNA molecule according to claim 5 or a live recombinant carrier according to claim 6.
8) Helicobacter felis urease X subunit polypeptide, said polypeptide having an amino acid sequence that is at least 85 % homologous to SEQ ID NO: 2 or an immunogenic fragment of said polypeptide with a length of at least 40, preferably 45, more preferably 50 amino acids said immunogenic fragment being capable of inducing an immune response against ureaseXY.
9) Polypeptide according to claim 8, having a sequence homology of at least 90 %, preferably 94 %, more preferably 97 % homology to SEQ ID NO: 2, or an immunogenic fragment of said polypeptide capable of inducing an immune response against ureaseXY.
10) Helicobacter felis urease Y subunit polypeptide, said polypeptide having an amino acid sequence that is at least 85 % homologous to SEQ ID NO: 3 or an immunogenic fragment of said polypeptide with a length of at least 40, preferably 45, more preferably 50 amino acids said immunogenic fragment being capable of inducing an immune response against ureaseXY.
11 ) Polypeptide according to claim 10, having a sequence homology of at least 90 %, preferably 94 %, more preferably 97 % homology to SEQ ID NO: 3, or an immunogenic fragment of said polypeptide capable of inducing an immune response against ureaseXY.
12) Polypeptide according to claims 8-11 for use in a vaccine
13) Use of a polypeptide according to claims 8-11 in the manufacturing of a vaccine for combating Helicobacter felis infections.
14) Vaccine for combating Helicobacter felis infections, characterised in that it comprises a nucleic acid sequence according to claims 1-3, a DNA fragment according to claim 4, a recombinant DNA molecule according to claim 5, a live recombinant carrier according to claim 6, a host cell according to claim 7 or a polypeptide according to claims 8-11, and a pharmaceutically acceptable carrier.
15) Vaccine according to claim 14, characterised in that it comprises an adjuvant.
16) Vaccine according to claim 14 or 15, characterised in that it comprises an additional antigen derived from a virus or micro-organism pathogenic to mammals or genetic information encoding said antigen.
17) Vaccine according to claim 16, characterised in that said virus or micro-organism pathogenic to mammals is selected from the group of Feline Infectious Peritonitis virus, Feline Immune deficiency virus, Canine and Feline Parvovirus, Distemper virus, Adenovirus, Calicivirus, Bordetella bronchiseptica, Borrelia burgdorferi, Leptospira interrogans, Chlamydia and Bartonella henseli.
18) Vaccine for combating Helicobacter felis infections, characterised in that it comprises antibodies against a polypeptide according to claims 8-11.
19) Method for the preparation of a vaccine according to claims 14-17, said method comprising the admixing of a polypeptide according to claims 8-11 and a pharmaceutically acceptable carrier.
20) Diagnostic test for the detection of Helicobacter felis specific DNA
characterised in that the test comprises a nucleic acid sequence according to claims 1-3, or a fragment thereof.
characterised in that the test comprises a nucleic acid sequence according to claims 1-3, or a fragment thereof.
21) Diagnostic test for the detection of antibodies against Helicobacter felis, characterised in that said test comprises a polypeptide or a fragment thereof as described in claims 8-11.
22) Diagnostic test for the detection of antigenic material of Helicobacter felis, characterised in that said test comprises antibodies against a polypeptide or a fragment thereof as described in claims 8-11.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP00202565 | 2000-07-17 | ||
NL00202565.8 | 2000-07-17 |
Publications (1)
Publication Number | Publication Date |
---|---|
CA2351110A1 true CA2351110A1 (en) | 2002-01-17 |
Family
ID=8171823
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA002351110A Abandoned CA2351110A1 (en) | 2000-07-17 | 2001-07-03 | Helicobacter felis vaccine |
Country Status (10)
Country | Link |
---|---|
US (1) | US7514547B2 (en) |
EP (1) | EP1176192B1 (en) |
JP (1) | JP2002355054A (en) |
AT (1) | ATE251215T1 (en) |
AU (1) | AU782172B2 (en) |
CA (1) | CA2351110A1 (en) |
DE (1) | DE60100879T2 (en) |
DK (1) | DK1176192T3 (en) |
ES (1) | ES2208519T3 (en) |
PT (1) | PT1176192E (en) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8029777B2 (en) | 2004-08-13 | 2011-10-04 | Marshall Barry J | Helicobacter system and uses thereof |
JP2008509168A (en) * | 2004-08-13 | 2008-03-27 | マーシャル,バリー,ジェー. | Bacteria delivery system |
Family Cites Families (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6290962B1 (en) * | 1992-11-03 | 2001-09-18 | Oravax, Inc. | Urease-based vaccine and treatment for helicobacter infection |
US6258359B1 (en) * | 1993-05-19 | 2001-07-10 | Institut Pasteur | Immunogenic compositions against helicobacter infection, polypeptides for use in the compositions, and nucleic acid sequences encoding said polypeptides |
WO1995014093A1 (en) * | 1993-05-19 | 1995-05-26 | Institut Pasteur | Immunogenic compositions against helicobacter infection, polypeptides for use in the compositions and nucleic acid sequences encoding said polypeptides |
US5610060A (en) * | 1994-06-24 | 1997-03-11 | The United States Of America As Represented By The Department Of Health And Human Services | Isolated Helicobacter hepaticus |
WO1996033220A1 (en) * | 1995-04-21 | 1996-10-24 | Csl Limited | Protective helicobacter antigens |
JP3007037B2 (en) * | 1995-07-19 | 2000-02-07 | 秀実 高橋 | Helicobacter pylori urease protein-derived artificial antigen and antibody induced by the artificial antigen |
GB2307987A (en) * | 1995-12-06 | 1997-06-11 | Univ Manchester | Epitopes of the urease of Helicobacter pylori as dignostic agents; pharmaceuticals comprising such epitopes or the antibodies thereto |
JP3430853B2 (en) * | 1997-04-11 | 2003-07-28 | 株式会社ゲン・コーポレーション | Preventive and therapeutic agents for gastritis, gastric ulcer and duodenal ulcer |
US20030158396A1 (en) * | 1997-07-29 | 2003-08-21 | Harold Kleanthous | Identification of polynucleotides encoding novel helicobacter polypeptides in the helicobacter genome |
US5985631A (en) * | 1997-09-12 | 1999-11-16 | Oravax-Merieux Co. | Method for preventing the activation of inactive, recombinant Helicobacter pylori apourease |
EP1278768B1 (en) * | 2000-04-27 | 2006-11-15 | Max-Planck-Gesellschaft zur Förderung der Wissenschaften e.V. | Method for identifying helicobacter antigens |
-
2001
- 2001-07-03 CA CA002351110A patent/CA2351110A1/en not_active Abandoned
- 2001-07-11 PT PT01202666T patent/PT1176192E/en unknown
- 2001-07-11 AU AU54321/01A patent/AU782172B2/en not_active Ceased
- 2001-07-11 EP EP01202666A patent/EP1176192B1/en not_active Expired - Lifetime
- 2001-07-11 DE DE60100879T patent/DE60100879T2/en not_active Expired - Fee Related
- 2001-07-11 ES ES01202666T patent/ES2208519T3/en not_active Expired - Lifetime
- 2001-07-11 DK DK01202666T patent/DK1176192T3/en active
- 2001-07-11 AT AT01202666T patent/ATE251215T1/en not_active IP Right Cessation
- 2001-07-13 US US09/904,994 patent/US7514547B2/en not_active Expired - Fee Related
- 2001-07-16 JP JP2001214711A patent/JP2002355054A/en not_active Withdrawn
Also Published As
Publication number | Publication date |
---|---|
JP2002355054A (en) | 2002-12-10 |
ATE251215T1 (en) | 2003-10-15 |
DK1176192T3 (en) | 2004-02-02 |
EP1176192A2 (en) | 2002-01-30 |
DE60100879T2 (en) | 2004-05-19 |
US7514547B2 (en) | 2009-04-07 |
ES2208519T3 (en) | 2004-06-16 |
US20040005325A1 (en) | 2004-01-08 |
AU5432101A (en) | 2002-01-24 |
EP1176192A3 (en) | 2002-02-06 |
DE60100879D1 (en) | 2003-11-06 |
PT1176192E (en) | 2004-01-30 |
AU782172B2 (en) | 2005-07-07 |
EP1176192B1 (en) | 2003-10-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7491401B2 (en) | Lawsonia intracellularis vaccine | |
AU2005206291B2 (en) | Lawsonia intracellularis subunit vaccines | |
US20090017048A1 (en) | Novel Brachyspira hyodysenteriae vaccine | |
EP1664100B1 (en) | Lawsonia intracellularis subunit vaccine | |
US20070212373A1 (en) | Lawsonia Intracellularis 26 Kd Subunit Vaccine | |
EP1176192B1 (en) | Helicobacter felis vaccine | |
EP1532253B1 (en) | Streptococcus uberis protein, nucleic acid sequence encoding the same and its use in a mastitis vaccine | |
US7090849B2 (en) | Babesia canis vaccine | |
CA2548750A1 (en) | Lawsonia intracellularis 26 kd subunit vaccine |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
EEER | Examination request | ||
FZDE | Discontinued |