US20160122393A1 - Polypeptides having nucleic acid binding activity - Google Patents

Polypeptides having nucleic acid binding activity Download PDF

Info

Publication number
US20160122393A1
US20160122393A1 US14/934,029 US201514934029A US2016122393A1 US 20160122393 A1 US20160122393 A1 US 20160122393A1 US 201514934029 A US201514934029 A US 201514934029A US 2016122393 A1 US2016122393 A1 US 2016122393A1
Authority
US
United States
Prior art keywords
nucleic acid
seq
certain embodiments
polypeptide
acid binding
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US14/934,029
Inventor
Patrick Martin
David Simpson
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Applied Biosystems Inc
Applied Biosystems LLC
Original Assignee
Applied Biosystems Inc
Applied Biosystems LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Applied Biosystems Inc, Applied Biosystems LLC filed Critical Applied Biosystems Inc
Priority to US14/934,029 priority Critical patent/US20160122393A1/en
Assigned to APPLERA CORPORATION reassignment APPLERA CORPORATION ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: MARTIN, PATRICK, SIMPSON, DAVID
Assigned to APPLIED BIOSYSTEMS INC. reassignment APPLIED BIOSYSTEMS INC. MERGER (SEE DOCUMENT FOR DETAILS). Assignors: APPLERA CORPORATION
Assigned to APPLIED BIOSYSTEMS INC. reassignment APPLIED BIOSYSTEMS INC. MERGER (SEE DOCUMENT FOR DETAILS). Assignors: ATOM ACQUISITION CORPORATION
Assigned to APPLIED BIOSYSTEMS, LLC reassignment APPLIED BIOSYSTEMS, LLC MERGER (SEE DOCUMENT FOR DETAILS). Assignors: ATOM ACQUISITION, LLC & APPLIED BIOSYSTEMS INC.
Publication of US20160122393A1 publication Critical patent/US20160122393A1/en
Abandoned legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/195Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/10Transferases (2.)
    • C12N9/12Transferases (2.) transferring phosphorus containing groups, e.g. kinases (2.7)
    • C12N9/1241Nucleotidyltransferases (2.7.7)
    • C12N9/1252DNA-directed DNA polymerase (2.7.7.7), i.e. DNA replicase
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6813Hybridisation assays
    • C12Q1/6832Enhancement of hybridisation reaction
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2319/00Fusion polypeptide
    • C07K2319/80Fusion polypeptide containing a DNA binding domain, e.g. Lacl or Tet-repressor
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y207/00Transferases transferring phosphorus-containing groups (2.7)
    • C12Y207/07Nucleotidyltransferases (2.7.7)
    • C12Y207/07007DNA-directed DNA polymerase (2.7.7.7), i.e. DNA replicase

Definitions

  • Polypeptides having nucleic acid binding activity are provided. Methods of using polypeptides having nucleic acid binding activity are provided. Fusion proteins and methods of using fusion proteins are provided.
  • Polypeptides with nucleic acid binding activity are present in lower organisms, such as archaea, and higher organisms, such as eukaryotes. See, e.g., Pereira et al. (1997) Proc. Nat'l Acad. Sci. USA 94:12633-12637; and Motz et at. (2002) J. Biol. Chem. 277: 16179-16188.
  • Polypeptides with nucleic acid binding activity have various functions. For example, certain polypeptides with nucleic acid binding activity, such as histones and histone-like proteins, are involved in the packaging of chromatin into higher order structures. See, e.g., Pereira et at. (1997) Proc. Nat'l Acad. Sci.
  • an isolated polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO:1 is provided, wherein the polypeptide has nucleic acid binding activity.
  • the isolated polypeptide comprises the amino acid sequence of SEQ ID NO:1 or a fragment thereof having nucleic acid binding activity.
  • an isolated polynucleotide comprising a polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO: 1, wherein the polypeptide has nucleic acid binding activity.
  • the isolated polynucleotide comprises the polynucleotide sequence of SEQ ID NO:2.
  • the isolated polynucleotide comprises the polynucleotide sequence of SEQ ID NO:3.
  • a vector comprising the isolated polynucleotide is provided.
  • a host cell comprising the vector is provided.
  • a method of producing a polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO:1, wherein the polypeptide has nucleic acid binding activity comprises culturing, the host cell under conditions suitable for expression of the polypeptide and isolating the polypeptide so expressed.
  • an isolated polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO:4 is provided, wherein the polypeptide has nucleic acid binding activity.
  • the isolated polypeptide comprises the amino acid sequence of SEQ ID NO:4 or a fragment thereof having nucleic acid binding activity.
  • an isolated polynucleotide comprising a polynucleotide sequence encoding a polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO:4, wherein the polypeptide has nucleic acid binding activity.
  • the isolated polynucleotide comprises, the polynucleotide sequence of SEQ ID NO:5.
  • a vector comprising the isolated polynucleotide is provided.
  • a host cell comprising the vector is provided.
  • a method of producing a polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO:4, wherein the polypeptide has nucleic acid binding activity is provided, wherein the method comprises culturing the host cell under conditions suitable for expression of the polypeptide and isolating the polypeptide so expressed.
  • an isolated polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO:6 is provided, wherein the polypeptide has nucleic acid binding activity.
  • the isolated polypeptide comprises the amino acid sequence of SEQ ID NO:6 or a fragment thereof having nucleic acid binding activity.
  • an isolated polynucleotide comprising a polynucleotide sequence encoding a polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO:6, wherein the polypeptide has nucleic acid binding activity.
  • the isolated polynucleotide comprises, the polynucleotide sequence of SEQ ID NO:7.
  • a vector comprising the isolated polynucleotide is provided.
  • a host cell comprising the vector is provided.
  • a method of producing a polypeptide comprising an amino acid sequence having at least 80% identity to SEQ. ID NO:6, wherein the polypeptide has nucleic acid binding activity is provided, wherein the method comprises culturing the host cell under conditions suitable for expression of the polypeptide and isolating the polypeptide so expressed.
  • an isolated polypeptide comprising the amino acid sequence of SEQ ID NO:28 is provided, wherein the polypeptide has:
  • a fusion protein comprising, a polypeptide comprising the amino acid sequence of SEQ ID NO:28, wherein the polypeptide has nucleic acid binding activity, and a nucleic acid polymerase.
  • a fusion protein comprises an amino acid sequence, of a Crenarchaeal nucleic acid binding polypeptide or a fragment thereof having nucleic acid binding activity, and a nucleic acid polymerase.
  • a fusion protein comprising an amino acid sequence having at least 80% identity to SEQ ID NO:1, wherein the polypeptide has nucleic acid binding activity, and a nucleic acid polymerase.
  • the nucleic acid polymerase is a thermostable DNA polymerase.
  • the polypeptide comprises the amino acid sequence of SEQ ID NO:1 or a fragment of SEQ ID NO:1 that has nucleic acid binding activity.
  • a fusion protein comprising an amino acid sequence having at least 80% identity to SEQ ID NO:4, wherein the polypeptide has nucleic acid binding activity, and a nucleic acid polymerase.
  • the nucleic acid polymerase is a thermostable DNA polymerase.
  • the polypeptide comprises the amino acid sequence of SEQ ID NO:4 or a fragment of SEQ ID NO:4 that has nucleic acid binding activity.
  • a fusion protein comprising an amino acid sequence having at least 80% identity to SEQ ID NO:6, wherein the polypeptide has nucleic acid binding activity, and a nucleic acid polymerase.
  • the nucleic acid polymerase is a thermostable DNA polymerase.
  • the polypeptide comprises the amino acid sequence of SEQ ID NO:6 or a fragment of SEQ ID NO:6 that has nucleic acid binding activity.
  • a fusion protein comprising an amino acid sequence selected from SEQ ID NOs:23, 24, 26; 27, 34, 35, 37, 38, 40, 41, 43, and 44 is provided.
  • a method of stabilizing a nucleic acid duplex comprises combining the nucleic acid duplex with a polypeptide comprising the amino acid sequence of SEQ ID NO:28, wherein the polypeptide has nucleic acid binding activity.
  • a method of stabilizing a nucleic acid duplex comprises combining the nucleic acid duplex with a polypeptide comprising an amino acid sequence of a Crenarchaeal nucleic acid binding polypeptide or a fragment thereof having nucleic acid binding activity.
  • a method of stabilizing a nucleic acid duplex comprises combining the nucleic acid duplex with a polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO: 1, wherein the polypeptide has nucleic acid binding activity.
  • the polypeptide comprises the amino acid sequence of SEQ ID NO:1 or a fragment of SEQ ID NO:1 that has nucleic acid binding activity.
  • a method of stabilizing a nucleic acid duplex comprises combining the nucleic acid duplex with a polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO:4, wherein the polypeptide has nucleic acid binding activity.
  • the polypeptide comprises the amino acid sequence of SEQ ID NO:4 or a fragment of SEQ ID NO:4 that has nucleic acid binding activity.
  • a method stabilizing a nucleic acid duplex comprises combining the nucleic acid duplex with a polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO:6, wherein the polypeptide has nucleic acid binding activity.
  • the polypeptide comprises the amino acid sequence of SEQ ID NO:6 or a fragment of SEQ ID NO:6 that has nucleic acid binding activity.
  • a method of promoting the annealing of complementary nucleic acid strands comprises combining the complementary nucleic acid strands with a polypeptide comprising the amino acid sequence of SEQ ID NO:28, wherein the polypeptide has nucleic acid binding activity.
  • a method of promoting the annealing of complementary nucleic acid strands comprises combining the complementary nucleic acid strands with a polypeptide comprising an amino acid sequence of a Crenarchaeal nucleic acid binding polypeptide or a fragment thereof having nucleic acid binding activity.
  • a method of promoting the annealing of complementary nucleic acid strands comprises combining the complementary nucleic acid strands with a polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO:1, wherein the polypeptide has nucleic acid binding activity.
  • the polypeptide comprises the amino acid sequence of SEQ ID NO:1 or a fragment of SEQ ID NO:1 that has nucleic acid binding activity.
  • a method of promoting the annealing of complementary nucleic acid strands comprises combining the complementary nucleic acid strands with a polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO:4, wherein the polypeptide has nucleic acid binding activity.
  • the polypeptide comprises the amino acid sequence of SEQ ID NO:4 or a fragment of SEQ ID NO:4 that has nucleic acid binding activity.
  • a method of promoting the annealing of complementary nucleic acid strands comprises combining the complementary nucleic acid strands with a polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO:6, wherein the polypeptide has nucleic acid binding activity.
  • the polypeptide comprises the amino acid sequence of SEQ ID NO:6 or a fragment of SEQ ID NO:6 having nucleic acid binding activity.
  • a method of increasing the processivity of a DNA polymerase comprises combining a reaction mixture comprising the DNA polymerase with a polypeptide comprising the amino acid sequence of SEQ ID NO:28, wherein the polypeptide has nucleic acid binding activity.
  • a method of increasing the processivity of a DNA polymerase comprises combining a reaction mixture comprising the DNA polymerase with a polypeptide comprising an amino acid sequence of a Crenarchaeal nucleic acid binding polypeptide or fragment thereof having nucleic acid binding activity.
  • a method of increasing the processivity of a DNA polymerase comprises combining a reaction mixture comprising the DNA polymerase with a polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO:1, wherein the polypeptide has nucleic acid binding activity.
  • the polypeptide comprises the amino acid sequence of SEQ ID NO:1 or a fragment of SEQ ID NO:1 that has nucleic acid binding activity.
  • a method of increasing the processivity of a DNA polymerase comprises combining a reaction mixture comprising the DNA polymerase with a polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO:4, wherein the polypeptide has nucleic acid binding activity.
  • the polypeptide comprises the amino acid sequence of SEQ ID NO:4 or a fragment of SEQ ID NO:4 that has nucleic acid binding activity.
  • a method of increasing the processivity of a DNA polymerase comprises combining a reaction mixture comprising the DNA polymerase with a polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO:6, wherein the polypeptide has nucleic acid binding activity.
  • the polypeptide comprises the amino acid sequence of SEQ ID NO:6 or a fragment of SEQ ID NO:6 that has nucleic acid binding activity.
  • kits comprising a polypeptide comprising the amino acid sequence of SEQ ID NO:28, wherein the polypeptide has nucleic acid binding activity, and a nucleic acid modification enzyme.
  • kits comprising a polypeptide comprising an amino acid sequence of a Crenarchaeal nucleic acid binding polypeptide or a fragment thereof having nucleic acid binding activity, and a nucleic acid modification enzyme.
  • kits comprising a polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO:1, wherein the polypeptide has nucleic acid binding activity, and a nucleic acid modification enzyme.
  • the polypeptide comprises the amino acid sequence of SEQ ID NO:1 or a fragment of SEQ ID NO:1 that has nucleic acid binding activity.
  • kits comprising a polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO:4, wherein the polypeptide has nucleic acid binding activity, and a nucleic acid modification enzyme.
  • the polypeptide comprises the amino acid sequence of SEQ ID NO:4 or a fragment of SEQ ID NO:4 that has nucleic acid binding activity.
  • kits comprising a polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO:6, wherein the polypeptide has nucleic acid binding activity, and a nucleic acid modification enzyme.
  • the polypeptide comprises the amino acid sequence of SEQ ID NO:6 or a fragment thereof having nucleic acid binding activity.
  • the nucleic acid modification enzyme is a thermostable DNA polymerase.
  • kits comprising any of the above fusion proteins and deoxyribonucleotides.
  • Crenarchaeal nucleic acid binding polypeptide refers to a naturally occurring Crenarchaeal polypeptide that has a molecular weight of about 6 to 11 kilodaltons and a predicted isoelectric point of about 9 to 11; that comprises less than or equal to 4 arginine residues and less than or equal to 15 lysine residues; that has nucleic acid binding activity; and that has an amino acid sequence that is less than 50% identical to the amino acid sequence of Sso7d (SEQ ID NO:30).
  • the Crenarchaea include, but are not limited to, members of the genus Pyrobaculum, Thermoproteus, Thermocladium, Caldivirga, Thermofilum, Staphylothermus, Ignicocus, Aeropyrum, Pyrodictium, Pyrolobus, Sulfolobus , and Metallosphaera . See, e.g., Fitz-Gibbon et al. (2002) Proc. Nat'l Acad. Sci. USA 99:984-989.
  • Nucleic acid binding activity refers to the activity of a polypeptide in binding nucleic acid in at least one of the following two band-shift assays.
  • first assay based on the assay of Guagliardi et al. (1997) J. Mol. Biol. 267:841-848
  • double stranded nucleic acid (the 452-bp HindIII-EcoRV fragment from the S. solfataricus lacS gene) is labeled with 32 P to a specific activity of at least about 2.5 ⁇ 10 7 cpm/ug (or at least about 4000 cpm/fmol) using standard methods.
  • a reaction mixture is prepared containing at least about 0.5 ⁇ g of the polypeptide in about 10 ⁇ l of binding buffer (50 mM sodium phosphate buffer (pH 8.0), 10% glycerol, 25 mM KCl, 25 mM MgCl 2 ). The reaction mixture is heated to 37° C. for ten minutes.
  • binding buffer 50 mM sodium phosphate buffer (pH 8.0), 10% glycerol, 25 mM KCl, 25 mM MgCl 2 .
  • the reaction mixture is loaded onto a native polyacrylamide gel in 0.5 ⁇ Tris-borate buffer.
  • the reaction mixture is subjected to electrophoresis at room temperature.
  • the gel is dried and subjected to autoradiography using standard methods. Any detectable decrease in the mobility of the labeled double stranded nucleic acid indicates formation of a binding complex between the polypeptide and the double stranded nucleic acid.
  • nucleic acid binding activity may be quantified using standard densitometric methods to measure the amount of radioactivity in the binding complex relative to the total amount of radioactivity in the initial reaction mixture.
  • Fusion protein refers to a protein comprising two or more domains joined either covalently or noncovalently, wherein two or more of the domains do not naturally occur in a single protein.
  • Nucleic acid polymerase refers to a polypeptide that catalyzes the synthesis of a polynucleotide using an existing polynucleotide as a template.
  • Polymerase activity refers to the activity of a nucleic acid polymerase in catalyzing the template-directed synthesis of a new polynucleotide. Polymerase activity is measured using the following assay, which is based on that of Lawyer et al. (1989) J. Biol. Chem. 264:6427-647. Serial dilutions of polymerase are prepared in dilution buffer (20 mM Tris Cl, pH 8.0, 50 mM KCl, 0.5% NP 40, and 0.5% Tween-20).
  • reaction mixture containing 25 mM TAPS (pH 9.25), 50 mM KCl, 2 mM MgCl 2 , 0.2 mM dATP, 0.2 mM dGTP, 0.2 mM dTTP, 0.1 mM dCTP, 12.5 ⁇ g activated DNA, 100 ⁇ M [ ⁇ - 32 P] dCTP (0.05 ⁇ Ci/nmol) and sterile deionized water.
  • the reaction mixtures are incubated at 37° C. (or 74° C. for thermostable DNA polymerases) for 10 minutes and then stopped by immediately cooling the reaction to 4° C.
  • DNA Polymerase refers to a nucleic acid polymerase that catalyzes the template-directed synthesis of DNA.
  • Processivity refers to the extent of polymerization by a nucleic acid polymerase during a single contact between the polymerase and its template.
  • the extent of polymerization refers to the number of nucleotides added by the polymerase during a single contact between the polymerase and its template.
  • Thermostable DNA polymerase refers to a DNA polymerase that, at a temperature higher than 37° C., retains its ability to add at least one nucleotide onto the 3′ end of a primer or primer extension product that is annealed to a target nucleic acid sequence.
  • a thermostable DNA polymerase remains active at a temperature greater than about 37° C.
  • a thermostable DNA polymerase remains active at a temperature greater than about 42° C.
  • a thermostable DNA polymerase remains active at a temperature greater than about 50° C.
  • a thermostable DNA polymerase remains active at a temperature greater than about 60° C.
  • thermostable DNA polymerase remains active at a temperature greater than about 70° C. In certain embodiments, a thermostable DNA polymerase remains active at a temperature greater than about 80° C. In certain embodiments, a thermostable polymerase remains active at a temperature greater than about 90° C.
  • Percent identity or “% identity,” with reference to nucleic acid sequences, refers to the percentage of identical nucleotides between at least two polynucleotide sequences aligned using the Basic Local Alignment Search Tool (BLAST) engine. See Tatusova et al. (1999) FEMS Microbiol Lett. 174:247-250.
  • the BLAST engine (version 2:2.10) is provided to the public by the National Center for Biotechnology Information (NCBI), Bethesda, Md.
  • NCBI National Center for Biotechnology Information
  • the “Blast 2 Sequences” tool is used, which employs the “blastn” program with parameters set at default values as follows:
  • Gap_x dropoff 50
  • Percent identity or “% identity,” with reference to polypeptide sequences, refers to the percentage of identical amino acids between at least two polypeptide sequences aligned using the Basic Local Alignment Search Tool (BLAST) engine. See Tatusova et at. (1999) FEMS Microbiol Lett. 174:247-250.
  • the BLAST engine (version 2.2.10) is provided to the public by the National Center for Biotechnology Information (NCBI), Bethesda, Md.
  • NCBI National Center for Biotechnology Information
  • Blast 2 Sequences is used, which employs the “blastp” program with parameters set at default value as follows:
  • Gap_x dropoff 50
  • polypeptide “peptide,” and “protein” are used interchangeably herein to refer to a polymer of amino acid residues.
  • the terms apply to amino acid polymers containing naturally occurring amino acids as well as amino acid polymers in which one or more amino acid residues is an artificial chemical analogue of a corresponding naturally occurring amino acid.
  • the amino acid polymers can be of any length.
  • a “fragment” of a reference polypeptide refers to a contiguous stretch of amino acids from any portion of the reference polypeptide.
  • a fragment may be of any length that is less than the length of the reference polypeptide.
  • a “variant” of a reference polypeptide refers to a polypeptide having one or more amino acid substitutions, deletions, or insertions relative to the reference polypeptide.
  • Exemplary conservative substitutions include, but are not limited to, those set forth below:
  • Nucleic acid modification enzyme refers to an enzymatically active polypeptide that acts on a nucleic acid substrate.
  • Nucleic acid modification enzymes include, but are not limited to, nucleic acid polymerases (such as DNA polymerases and RNA polymerases), nucleases (including endonucleases, such as restriction endonucleases, and exonucleases, such as 3′ or 5′ exonucleases), gyrases, topoisomerases, methylases, and ligases.
  • Melting temperature or “Tm” refers to the temperature at which 50% of the base pairs in a double-stranded nucleic acid have denatured. “Predicted Tm” refers to the Tm calculated for a nucleic acid using the following equation:
  • Tm 81.5° C.+16.6 log 10 [M + ]+0.41(%[ G+C ])
  • [M + ] is the monovalent cation concentration for M + ⁇ 0.5 M. See Sambrook et al. (2001) Molecular Cloning: A Laboratory Manual (3 rd ed., Cold Spring Harbor Laboratory Press, NY) at 10.47.
  • nucleotide base refers to a substituted or unsubstituted aromatic ring or rings.
  • the aromatic ring or rings contain at least one nitrogen atom.
  • the nucleotide base is capable of forming Watson-Crick and/or Hoogsteen hydrogen bonds with an appropriately complementary nucleotide base.
  • nucleotide bases and analogs thereof include, but are not limited to, naturally occurring nucleotide bases adenine, guanine, cytosine, 6 methyl-cytosine, uracil, thymine, and analogs of the naturally occurring nucleotide bases, e.g.,7-deazaadenine, 7-deazaguanine, 7-deaza-8-azaguanine, 7-deaza-8-azaadenine, N6- ⁇ 2-isopentenyladenine (6iA), N6- ⁇ 2-isopentenyl-2-methylthioadenine (2ms6iA), N2-dimethylguanine (dmG), 7-methylguanine (7mG), inosine, nebularine, 2-aminopurine, 2-amino-6-chloropurine, 2,6-diaminopurine, hypoxanthine, pseudouridine, pseudocytosine, pseudoisocytosine, 5-propynylcytosine
  • nucleotide refers to a compound comprising a nucleotide base linked to the C-1′ carbon of a sugar, such as ribose, arabinose, xylose, and pyranose, and sugar analogs thereof.
  • a sugar such as ribose, arabinose, xylose, and pyranose
  • nucleotide also encompasses nucleotide analogs.
  • the sugar may be substituted or unsubstituted.
  • Substituted ribose sugars include, but are not limited to, those riboses in which one or more of the carbon atoms, for example the 2′-carbon atom, is substituted with one or more of the same or different Cl, F, —R, —OR, —NR 2 or halogen groups, where each R is independently H, C 1 -C 6 alkyl or C 5 -C 14 aryl.
  • Exemplary riboses include, but are not limited to, 2′-(C1-C6)alkoxyribose, 2′-(C5 -C14)aryloxyribose, 2′,3′-didehydroribose, 2′-deoxy-3′-haloribose, 2′-deoxy-3′-fluororibose, 2′-deoxy-3′-chlororibose, 2′-deoxy-3′-aminoribose, 2′-deoxy-3′-(C1-C6)alkylribose, 2′-deoxy-3′-(C1-C6)alkoxyribose and 2′-deoxy-3′-(C5-C14)aryloxyribose, ribose, 2′-deoxyribose, 2′,3′-dideoxyribose, 2′-haloribose, 2′-fluororibose, 2′-chlororibo
  • B is any nucleotide base.
  • Modifications at the 2′- or 3′-position of ribose include, but are not limited to, hydrogen, hydroxy, methoxy, ethoxy, allyloxy, isopropoxy, butoxy, isobutoxy, methoxyethyl, alkoxy, phenoxy, azido, amino, alkylamino, fluoro, chloro and bromo.
  • Nucleotides include, but are not limited to, the natural D optical isomer, as well as the L optical isomer forms (see, e.g., Garbesi (1993) Nucl. Acids Res. 21 :4159-65; Fujimori (1990) J. Amer. Chem. Soc.
  • nucleotide base is purine, e.g. A or G
  • the ribose sugar is attached to the N 9 -position of the nucleotide base.
  • the pentose sugar is attached to the N 1 -position of the nucleotide base, except for pseudouridines, in which the pentose sugar is attached to the C5 position of the uracil nucleotide base (see, e.g., Kornberg and Baker, (1992) DNA Replication, 2 nd Ed., Freeman, San Francisco, Calif.).
  • One or more of the pentose carbons of a nucleotide may be substituted with a phosphate ester having the formula:
  • nucleotides are those in which the nucleotide base is a purine, a 7-deazapurine, a pyrimidine, or an analog thereof.
  • Nucleotide 5′-triphosphate refers to a nucleotide with a triphosphate ester group at the 5′ position, and is sometimes denoted as “NTP”, or “dNTP” and “ddNTP” to particularly point out the structural features of the ribose sugar.
  • the triphosphate ester group may include sulfur substitutions for the various oxygens, e.g. ⁇ -thio-nucleotide 5′-triphosphates.
  • sulfur substitutions for the various oxygens e.g. ⁇ -thio-nucleotide 5′-triphosphates.
  • nucleotide analog refers to embodiments in which the pentose sugar and/or the nucleotide base and/or one or more of the phosphate esters of a nucleotide may be replaced with its respective analog.
  • exemplary pentose sugar analogs are those described above.
  • nucleotide analogs have a nucleotide base analog as described above.
  • exemplary phosphate ester analogs include, but are not limited to, alkylphsosphonates, methylphosphonates, phosphoramidates, phosphotriesters, phosphorothioates, phosphorodithioates, phosphoroselenoates, phosphorodiselenoates, phosphoroanilothioates, phosphoroanilidates, phosphoroamidates, boronophosphates, etc., and may include associated counterions.
  • nucleotide analog also included within the definition of “nucleotide analog” are nucleotide analog monomers that can be polymerized into polynucleotide analogs in which the DNA/RNA phosphate ester and/or sugar phosphate ester backbone is replaced with a different type of internucleotide linkage.
  • Exemplary polynucleotide analogs include, but are not limited to, peptide nucleic acids, in which the sugar phosphate backbone of the polynucleotide is replaced by a peptide backbone.
  • polynucleotide As used herein, the terms “polynucleotide,” “oligonucleotide,” and “nucleic acid” are used interchangeably and mean single-stranded and double-stranded polymers of nucleotide monomers, including 2′-deoxyribonucleotides (DNA) and ribonucleotides (RNA) linked by internucleotide phosphodiester bond linkages, or internucleotide analogs, and associated counter ions, e.g., H + , NH 4 + , trialkylammonium, Mg 2+ , Na + and the like.
  • DNA 2′-deoxyribonucleotides
  • RNA ribonucleotides linked by internucleotide phosphodiester bond linkages
  • counter ions e.g., H + , NH 4 + , trialkylammonium, Mg 2+ , Na + and the like.
  • a nucleic acid may be composed entirely of deoxyribonucleotides, entirely of ribonucleotides, or chimeric mixtures thereof.
  • the nucleotide monomer units may comprise any of the nucleotides described herein, including, but not limited to, naturally occurring nucleotides and nucleotide analogs.
  • Nucleic acids typically range in size from a few monomeric units, e.g. 5-40 when they are sometimes referred to in the art as oligonucleotides, to several thousands of monomeric nucleotide units.
  • nucleic acid sequence is represented, it will be, understood that the nucleotides are in 5′ to 3′ order from left to right and that “A” denotes deoxyadenosine or an analog thereof, “C” denotes deoxycytidine or an analog thereof, “G” denotes deoxyguanosine or an analog thereof, “T” denotes thymidine or an analog thereof, and “U” denotes uridine or an analog thereof, unless otherwise noted.
  • Nucleic acids include, but are not limited to, genomic DNA, cDNA, hnRNA, mRNA, rRNA, tRNA, fragmented nucleic acid, nucleic acid obtained from subcellular organelles such as mitochondria or chloroplasts, and nucleic acid obtained from microorganisms or DNA or RNA viruses that may be present on or in a biological sample. Nucleic acids include, but are not limited to, synthetic or in vitro transcription products.
  • Nucleic acids may be composed of a single type of sugar moiety, e.g., as in the case of RNA and DNA, or mixtures of different sugar moieties, e.g., as in the case of RNA/DNA chimeras.
  • nucleic acids are ribopolynucleotides and 2′-deoxyribopolynucleotides according to the structural formulae below:
  • each B is independently the base moiety of a nucleotide, e.g., a purine, a 7-deazapurine, a pyrimidine, or an analog nucleotide
  • each m defines the length of the respective nucleic acid and can range from zero to thousands, tens of thousands, or even more
  • each R is independently selected from the group comprising hydrogen, halogen, —R′′, —OR′′, and —NR′′H′′, where each R′′ is independently (C1-C6) alkyl or (C5 -C14) aryl, or two adjacent Rs are taken together to form a bond such that the ribose sugar is 2′,3′-didehydroribose; and each R′ is independently hydroxyl or
  • is zero, one or two.
  • nucleotide bases B are covalently attached to the C1′ carbon of the sugar moiety as previously described.
  • nucleic acid may also include nucleic acid analogs, polynucleotide analogs and oligonucleotide analogs.
  • nucleic acid analog refers to a nucleic acid that contains at least one nucleotide analog and/or at least one phosphate ester analog and/or at least one pentose sugar analog.
  • nucleic acid analogs include nucleic acids in which the phosphate ester and/or sugar phosphate ester linkages are replaced with other types of linkages, such as N-(2-aminoethyl)-glycine amides and other amides (see, e.g., Nielsen et al.; 1991, Science 254:1497-1500; WO:92/20702; U.S. Pat. No. 5,719,262; U.S. Pat. No. 5,698,685;); morpholinos (see, e.g., U.S. Pat. No. 5,698,685; U.S. Pat. No. 5,378,841; U.S. Pat. No.
  • PNA 2-aminoethylglycine
  • PNA 2-aminoethylglycine
  • PNA 2-aminoethylglycine
  • sequence encompasses situations where both of the sequences are completely the same or complementary to one another, and situations where only a portion of one of the sequences is the same as, or is complementary to, a portion or the entirety of the other sequence.
  • sequence encompasses, but is not limited to, nucleic acid sequences, polynucleotides, oligonucleotides, probes, and primers.
  • sequence encompasses, but is not limited to, nucleic acid sequences, polynucleotides, oligonucleotides, probes, and primers. Despite the mismatches, the two sequences should selectively hybridize to one another under appropriate conditions.
  • the term “selectively hybridize” means that, for particular identical sequences, a substantial portion of the particular identical sequences hybridize to a given desired sequence or sequences, and a substantial portion of the particular identical sequences do not hybridize to other undesired sequences.
  • a “substantial portion of the particular identical sequences” in each instance refers to a portion of the total number of the particular identical sequences, and it does not refer to a portion of an individual particular identical sequence.
  • “a substantial portion of the particular identical sequences” means at least 70% of the particular identical sequences.
  • “a substantial portion of the particular identical sequences” means at least 80% of the particular identical sequences.
  • “a substantial portion of the particular identical sequences means at least 90% of the particular identical sequences.
  • “a substantial portion of the particular identical sequences” means at least 95% of the particular identical sequences.
  • the number of mismatches that may be present may vary in view of the complexity of the composition.
  • the more complex the composition the more likely undesired sequences will hybridize.
  • a probe may more likely hybridize to undesired sequences in a composition with the entire genomic DNA than in a composition with fewer DNA sequences, when the same hybridization and wash conditions are employed for both compositions.
  • that given number of mismatches may be appropriate for the composition with fewer DNA sequences, but fewer mismatches may be more optimal for the composition with the entire genomic DNA.
  • sequences are complementary if they have no more than 20% mismatched nucleotides. In certain embodiments, sequences are complementary if they have no more than 15% mismatched nucleotides. In certain embodiments, sequences are complementary if they have no more than10% mismatched nucleotides. In certain embodiments, sequences are complementary if they have no more than 5% mismatched nucleotides.
  • sequence encompasses situations where the entirety of both of the sequences hybridize or bind to one another, and situations where only a portion of one or both of the sequences hybridizes or binds to the entire other sequence or to a portion of the other sequence.
  • sequence encompasses, but is not limited to, nucleic acid sequences, polynucleotides, oligonucleotides, probes, and primers.
  • target refers to a nucleic acid comprising a sequence that can be distinguished by a probe or primer.
  • a target polynucleotide is naturally occurring.
  • a target polynucleotide comprises synthetic molecules.
  • primer refers to a polynucleotide that anneals to a target polynucleotide and allows the synthesis from its 3′ end of a sequence complementary to the target polynucleotide.
  • primer extension reaction refers to a reaction in which a polymerase catalyzes the template-directed synthesis of a nucleic acid from the 3′ end of a primer.
  • primer extension product refers to the resultant nucleic acid.
  • a non-limiting exemplary primer extension reaction is the polymerase chain reaction (PCR).
  • probe comprises a polynucleotide that comprises a specific portion designed to hybridize in a sequence-specific manner with a complementary region of a specific nucleic acid sequence, e.g., a target polynucleotide.
  • the specific portion of the probe may be specific for a particular sequence, or alternatively, may be degenerate, e.g., specific for a set of sequences.
  • annealing and “hybridization” are used interchangeably and mean the base-pairing interaction of one nucleic acid with another nucleic acid that results in the formation of a duplex, triplex, or other higher-ordered structure.
  • the primary interaction is base specific, e.g., A/T and G/C, by Watson/Crick and Hoogsteen-type hydrogen bonding.
  • base-stacking and hydrophobic interactions may also contribute to duplex stability.
  • a “nucleic acid duplex” refers to any nucleic acid or portion of a nucleic acid that exists in double stranded form.
  • Double stranded nucleic acid includes homoduplexes, such as double stranded DNA, and heteroduplexes, such as DNA:RNA or DNA:PNA heteroduplexes.
  • Double stranded nucleic acid also includes, for example, a region of a nucleic acid strand to which a primer or probe has annealed.
  • Sso7d and Sac7d are of relatively low molecular weight (about 7 kDa) and are rich in lysine residues. Id. Certain lysine residues are believed to be involved in DNA binding. See Gao et al. (1998) Nature Struct. Biol. 5:782-786. Both protect double stranded DNA from thermal denaturation by increasing its melting temperature (Tm) by about 40° C. Id.; Robinson et al. (1998) Nature 392:202-205. Sso7d also promotes the annealing of complementary DNA strands at temperatures exceeding the predicted Tm of the resulting duplex. See Guagliardi et al. (1997) J. Mol. Biol. 267:841-848.
  • Sso7d exhibits a strong preference for DNA strands that are complementary without any mismatches over DNA strands that contain even a single mismatch. See id.; U.S. Patent Application Publication No. US 2003/0022162 A1. It is postulated that small, basic polypeptides such as Sso7d and Sac7d protect the DNA of hyperthermophiles from denaturation and degradation in the hyperthermophilic environment, where temperatures approach or exceed 100° C. See Guagliardi et al. (1997) J. Mol. Biol. 267:841-848.
  • an isolated polypeptide having nucleic acid binding activity is provided.
  • a polypeptide having nucleic acid binding activity is also referred to herein as a “nucleic acid binding polypeptide.”
  • an isolated nucleic acid binding polypeptide has a relatively low molecular weight and basic isoelectric point.
  • a nucleic acid binding polypeptide stabilizes a double stranded nucleic acid from denaturation or promotes the annealing of complementary nucleic acid strands.
  • a fusion protein comprising a nucleic acid binding polypeptide is provided.
  • a fusion protein comprises a nucleic acid binding polypeptide joined to a nucleic acid modification enzyme, such as a DNA polymerase.
  • an isolated nucleic acid binding polypeptide comprises a Crenarchaeal nucleic acid binding polypeptide.
  • a Crenarchaeal nucleic acid binding polypeptide comprises a naturally occurring polypeptide from the crenarchaeon Pyrobaculum aerophilum .
  • a Crenarchaeal nucleic acid binding polypeptide comprises the amino acid sequence of SEQ ID NO:1; which can be found at GenBank accession numbers AAL64739 and AAL64814.
  • SEQ ID NO:1 is encoded by the open reading frames “PAE3192” (SEQ ID NO:2) and “PAE3289” (SEQ ID NO:3) which are unannotated open reading frames identified in the complete genome sequence of P. aerophilum . See GenBank accession no. AE009441.
  • a Crenarchaeal nucleic acid binding polypeptide comprises the amino acid sequence of SEQ ID NO:4, which can be found at GenBank accession number AAL62754.
  • SEQ ID NO:4 is encoded by the open reading frame “PAE0384” (SEQ ID NO:5), which is an unannotated open reading frame identified in the complete genome sequence of P. aerophilum . See GenBank accession no. AE009441.
  • SEQ ID NOs:1 and 4 are low molecular weight, basic proteins of 57 and 56 amino acids in length, respectively, with a predicted isoelectric point of about 10.5.
  • SEQ ID.NO:1 contains 12 lysine residues and 2 arginine residues.
  • SEQ ID NO:4 contains 11 lysine residues and 2 arginine residues.
  • SEQ ID NOs:1 and 4 are about 97% identical to each other.
  • SEQ ID NOs:1 and 4 are similar in size and charge to Sso7d, but they are not significantly identical to the amino acid sequence of Sso7d.
  • SEQ ID NO:1 contains a “KKOK” motif (SEQ ID NO: 47) near its N-terminus (residues 3 to 6 of SEQ ID NO:1). This motif resembles the “KQKK” motif (SEQ ID NO: 47) found at the C-terminus of Sso7d (residues 61-64 of SEQ ID NO:30). The location of these motifs at opposite termini of SEQ ID NO:1 and Sso7d may have resulted from gene rearrangements during the divergence of the different Crenarchaeal species.
  • the KQKK motif (SEQ ID NO: 47) of Sso7d is discussed in Shehi et al. (2003) Biochem. 42:8362-8368.
  • an isolated nucleic acid binding polypeptide comprises a fragment of SEQ ID NO:1 that has at least one activity of a polypeptide comprising SEQ ID NO:1.
  • Exemplary activities include, but are not limited to, the ability of SEQ ID NO:1 to bind nucleic acid, stabilize nucleic acid duplexes, promote annealing of complementary nucleic acid strands, increase the Tm of primers, enhance the activity of a nucleic acid modification enzyme, and increase the processivity of a polymerase.
  • a fragment of SEQ ID NO:1 lacks N-terminal amino acids of SEQ ID NO:1.
  • the fragment of SEQ ID NO:1 lacks up to the first 12 N-terminal amino acids of SEQ ID NO:1. In certain embodiments, a fragment of SEQ ID NO:1 lacks C-terminal amino acids of SEQ ID NO:1. In certain such embodiments, the fragment of SEQ ID NO:1 lacks up to the last 12 C-terminal amino acids of SEQ ID NO:1. In certain embodiments, a fragment of SEQ ID NO:1 has a predicted isoelectric point of about 9-11.
  • an isolated nucleic acid binding polypeptide comprises a variant of a polypeptide comprising SEQ ID NO:1 that has at least one activity of a polypeptide comprising SEQ ID NO:1.
  • a variant of a polypeptide comprising SEQ ID NO:1 comprises an amino acid sequence having from about 60% to about 99% identity to SEQ ID NO:1.
  • a variant of a polypeptide comprising SEQ ID NO:1 comprises an amino acid sequence having at least 60%, 65%, 70%,75%, 80%, 85%, 90%, 95%, 97%, 98%, or 99% identity to SEQ ID NO:1.
  • the lysine and arginine residues of SEQ ID NO:1 are not substituted or deleted in the variant.
  • an isolated nucleic acid binding polypeptide comprises a fragment of SEQ ID NO:4 that has at least one activity of a polypeptide comprising SEQ ID NO:4.
  • Exemplary activities include, but are not limited to, the ability of SEQ ID NO:4 to bind nucleic acid, stabilize nucleic acid duplexes, promote annealing of complementary nucleic acid strands, increase the Tm of primers, enhance the activity of a nucleic acid modification enzyme, and increase the processivity of a polymerase.
  • a fragment of SEQ ID NO:4 lacks N-terminal amino acids of SEQ ID NO:4.
  • the fragment of SEQ ID NO:4 lacks up to the first 12 N-terminal amino acids of SEQ ID NO:4. In certain embodiments, a fragment of SEQ ID NO:4 lacks C-terminal amino acids of SEQ ID NO:4. In certain such embodiments, the fragment of SEQ ID NO:4 lacks up to the last 12 C-terminal amino acids of SEQ ID NO:4. In certain embodiments, a fragment of SEQ ID NO:4 has a predicted isoelectric point of about 9-11.
  • an isolated nucleic acid binding polypeptide comprises a variant of a polypeptide comprising SEQ ID NO:4 that has at least one activity of a polypeptide comprising SEQ ID NO:4.
  • a variant of a polypeptide comprising SEQ ID NO:4 comprises an amino acid sequence having from about 60% to about 99% identity to SEQ ID NO:4.
  • a variant of a polypeptide comprising SEQ ID NO:4 comprises an amino acid sequence having at least 60%, 65%, 70%, 75%, 80%,85%, 90%, 95%, 97%, 98%, or 99% identity to SEQ ID NO:4.
  • the lysine and arginine residues of SEQ ID NO:4 are not substituted or deleted in the variant.
  • a Crenarchaeal nucleic acid binding polypeptide comprises a naturally occurring polypeptide from the crenarchaeon Aeropyrum pernix .
  • a Crenarchaeal nucleic acid binding polypeptide comprises the amino acid sequence of SEQ ID NO:6.
  • SEQ ID NO:6 is 55 amino acids in length with a predicted isoelectric point of about 10.5. It contains 13 lysine residues and 3 arginine residues.
  • SEQ ID NO:6 is similar in size and charge to Sso7d, but is not significantly identical to the amino acid sequence of Sso7d.
  • an isolated nucleic acid binding polypeptide comprises a fragment of SEQ ID NO:6 that has at least one activity of a polypeptide comprising SEQ ID NO:6.
  • Exemplary activities include, but are not limited to, the ability of SEQ ID NO:6 to bind nucleic acid, stabilize nucleic acid duplexes, promote annealing of complementary nucleic acid strands, increase the Tm of primers, enhance the activity of a nucleic acid modification enzyme, and increase the processivity of a polymerase.
  • a fragment of SEQ ID NO:6 lacks N-terminal amino acids of SEQ ID NO:6.
  • the fragment of SEQ ID NO:6 lacks up to the first 12 N-terminal amino acids of SEQ ID NO:6. In certain embodiments, a fragment of SEQ ID NO:6 lacks C-terminal amino acids of SEQ ID NO:6. In certain such embodiments, the fragment of SEQ ID NO:6 lacks up to the last 12 C-terminal amino acids of SEQ ID NO:6. In certain embodiments, a fragment of SEQ ID NO:6 has a predicted isoelectric point of about 9-11.
  • an isolated nucleic acid binding polypeptide comprises a variant of a polypeptide comprising SEQ ID NO:6 that has at least one activity of a polypeptide comprising SEQ ID NO:6.
  • a variant of a polypeptide comprising SEQ ID NO:6 comprises an amino acid sequence having from about 60% to about 99% identity to SEQ ID NO:6.
  • a variant of a polypeptide comprising SEQ ID NO:6 comprises an amino acid sequence having at least 60%, 65%,70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, or 99% identity to SEQ ID NO:6.
  • the lysine and arginine residues of SEQ ID NO:6 are not substituted or deleted in the variant.
  • an isolated polynucleotide comprising a nucleic acid sequence encoding SEQ ID NO:6 is provided. In certain embodiments, an isolated polynucleotide comprises a nucleic acid sequence encoding a fragment of SEQ ID NO:6 having at least one activity of a polypeptide comprising SEQ ID NO:6. In certain embodiments, an isolated polynucleotide comprises a nucleic acid sequence encoding a variant of a polypeptide comprising SEQ ID NO:6 having at least one activity of a polypeptide comprising SEQ ID NO:6.
  • an isolated polynucleotide comprising SEQ ID NO:7 is provided. In certain embodiments, an isolated polynucleotide comprises a fragment of SEQ ID NO:7 that encodes a polypeptide having at least one activity of a polypeptide comprising SEQ ID NO:6. In certain embodiments, an isolated polynucleotide comprises a variant of a polynucleotide comprising SEQ ID NO:7 that encodes a polypeptide having at least one activity of a polypeptide comprising SEQ ID NO:6.
  • a variant of a polynucleotide comprising SEQ ID NO:7 comprises a nucleic acid sequence having from about 60% to about 99% identity to SEQ ID NO:7 and that encodes a polypeptide having at least one activity of a polypeptide comprising SEQ ID NO:6.
  • a variant of a polynucleotide comprising SEQ ID NO:7 comprises a nucleic acid sequence having at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98% or 99% identity to SEQ ID NO:7 and that encodes a polypeptide having at least one activity of a polypeptide comprising SEQ ID NO:6.
  • the length of an isolated polynucleotide is any number of nucleotides less than or equal to 10,000.
  • an isolated polynucleotide is less than or equal to 10,000, 9000, 8000, 7000, 6000, 5000, 4000, 3000, 2000, 1000, or 500 nucleotides in length.
  • the length of an isolated polynucleotide does not include vector sequences.
  • a fragment or variant of a Crenarchaeal nucleic acid binding polypeptide has nucleic acid binding activity that is less than the nucleic acid binding activity of the Crenarchaeal nucleic acid binding polypeptide. In certain embodiments, a fragment or variant of a Crenarchaeal nucleic acid binding polypeptide has from about 10-20%, about 20-30%, about 30-40%, about 40-50%, about 50-60%, about 60-70%, about 70-80%, about 80-90%, or about 90-95% of the nucleic acid binding activity of the Crenarchaeal nucleic acid binding polypeptide.
  • the first of the two alignments below demonstrates that SEQ ID NOs:1 and 6 have 60% identity and 74% similarity as determined by the “Blast 2 Sequence” blastp program set at default parameters. (In calculating percent similarity, the blastp program includes both identical and similar amino acids. Similar amino acids are indicated by “+” signs in the alignments below.)
  • the second of the two alignments below demonstrates that SEQ ID NOs:4 and 6 have 59% identity and 72% similarity as determined by the “Blast 2 Sequence” blastp program set at default parameters.
  • one or more amino acids that are not conserved in at least one of the alignments below are substituted or deleted to create variants of polypeptides comprising SEQ ID NO:1, SEQ ID NO:4, or SEQ ID NO:6.
  • SEQ ID NO: 1 1 MSKKQKLKFYDIKAKQAFETDQYEVIEKQTARGPMMFAVAKSPYTGIKVYRLLGKKK 57 M KK+K+KF+D+ AK+ +ETD YEV K+TRG FA AKSPYTG YR+LGK SEQ ID NO: 6: 1 MPKKEKIKFFDLVAKKYYETDNYEVEIKETKRGKFRFAKAKSPYTGKIFYRVLGKA 55 SEQ ID NO: 4: 1 MAKQKLKFYDIKAKQSFETDKYEVIEKETARGPMLFAVATSPYTGIKVYRLLGKKK 56 K+K+KF+D+ AK+ +ETD YEV KETRG FA A SPYTG YR+LGK SEQ ID NO: 6: 1 MPKKEKIKFFDLVAKKYYETDNYEVEIKETKRGKFRFAKAKSPYTGKIFYRVLGKA 55
  • nucleic acid binding polypeptide Based on the above alignments, a consensus sequence for a nucleic acid binding polypeptide is provided as follows:
  • an isolated nucleic acid binding polypeptide comprises an amino acid sequence that conforms to that consensus sequence.
  • the isolated nucleic acid binding polypeptide has at least one activity of a polypeptide comprising SEQ ID NO:1, SEQ ID NO:4, or SEQ ID NO:6.
  • a polynucleotide encoding a nucleic acid binding polypeptide is obtained by the polymerase chain reaction (PCR). Certain methods employing PCR are known to those skilled in the art. See, e.g., Sambrook et al. (2001) Molecular Cloning: A Laboratory Manual , Chapter 8 (3rd ed., Cold Spring Harbor Laboratory Press, NY).
  • PCR polymerase chain reaction
  • a polynucleotide comprising all or a portion of the coding sequence of a nucleic acid binding polypeptide is amplified using appropriate primers.
  • restriction enzyme sites are included in the primers to facilitate cloning of the amplification product into an appropriate expression vector.
  • the polynucleotide is amplified from genomic DNA or from cDNA of a crenarchaeote.
  • the complete genome sequence of certain crenarchaeotes is published and may be used in designing primers for PCR. See, e.g., Fitz-Gibbon et al. (2002) Proc. Nat'l Acad. Sci. USA 99:984-989; Kawarabayasi (1999) DNA Research Supp: 145-152; and She et al (2001) Proc. Nat'l Acad. Sci. USA 98:7835-7840.
  • a polynucleotide encoding a nucleic acid binding polypeptide is obtained by synthesizing individual oligonucleotides which are ligated end-to-end in vitro, with the resulting ligation product comprising the coding sequence of a nucleic acid binding polypeptide .
  • the ligation product is amplified by PCR.
  • the oligonucleotides overlap in sequence, and are extended by PCR, resulting in a PCR product comprising the coding sequence of a nucleic acid binding polypeptide. See, e.g., Stemmer et al. (1995) Gene 164:49-53; Gronlund et at (2003) J. Biol. Chem. 278:40144-40151.
  • the PCR product is cloned into an appropriate expression vector.
  • a polynucleotide encoding a nucleic acid binding polypeptide is cloned into a suitable vector.
  • the vector is transferred (e.g., transformed or transfected) into a host cell.
  • a polynucleotide encoding a nucleic acid binding polypeptide is cloned into an expression vector and, in certain embodiments, expressed in a suitable host cell.
  • Certain exemplary expression vectors are available for use in certain host cells including, but not limited to, prokaryotes, yeast cells, insect cells, plant cells, and mammalian cells. See, e.g., Ausubel et al.
  • a polynucleotide encoding a nucleic acid binding polypeptide is cloned into an expression vector such that its transcription is under the control of an inducible promoter, such as the T7 bacteriophage promoter, the T5 promoter, or the tac promoter.
  • the recombinant expression vector is transformed into bacteria, such as E. coli .
  • the expression of the nucleic acid binding polypeptide is induced by culturing the bacteria under certain growth conditions.
  • expression of the nucleic acid binding polypeptide is induced by addition of isopropylthio- ⁇ -galactoside (IPTG) to the culture medium.
  • a polynucleotide encoding a tag is expressed in frame with a polynucleotide encoding a nucleic acid binding polypeptide.
  • tags can provide a mechanism for detection or purification of the nucleic acid binding polypeptide. Examples of tags include, but are not limited to, polyhistidine tags, which allow purification using nickel chelating resin, and glutathione S-transferase moieties, which allow purification using glutathione-based chromatography.
  • an expression vector further provides a cleavage site between the tag and the nucleic acid binding polypeptide, so that the nucleic acid binding polypeptide may be cleaved from the tag following purification.
  • the nucleic acid binding polypeptide is not cleaved from the tag. It has been reported that the presence of a polyhistidine tag on a recombinant DNA binding protein may enhance the interaction of the DNA binding protein with DNA. See, e.g., Buning et al (1996) Anal. Biochem. 234:227-230.
  • fusion proteins are provided.
  • a fusion protein comprises any of the nucleic acid binding polypeptides described above joined to a nucleic acid modification enzyme.
  • the nucleic acid modification enzyme comprises a nucleic acid polymerase.
  • the nucleic acid polymerase comprises a DNA polymerase.
  • DNA polymerases include DNA-dependent polymerases, which use DNA as a template, or RNA-dependent polymerases, such as reverse transcriptase, which use RNA as a template.
  • DNA-dependent DNA polymerases fall into one of six families (A, B, C, D, X, and Y), with most falling into one of three families (A, B, and C). See, e.g., Ito et al.(1991) Nuc. Acids. Research 19:4045-4057; Braithwaite at al. (1993) Nuc. Acids. Research 21:787-802; Fileé et al. (2002) J. Mol. Evol.
  • a fusion protein comprises a DNA polymerase selected from a family A, B, C, D, X, or Y polymerase.
  • a fusion protein comprises a fragment or variant of an A, B, C, D, X, or Y polymerase having polymerase activity.
  • a fusion protein comprises a family A DNA polymerase or a fragment or variant thereof having polymerase activity.
  • the family A polymerase is a bacterial family A polymerase, such as a polymerase from the genus Bacillus, Thermus , or Thermotoga .
  • the family A polymerase is Taq DNA polymerase (SEQ ID NO:32) or a fragment or variant thereof having polymerase activity.
  • a fusion protein comprises a family B DNA polymerase or a fragment or variant thereof having polymerase activity.
  • the family B polymerase is an archaeal family B polymerase, such as a polymerase from the genus Thermococcus or Pyrococcus .
  • the family B polymerase is Pfu DNA polymerase (SEQ ID NO:31) or a fragment or variant thereof having polymerase activity.
  • DNA polymerases In addition to polymerase activity, certain DNA polymerases also possess other activities, such as 3′ to 5′ exonuclease (proofreading) activity or 5′ to 3′ exonuclease activity. See, e.g., Fileé et al. (2002) J. Mol. Evol. 54:763-773; and Pavlov et al. (2004) Trends in Biotech. 22:253-260.
  • polymerase activity and exonuclease activity are carried out by separate domains.
  • the domain structure of certain DNA polymerases is known to those skilled in the art. See, e.g., id.; Albà (2001) Genome Biol. 2:3002.1-3002.4; and Steitz (1999) J. Biol. Chem. 274:17395-17398.
  • a fusion protein comprises a “chimeric” DNA polymerase.
  • a chimeric DNA polymerase comprises a domain having polymerase activity from a particular DNA polymerase and a domain having exonuclease activity from a different DNA polymerase. See, e.g., U.S. Pat. Nos. 5,795,762 and 5,466,591.
  • a fusion protein comprises a DNA polymerase having both polymerase activity and exonuclease activity.
  • the exonuclease activity is 5′ to 3′ exonuclease activity.
  • the level of 5′ to 3′ exonuclease activity is reduced or eliminated relative to the level of 5′ to 3′ exonuclease activity of a native DNA polymerase.
  • mutation of a DNA polymerase results in reduction or elimination of 5′ to 3′ exonuclease activity.
  • one or more amino acid substitutions result in reduction or elimination of 5′ to 3′ exonuclease activity.
  • substitutions are known to those skilled in the art. For example, substitution of a conserved glycine in certain thermostable DNA polymerases reduces or eliminates 5′ to 3′ exonuclease activity. See, e.g., U.S. Pat. Nos. 5,795,762 and 5,466,591(describing the G46D substitution in Taq, Tth, and TZ05 DNA polymerases; the G43D substitution in Tsps17 DNA polymerase; and the G37D substitution in Tma and Taf DNA polymerases).
  • deletion of one or more amino acids from a DNA polymerase results in the reduction or elimination of 5′ to 3′ exonuclease activity.
  • Certain such deletions are known to those skilled in the art.
  • certain N-terminal deletions of certain thermostable DNA polymerases reduce or eliminate 5′ to 3′ exonuclease activity.
  • Exemplary N-terminal deletions include, but are not limited to, deletion of about the first 35-50 amino acid residues of a thermostable DNA polymerase. See, e.g., U.S. Pat. Nos.
  • N-terminal deletions further include, but are not limited to, deletion of about the first 70-80 amino acid residues of a thermostable DNA polymerase. See, e.g., U.S. Pat. Nos.
  • N-terminal deletions further include, but are not limited to, deletion of the first 139 or the first 283 amino acid residues of Tma DNA polymerase. See, e.g., U.S. Pat. Nos. 5,795,762 and 5,466,591.
  • a fusion protein comprises a DNA polymerase that lacks an exonuclease domain.
  • the exonuclease domain is a 5′ to 3′ exonuclease domain.
  • Exemplary DNA polymerases that lack a 5′ to 3′ exonuclease domain include, but are not limited to, the large (“Klenow”) fragment of E. coli DNA polymerase I and the “Stoffel” fragment of Taq DNA polymerase, which lacks about the first 289-300 N-terminal amino acids of full-length Taq DNA polymerase. See Lawyer at al. (1989) J. Biol. Chem. 264:6427-6437(describing a “Stoffel” fragment); Vainshtein et al. (1996) Protein Science 5:1785-1792.
  • a fusion protein comprises a DNA polymerase having one or more mutations that reduce the ability of the polymerase to discriminate against the incorporation of dideoxynucleotides.
  • Certain exemplary mutations are described, for example, in U.S. Pat. No. 6,333,183; EP 0 745 676 B1; and U.S. Pat. No. 5,614,365.
  • One such exemplary mutation is the F667Y mutation in Taq DNA polymerase. See, e.g., U.S. Pat. No. 5,614,365.
  • a fusion protein comprises a DNA polymerase having one or more mutations that reduce the ability of the polymerase to discriminate against the incorporation of fluorescently labeled nucleotides into polynucleotides.
  • such “discrimination reduction” mutations occur within the nucleotide label interaction region of a DNA polymerase, which is described, for example, in U.S. Pat. No. 6,265,193. Exemplary discrimination reduction mutations are provided in U.S. Pat. No. 6,265,193.
  • a DNA polymerase further comprises one or more mutations in addition to one or more discrimination reduction mutations.
  • such mutations include, but are not limited to, mutations that increase or decrease 3′ to 5′ exonuclease activity; increase or decrease 5′ to 3′ exonuclease activity; increase or decrease thermostability; increase or decrease processivity; and increase incorporation of dideoxynucleotides.
  • a DNA polymerase comprises one or more discrimination reduction mutations and one or more mutations that decrease 3′ to 5′ exonuclease activity.
  • a DNA polymerase comprises one or more discrimination reduction mutations and one or more mutations that increase incorporation of dideoxynucleotides.
  • Such DNA polymerases are described, for example, in U.S. Pat. No. 6,265,193.
  • a fusion protein comprises a thermostable DNA polymerase.
  • a thermostable DNA polymerase is a naturally occurring thermostable DNA polymerase.
  • a thermostable DNA polymerase is a fragment or variant of a naturally occurring thermostable DNA polymerase that possesses polymerase activity. Exemplary guidance for determining certain such fragments and variants is provided in Pavlov et al. (2004) Trends in Biotech. 22:253-260.
  • thermostable DNA polymerases are known to those skilled in the art. See, e.g., Sambrook et al. (2001) Molecular Cloning: A Laboratory Manual (3 rd ed., Cold Spring Harbor Laboratory Press, NY) at 8.10-8.11. Such thermostable DNA polymerases include, but are not limited to, DNA polymerases from the genus Thermus, Thermococcus, Thermotoga, Bacillus , and Pyrococcus .
  • thermostable DNA polymerases include, but are not limited to, DNA polymerases from Thermus aquaticus (e.g., Taq DNA polymerase), Thermus brockianus (e.g., Tbr polymerase), Thermus flavus (e.g., Tfl DNA polymerase), Thermus caldophilus, Thermus filiformis, Thermus oshimai, Thermus thermophilus (e.g., Tth DNA polymerase), and Thermus ubiquitus .
  • Certain other thermostable DNA polymerases from Thermus include, but are not limited to, Tsps17 and TZ05.
  • thermostable DNA polymerase comprises a DNA polymerase from Thermococcus litoralis (e.g., Tli polymerase), Thermococcus kodakarensis KOD1 (e.g., KOD DNA polymerase), or Thermococcus gorgonarius (e.g., Tgo DNA polymerase).
  • Thermococcus litoralis e.g., Tli polymerase
  • Thermococcus kodakarensis KOD1 e.g., KOD DNA polymerase
  • Thermococcus gorgonarius e.g., Tgo DNA polymerase
  • thermostable DNA polymerase comprises a DNA polymerase from Thermotoga neapolitana (e.g., Tne DNA polymerase) or Thermotoga maritima (e.g., Tma DNA polymerase). See, e.g., US 2003/0092018 A1 and US 2003/0162201 A1.
  • a thermostable DNA polymerase comprises a DNA polymerase from Thermosipho africanus (e.g., Taf DNA polymerase). Certain fragments and variants of Tma, Taf, and Tne DNA polymerases are known to those skilled in the art. See, e.g., US 2003/0092018 A1, US 2003/0162201 A1, U.S. Pat. No. 5,795,762; and and U.S. Pat. No. 5,466,591.
  • thermostable DNA polymerases include, but are not limited to, DNA polymerases from Pyrococcus furiosus (e.g., Pfu DNA polymerase), Pyrococcus woesei (e.g., Pwo polymerase), Pyrococcus spp. GB-D, and Pyrolobus fumarius . See, e.g., U.S. Pat. No. 6,489,150 B1, U.S. Pat. No. 6,673,585 B1, U.S. Pat. No. 5,948,666, U.S. Pat. No. 6,492,511, and EP 0 547 359 B1. Certain fragments and variants of Pfu polymerase are known to those skilled in the art.
  • thermostable DNA polymerase comprises a DNA polymerase from Bacillus stearothermophilus or a variant or fragment thereof, such as the “large fragment” of Bst DNA polymerase.
  • a thermostable DNA polymerase comprises a DNA polymerase from the thermophilic bacterium designated Tsp JS1. See, e.g., US 2004/0005573 A1. Certain fragments and variants of a thermostable DNA polymerase from Tsp JS1 are known to those skilled in the art. Id.
  • a fusion protein comprising a nucleic acid binding polypeptide and a nucleic acid modification enzyme is produced using recombinant methods.
  • a polynucleotide encoding a nucleic acid binding polypeptide and a polynucleotide encoding a nucleic acid modification enzyme are ligated together in the same reading frame, resulting in a polynucleotide encoding a fusion protein.
  • a polynucleotide encoding a nucleic acid binding polypeptide is obtained as described in Part IV: C.1 above.
  • a polynucleotide encoding a nucleic acid modification enzyme is obtained by the polymerase chain reaction (PCR). Certain methods employing PCR are known to those skilled in the art.
  • PCR polymerase chain reaction
  • a polynucleotide comprising all or a portion of the coding sequence of a nucleic acid modification enzyme is amplified using appropriate primers.
  • restriction enzyme sites are included in the primers to facilitate cloning of the amplification product into an appropriate expression vector.
  • the polynucleotide encoding a nucleic acid modification enzyme is a polynucleotide encoding a DNA polymerase.
  • polynucleotide sequences encoding certain DNA polymerases are known to those skilled in the art. See, e.g., Ito et al. (1991) Nuc. Acids. Research 19:4045-4057; Braithwaite et al. (1993) Nuc. Acids. Research 21:787-802; and Fileé et al. (2002) J. Mol. Evol. 54:763-773.
  • the polynucleotide encoding a DNA polymerase is a polynucleotide encoding Taq DNA polymerase or a fragment or variant thereof having polymerase activity.
  • the polynucleotide encoding a DNA polymerase is a polynucleotide encoding Pfu DNA polymerase or a fragment or variant thereof having polymerase activity.
  • a polynucleotide encoding a fusion protein is cloned into a suitable vector.
  • the vector is transferred (e.g., transformed or transfected) into a suitable host cell.
  • a polynucleotide encoding a fusion protein is cloned into an expression vector and, in certain embodiments, expressed in a suitable host cell.
  • Certain exemplary expression vectors and host cells are known to those skilled in the art, as described in Part IV.C.1 above.
  • the fusion protein is isolated from the host cell.
  • a nucleic acid binding polypeptide is joined to the N-terminus of a nucleic acid modification enzyme. In certain embodiments of a fusion protein, a nucleic acid binding polypeptide is joined to the C-terminus of a nucleic acid modification enzyme. In certain embodiments of a fusion protein, a nucleic acid binding polypeptide is disposed internally within a nucleic acid modification enzyme.
  • a nucleic acid binding polypeptide is joined to the N-terminus of a DNA polymerase. In certain embodiments, of a fusion protein, a nucleic acid binding polypeptide is joined to the C-terminus of a DNA polymerase. In certain embodiments of a fusion protein, a nucleic acid binding polypeptide is disposed internally within a DNA polymerase. Certain three dimensional structures of certain DNA polymerases are known to those skilled in the art. See, e.g. Steitz (1999) J. Biol. Chem. 274:17395-17398; Albà (2001) Genome Biol. 2:3002.1-3002.4.
  • DNA polymerases typically have a “hand-like” three dimensional structure comprising “finger,” “palm,” and “thumb” domains. See, e.g., Steitz (1999) J. Biol. Chem. 274:17395-17398; Albà (2001) Genome Biol. 2:3002.1-3002.4.
  • the nucleic acid binding polypeptide occurs within a loop in the “thumb” domain of the DNA polymerase. See, e.g., U.S. Pat. No. 5,972,603, e.g., FIG. 4.
  • one skilled in the art can routinely determine whether a DNA polymerase retains polymerase activity in the context of a fusion protein by assaying the fusion protein for polymerase activity.
  • a nucleic acid binding polypeptide is joined to a nucleic acid modification enzyme by chemical methods.
  • the nucleic acid modification enzyme is a DNA polymerase.
  • a nucleic acid binding polypeptide is joined to a nucleic acid modification enzyme by a chemical coupling agent. Certain such methods are known to those skilled in the art. See, e.g., Hermanson, ed., Bioconiuqate Techniques (Academic Press 1996).
  • a nucleic acid binding polypeptide is joined to a nucleic acid modification enzyme by a linker.
  • a linker is a peptide, which is joined by peptide bonds to a nucleic acid binding polypeptide and to a nucleic acid modification enzyme.
  • a linker is engineered into a fusion protein by standard recombinant methods.
  • a polynucleotide encoding a fusion protein is constructed, wherein a polynucleotide encoding a linker is in frame with and disposed between a polynucleotide encoding a nucleic acid modification enzyme and a polynucleotide encoding a nucleic acid binding polypeptide.
  • a linker is any whole number of amino acids less than or equal to 25. In certain embodiments, a linker does not form an ⁇ -helix or ⁇ -strand. In certain such embodiments, a linker forms an extended, or “loop,” conformation. In certain embodiments, a linker sequence comprises one or more glycine residues. In certain embodiments, a suitable linker sequence is determined using the LINKER program. See, e.g., Crasto et al. (2000) Protein Eng. 13:309-312.
  • linkers include, but are not limited to, carbohydrate linkers, lipid linkers, fatty acid linkers, and polymeric linkers.
  • exemplary polymeric linkers include, but are not limited to, polyether linkers, such as polyethylene glycol (PEG).
  • one or more nucleic acid binding polypeptides are used to stabilize a nucleic acid duplex from denaturation at temperatures above the Tm of the nucleic acid duplex, thereby effectively increasing the Tm of the nucleic acid duplex.
  • one or more nucleic acid binding polypeptides are combined with a nucleic acid duplex.
  • the ratio of the concentration of a nucleic acid binding polypeptide to the concentration of the nucleic acid duplex (in nucleotides) is at least about 1:25, 1:10, 1:5, 1:3, 1:1, or any ratio wherein the concentration of the nucleic acid binding polypeptide exceeds that of the nucleic acid duplex.
  • one or more nucleic acid binding polypeptides are used to promote the annealing of complementary nucleic acid strands. In certain embodiments, annealing takes place with greater rapidity and specificity in the presence of a nucleic acid binding polypeptide than in the absence of a nucleic acid binding polypeptide. In certain embodiments, complementary nucleic acid strands are allowed to anneal in a composition comprising one or more nucleic acid binding polypeptides. In certain such embodiments, a nucleic acid binding polypeptide is present at any concentration from about 1 ⁇ g/ml to about 500 ⁇ g/ml.
  • one or more nucleic acid binding polypeptides are used to favor the annealing of nucleic acid strands that are complementary without mismatches over the annealing of nucleic acid strands that are complementary with mismatches.
  • nucleic acid binding polypeptides are used in hybridization-based detection assays or primer extension assays in which a probe or primer is annealed to a target nucleic acid sequence. Certain examples of the use of nucleic acid binding polypeptides in certain such assays are provided below.
  • one or more nucleic acid binding polypeptides are used to increase the efficiency, e.g., the speed and specificity, of a hybridization-based detection assay.
  • exemplary hybridization-based detection assays include, but, are not limited to, assays in which target nucleic acid is immobilized on a solid support and exposed to a labeled probe (see, e.g., Sambrook et al. (2001) Molecular Cloning: A Laboratory Manual (3 rd ed., Cold Spring Harbor Laboratory Press, NY), e.g., at 6.33-6.58 (describing “Southern” hybridizations).
  • exemplary hybridization-based detection assays include microarray-based assays in which target nucleic acid is labeled and exposed to a plurality of polynucleotides immobilized on a solid support. See id. Appendix 10. An example of the use of the nucleic acid binding polypeptide Sso7d in a microarray-based detection assay is described, e.g., in Hatakeyama, US 2003/0022162 A1.
  • a nucleic acid probe is exposed to a mixture of nucleic acids.
  • a target nucleic acid which, comprises a sequence that is complementary to the probe.
  • the probe specifically anneals to the target nucleic acid to form a hybridization complex under certain conditions, e.g., conditions in which the probe is exposed to the target nucleic acid for an appropriate length of time and at an annealing temperature below that of the predicted Tm of the probe.
  • one or more nucleic acid binding polypeptides are used to increase the Tm of a probe, thereby increasing the temperature at which the annealing may be carried out.
  • the annealing is carried out in the presence of one or more nucleic acid binding polypeptides.
  • the annealing takes place at any temperature from 10° C. below to 40° C. above the predicted Tm of the probe. In certain such embodiments, the annealing takes place at a temperature up to 40° C. above the predicted Tm of the probe.
  • annealing takes place in the presence of one or more nucleic acid binding polypeptides at any temperature between 40° C. and 85° C.
  • one or more nucleic acid binding polypeptides are used to increase the Tm of a probe, thereby allowing the use of shorter probes.
  • the annealing is carried out in the presence of one or more nucleic acid binding polypeptides.
  • a probe is of any length between 12 and 25 nucleotides. In certain such embodiments, a probe is of any length between 12 and 19 nucleotides. In certain such embodiments, a probe is of any length between 12 and 16 nucleotides.
  • one or more nucleic acid binding polypeptides are used to decrease the duration of time to achieve annealing.
  • the annealing is carried out in the presence of one or more nucleic acid binding polypeptides.
  • the annealing takes place over any amount of time from about 0.5 minute to about three hours. In certain such embodiments, the annealing takes place over any amount of time from about 1 minute to about 30 minutes. In certain such embodiments, the annealing takes place over any amount of time from about 1 minute to about 15 minutes.
  • a probe may selectively hybridize to a target nucleic acid that is complementary without mismatches to the probe. In certain embodiments, a probe may also selectively hybridize to a target nucleic acid that is complementary to the probe but that contains one or more mismatches relative to the probe. In certain embodiments, one or more nucleic acid binding polypeptides are used to favor the hybridization of a probe to a target nucleic acid that is complementary without mismatches to the probe over the hybridization of a probe to a target nucleic acid that is complementary but that contains one or more mismatches relative to the probe. Thus, in certain embodiments, the specificity of hybridization is increased. In certain such embodiments, annealing is carried out under any of the conditions of time or temperature described above. In certain such embodiments, annealing is carried out at a temperature greater than the predicted Tm of the probe.
  • nucleic acid binding polypeptides can substantially increase the speed and specificity of a hybridization-based detection assay, such polypeptides can be used in certain hybridization-based “point-of-use” devices.
  • Point-of-use devices are typically portable devices that allow rapid diagnosis or detection of a physiological or pathological condition, in certain instances, in a non-clinical or small-scale laboratory setting.
  • An exemplary point-of-use device is, for example, a typical pregnancy test.
  • An exemplary point-of-use device that uses hybridization-based detection is, for example, the Affirm VPIII Microbial Identification System (Becton Dickinson and Company—BD Diagnostics, Sparks, Md.), whereby the presence of certain vaginal pathogens is detected in vaginal swab specimens using an oligonucleotide hybridization assay.
  • Affirm VPIII Microbial Identification System Becton Dickinson and Company—BD Diagnostics, Sparks, Md.
  • one or more nucleic acid binding polypeptides can be used in a hybridization-based point-of-use device that diagnoses a pathological condition, such as an infection, by detecting genetic material from a pathogen in a biological sample from a host.
  • a pathological condition such as an infection
  • the volume of a biological sample to be used with a point-of-use device is reduced in the presence of one or more nucleic acid binding polypeptides.
  • the hybridization-based point-of-use device utilizes microarray technology.
  • nucleic acid binding polypeptides can substantially increase the specificity of a hybridization-based detection assay
  • one or more nucleic acid binding polypeptides can be used in assays that detect mutations or polymorphisms in a target polynucleotide.
  • one or more nucleic acid binding polypeptides can be used in assays that detect single nucleotide polymorphisms (SNPs).
  • SNPs single nucleotide polymorphisms
  • one or more nucleic acid binding polypeptides are used in assays that detect rare copies of a target polynucleotide in a complex mixture of nucleic acids.
  • the target polynucleotide comprises genetic material from a pathogen contained within a biological sample from a host.
  • one or more nucleic acid binding polypeptides are used to increase the Tm of a primer in a primer extension reaction.
  • primer extension reactions such as PCR
  • one or more primers are annealed to a template nucleic acid.
  • the annealing typically takes place over 30 seconds at about 55° C., a temperature that is less than the predicted Tm of a typical primer of about 20-30 nucleotides.
  • one or more nucleic acid binding polypeptides are used to increase the Tm of a primer in a primer extension reaction, thereby increasing the temperature at which the annealing may be carried out.
  • the annealing is carried out in the presence of one or more nucleic acid binding polypeptides.
  • the annealing is carried out at any temperature from about 55° C. up to about 75° C.
  • the annealing is carried out at any temperature between 60° C. and 70° C.
  • increased annealing temperature reduces certain primer artifacts, such as primer dimmers and hairpin formation.
  • one or more nucleic acid binding polypeptides are used to increase the Tm of a primer in a primer extension reaction, thereby allowing the use of shorter primers.
  • the annealing is carried out in the presence of one or more nucleic acid binding polypeptides.
  • a primer is of any length between 12 and 19 nucleotides. In certain such embodiments, a primer is of any length between 12 and 16 nucleotides.
  • one or more nucleic acid binding polypeptides are used to enhance the activity of a nucleic acid modification enzyme.
  • one or more nucleic acid binding polypeptides are included in a composition comprising a nucleic acid modification enzyme and a nucleic acid, thus enhancing the activity of the nucleic acid modification enzyme.
  • the enhancement in the activity of a nucleic acid modification enzyme is demonstrated by comparing the activity of the nucleic acid modification enzyme in the presence of one or more nucleic acid binding polypeptides with its activity in the absence of one or more nucleic acid binding polypeptides.
  • the following assays may be used to evaluate the activity of a nucleic acid modification enzyme:
  • the activity of a gyrase or topoisomerase is assessed by determining the change in the supercoiled state of a nucleic acid exposed to the gyrase or topoisomerase in the presence and in the absence of one or more nucleic acid binding polypeptides.
  • the activity of a nuclease is assessed by determining the amount of cleavage product produced by the nuclease in the presence and in the absence of one or more nucleic acid binding polypeptides.
  • the activity of a restriction endonuclease is assessed by exposing a nucleic acid to a restriction endonuclease in the presence and in the absence of one or more nucleic acid binding polypeptides.
  • the extent of digestion by the restriction endonuclease is determined by gel electrophoresis.
  • the activity of a methylase is determined by assessing the methylation state of a nucleic acid exposed to a methylase in the presence and in the absence of one or more nucleic acid binding polypeptides.
  • the methylation state of the nucleic acid is assessed, for example, by determining the extent to which the nucleic acid is cleaved by a methylation sensitive restriction endonuclease, such as MboI.
  • the activity of a ligase is assessed by determining the amount of ligation product produced by the ligase in the presence and in the absence of one or more nucleic acid binding polypeptides.
  • a circularized plasmid is linearized by a restriction endonuclease, isolated from the restriction endonuclease, and exposed to ligase in the presence and in the absence of one or more nucleic acid binding polypeptides.
  • the ligation reaction mixture is used to transform competent bacteria.
  • the number of transformants is proportional to the activity of the ligase.
  • the activity of a polymerase is assessed in the presence and in the absence of one or more nucleic acid binding polypeptides using the polymerase activity assay described above in Part IV.A.
  • one or more nucleic acid binding polypeptides are used to improve the performance of DNA polymerase.
  • improved performance of DNA polymerase is increased processivity of the DNA polymerase in a primer extension reaction.
  • the primer extension reaction is PCR.
  • the inclusion of one or more nucleic acid binding polypeptides in a PCR reaction allows for more efficient amplification of targets under suboptimal conditions, such as high salt concentrations. Examples of certain high salt concentrations include from 60 mM KCl to 130 mM KCl for Taq DNA polymerase, and from 40 mM KCl to 130 mM KCl for Pfu polymerase.
  • the inclusion of one or more nucleic acid binding polypeptides in a PCR reaction decreases the time of the extension step of PCR to, for example, ⁇ 5 minutes, ⁇ 3 minutes, ⁇ 2 minutes, ⁇ 1 minute, or ⁇ 30 seconds. In certain embodiments, the inclusion of one or more nucleic acid binding polypeptides in a PCR reaction allows for more efficient amplification of long targets, for example, targets from about 5 kb to about 20 kb.
  • a fusion protein comprising a nucleic acid binding polypeptide and a nucleic acid modification enzyme is used in any method that uses a nucleic acid binding polypeptide (as described, for example, in Part IV.D. above), except that the fusion protein replaces the nucleic acid binding polypeptide in the method.
  • a fusion protein comprising a nucleic acid binding polypeptide and a nucleic acid modification enzyme is used in any method that uses a nucleic acid binding polypeptide (as described, for example, in Part IV.D. above), except that the fusion protein is used in combination with the nucleic acid binding polypeptide in the method.
  • a fusion protein comprising a nucleic acid binding polypeptide and a nucleic acid modification enzyme is used in any reaction in which the nucleic acid modification enzyme alone can be used.
  • a fusion protein comprising a nucleic acid binding polypeptide and a nucleic acid modification enzyme is used to improve the efficiency of any reaction in which the nucleic acid modification enzyme alone can be used.
  • a fusion protein comprising a nucleic acid binding polypeptide and a nucleic acid modification enzyme has increased activity relative to the nucleic acid modification enzyme alone.
  • the assays set forth in Part IV.D.3 above may be used to evaluate the activity of a nucleic acid modification enzyme or a fusion protein comprising a nucleic acid binding polypeptide and a nucleic acid modification enzyme.
  • a fusion protein comprising a nucleic acid binding polypeptide and a DNA polymerase has increased processivity relative to the DNA polymerase alone.
  • a fusion protein comprising a nucleic acid binding polypeptide and a DNA polymerase is used in a primer extension reaction.
  • the fusion protein increases the efficiency of the primer extension reaction.
  • a fusion protein comprising a nucleic acid binding polypeptide and a DNA polymerase is included in a primer extension reaction to increase the Tm of one or more primers in the reaction.
  • the temperature at which annealing is carried out may be increased. In certain embodiments, shorter primers may be used.
  • a fusion protein comprising a nucleic acid binding polypeptide and a thermostable DNA polymerase is included in a PCR reaction. In certain such embodiments, the fusion protein increases the efficiency of PCR. In certain embodiments, a fusion protein comprising a nucleic acid binding polypeptide and a thermostable DNA polymerase is included in a PCR reaction that is conducted under suboptimal conditions, such as high salt concentrations. Exemplary high salt concentrations are described above in Part IV.D.4. In certain embodiments, a fusion protein comprising a nucleic acid binding polypeptide and a thermostable DNA polymerase is included in a PCR reaction to decrease the time of the extension step of PCR.
  • a fusion protein comprising a nucleic acid binding polypeptide and a thermostable DNA polymerase is included in a PCR reaction to more efficiently amplify long targets.
  • Exemplary target lengths are provided above in Part IV.D.4.
  • a fusion protein comprising a nucleic acid binding polypeptide and a thermostable DNA polymerase is included in a PCR reaction to increase the amount of PCR amplification product.
  • a fusion protein comprising a nucleic acid binding polypeptide and a thermostable DNA polymerase is used in “hot start” PCR.
  • “hot start” PCR is used to suppress non-specific binding of primer to template. See, e.g., Sambrook et al. (2001) Molecular Cloning: A Laboratory Manual (3 rd ed., Cold Spring Harbor Laboratory Press, NY) at 8.110 (describing “hot start” PCR).
  • one or more components to be used in a PCR are prevented from functioning in the PCR until the reaction mixture reaches or exceeds a temperature at which non-specific priming does not occur. Id.
  • thermostable DNA polymerase an antibody to the thermostable DNA polymerase is used to reversibly block polymerase activity until a suitable temperature is reached. See, e.g., Kellogg et al. (1994) Biotechnigues 16:1134-1137 (describing the use of antibodies to Tag DNA polymerase).
  • a fusion protein comprising a nucleic acid binding polypeptide and a thermostable DNA polymerase is used in “hot start” PCR.
  • an antibody to the nucleic acid binding polypeptide is used to reversibly block nucleic acid binding activity and/or polymerase activity until a suitable temperature is reached.
  • kits are provided.
  • a kit comprises any one or more of the nucleic acid binding polypeptides described above.
  • a kit further comprises a nucleic acid modification enzyme.
  • the nucleic acid modification enzyme is a DNA polymerase.
  • the DNA polymerase is a thermostable DNA polymerase.
  • a kit further comprises deoxynucleotides.
  • a kit further comprises dideoxynucleotides.
  • a kit comprises any one or more of the fusion proteins described above.
  • the fusion protein comprises a nucleic acid binding polypeptide and a DNA polymerase.
  • the DNA polymerase is a thermostable DNA polymerase.
  • a kit further comprises deoxynucleotides.
  • a kit further comprises dideoxynucleotides.
  • a polynucleotide encoding SEQ ID NO:1 was constructed by ligating the following oligonucleotides (SEQ ID NOs:8-10) end-to-end, such that the 5′ end of SEQ ID NO:9 was ligated to the 3′ end of SEQ ID NO:8, and the 5′ end of SEQ ID NO:10 was ligated to the 3′ end of SEQ ID NO:9.
  • SEQ ID NO: 8 5′ atgtccaaga agcagaaact Gaagttctac gacatTaagg cgaagcaggc gtttgag 3′
  • SEQ ID NO: 9 5′ acCgaccagt acgaggttat tgagaagcag acCgcccgcg gtccgatgat gttcgcc 3′
  • SEQ ID NO: 10 5′ gtggccaaat cgccgtacac cggcatTaaa gtGtacCgCc tgttaggcaa gaagaaataa 3′
  • the capital letters in SEQ ID NOs:8-10 represent changes from the naturally occurring PAE3192 sequence (SEQ ID NO:2). Those changes were made to generate codons more favorable for the expression of SEQ ID NO:1 in E. coli . Those changes do not result in any alterations in the amino
  • SEQ ID NOS:11-12 oligonucleotides
  • SEQ ID NO: 11 5′ gtactggtcg gtctcaaacg cctg 3′
  • SEQ ID NO: 12 5′ cgatttggcc acggcgaaca tcat 3′
  • SEQ ID NO:11 is complementary to the 3′ end of SEQ ID NO:8 and the 5′ end of SEQ ID NO:9.
  • the annealing of SEQ ID NO:11 to SEQ ID NOs:8-9 created a region of double stranded DNA where SEQ ID NO:11 spans the junction of SEQ ID NOS:8-9. This region of double stranded DNA was a suitable substrate for DNA ligase.
  • SEQ ID NO:12 is complementary to the 3′ end of SEQ ID NO:9 and the 5′ end of SEQ ID NO:10.
  • the annealing of SEQ ID NO:12 to SEQ ID NOS:9-10 created a region of double stranded DNA where SEQ ID NO:12 spans the junction of SEQ ID NOS:9-10.
  • SEQ 10 NOs:8-10 were then ligated.
  • the resulting polynucleotide (SEQ ID NO:13) was amplified by PCR.
  • a polynucleotide encoding SEQ ID NO:6 was constructed by ligating the following oligonucleotides (SEQ ID NOs:14-16) end-to-end:
  • SEQ ID NO: 14 5′ atgccGaaga aggagaagat Taagttcttc gacctGgtcg ccaagaagta ctacgag 3′
  • SEQ ID NO: 15 5′ actgacaact acgaagtcga gatTaaggag actaagCgCg gcaagtttCg Cttcgcc 3′
  • SEQ ID NO: 16 5′ aaagccaaga gcccgtacac cggcaagatc ttctatCgCg tgctGggcaa agcctag 3′
  • the capital letters represent changes from the naturally occurring APE3192 sequence (SEQ ID NO:7). Those changes were made to generate codons more favorable for the expression of SEQ ID NO:6 in E. coli . Those changes do not result in any alterations in the amino acid sequence of SEQ ID NO:6.
  • SEQ ID NOs:17-18 The following oligonucleotides (SEQ ID NOs:17-18) were annealed to SEQ ID NOs:14-16 to create regions of double stranded DNA spanning the junctions between SEQ ID NOs:14-15 and SEQ ID NOs:15-16.
  • SEQ ID NO: 17 5′ gtagttgtca gtctcgtagt actt 3′
  • SEQ ID NO: 18 5′ gctcttggct ttggcgaagc gaaa 3′ SEQ ID NOs:14-16 were then ligated.
  • the resulting polynucleotide (SEQ ID NO:19) was amplified by PCR.
  • SEQ ID NO:13 was cloned into the pET16b vector (Novagen, Milwaukee, Wis.) using standard recombinant methods. That vector allows expression of the cloned sequences from the inducible T7 promoter. It also includes sequences encoding polyhistidine (10 ⁇ His) (SEQ ID NO: 48) followed by a Factor Xa cleavage site upstream of the cloning site. Thus, the encoded proteins are tagged at their N-termini with a polyhistidine moiety. Recombinant vector comprising SEQ ID NO:13 was transformed into competent E. coli host cells using standard methods.
  • SEQ ID NO:19 was also cloned into the pET16b vector using standard recombinant methods. Recombinant vector comprising SEQ ID NO:19 was transformed into competent E. coli host cells using standard methods.
  • Host cells containing a recombinant vector comprising SEQ ID NO:13 are induced to express a tagged polypeptide comprising SEQ ID NO:1 by adding IPTG to the media in which the host cells are grown.
  • the tagged polypeptide is isolated from the host cells by affinity chromatography using nickel-NTA resin.
  • the polyhistidine tag is removed from the isolated polypeptide by treatment with Factor Xa.
  • Host cells containing a recombinant vector comprising SEQ ID NO:19 are induced to express a tagged polypeptide comprising SEQ ID NO:6 by adding IPTG to the media in which the host cells are grown.
  • the tagged polypeptide is isolated from the host cells by affinity chromatography using nickel-NTA resin.
  • the polyhistidine tag is removed from the isolated polypeptide by treatment with Factor Xa.
  • nucleic acid binding polypeptide to stabilize a DNA duplex from thermal denaturation is demonstrated by the following assay, which measures the increase in the Tm of a nucleic acid in the presence of a nucleic acid binding polypeptide. See, e.g., Baumann et al. (1994) Nature Struct. Biol. 1:808-819; and McAfee et al. (1995) Biochem. 34:10063-10077.
  • Poly(dI-dC) at a concentration of about 70 ⁇ M (in nucleotides) is combined with a nucleic acid binding polypeptide at a concentration of about 350 ⁇ M in 5 mM Tris.Cl (pH 7.0).
  • Poly(dI-dC) at a concentration of about 70 ⁇ M (in nucleotides) in 5 mM Tris.Cl (pH 7.0) without a nucleic acid binding polypeptide is used as a negative control.
  • the absorbance of the poly(dI-dC) with and without a nucleic acid binding polypeptide is measured at 260 nm as a function of temperature using a spectrophotometer.
  • the temperature is increased in steps, and absorbance is measured at each step. For each step, the temperature is raised by 1° C. over 30 seconds, followed by a holding time of 60 seconds prior to the measuring of absorbance.
  • a melting curve is generated based on the increase in absorbance as a function of temperature.
  • the Tm (temperature at which 50% of the poly(dI-dC) is denatured) occurs at the inflection point of the melting curve.
  • the Tm of poly(dI-dC) in the negative control is subtracted from the Tm of poly(dI-dC) in the presence of a nucleic acid binding polypeptide to determine the increase in Tm due to the presence of the nucleic acid binding polypeptide.
  • a 5′ FAM-labeled primer of sequence 5′ gttttcccagtcacgacgttgtaaacgacggcc 3′ (SEQ ID NO:29) is added to single stranded M13mp18 DNA in a reaction composition comprising 10 mM Tris-HCl pH 8.8, 50 mM KCl, 2.5 mM MgCl 2 , 250 ⁇ m dNTPs, and 0.1% Triton X-100.
  • the concentrations of the primer and M13mp18 template are 50 nM and 80 nM, respectively.
  • the primer is annealed to the single stranded M13mp18 DNA template by heating the mixture to 90° C. for 5 minutes, cooling to 72° C. at 0.1° C. per second, incubating at 72° C. for 10 minutes, and cooling to 4° C. at 0.1° C. per second.
  • a thermostable DNA polymerase such as Taq DNA polymerase, is added to the above reaction composition at a concentration of about 1:4000 (DNA polymerase:template).
  • Two parallel reactions are prepared.
  • a nucleic acid binding polypeptide is added to a final concentration of about 70 ⁇ g/ml in 20 ⁇ l of the above reaction composition.
  • the second parallel reaction contains 20 ⁇ l of the above reaction composition with no added nucleic acid binding polypeptide.
  • DNA synthesis is initiated in the reaction compositions by incubating them at 72° C. Samples from each reaction are taken at various time points. The samples are diluted in gel loading dye, and the primer extension products in the samples are analyzed by denaturing polyacrylamide gel electrophoresis using an ABI 377 DNA Sequencer. The median product length is determined based on the integration of all detectable primer extension products. When the median product length does not change with an increase in reaction time or a decrease in polymerase concentration (to ensure that the template is in excess), that length is used as a measure of processivity.
  • nucleic acid binding polypeptide to increase the specificity of a hybridization-based detection assay is measured using an annealing assay based on that of Guagliardi at al. (1997) J. Mol. Biol. 267:841-848.
  • a first set of two reaction compositions is prepared as follows: In a first reaction composition, a single stranded M13mp18 circular DNA (0.05 ⁇ mol) is combined with an equal amount of 32 P end-labeled oligonucleotide of sequence 5′-gtaaaacgacggccagt-3′ (SEQ ID NO:20) in a buffered reaction mixture (20 mM Tris-HCl pH 7.5, 2 mM DTT, 5 mM MgCl2, 100 ⁇ g/ml BSA).
  • single stranded M13mp18 circular DNA (0.05 pmol) is combined with an equal amount of 32 P end-labeled oligonucleotide of sequence 5′-gtaaaacgtcggccagt-3′ (SEQ ID NO:21) in a buffered reaction mixture (20 mM Tris-HCl pH 7.5, 2 mM DTT, 5 mM MgCl2, 100 ⁇ g/ml BSA).
  • the nucleotide indicated in bold is a mismatch with respect to the M13mp18 DNA sequence.
  • a nucleic acid binding polypeptide is added separately to both reaction compositions at a final concentration of about 5 ⁇ g/ml.
  • a second set of two reaction compositions is prepared.
  • the second set is the same as the first set of reaction compositions, except that a nucleic acid binding polypeptide is not added to either the first or second reaction composition of the second set of reaction compositions.
  • the final volume of each reaction composition is 10 ⁇ l.
  • reaction compositions are incubated at 60° C. for three minutes.
  • the reactions are stopped by adding 1% SDS in standard loading dye to each reaction composition.
  • the reactions are analyzed by 1.5% agarose gel electrophoresis followed by autoradiography to visualize annealed product, which can be distinguished from unannealed probe by its slower mobility.
  • Annealed product is quantified for each reaction using standard densitometric methods.
  • An increase in the amount of annealed product in the first reaction compared to the second reaction is determined for both sets of reactions.
  • the ability of a nucleic acid binding polypeptide to increase the specificity of hybridization is demonstrated by a larger increase in the amount of annealed product for the first set of reactions compared to the second set of reactions.
  • a typical microarray-based detection assay may be performed, for example, using the Mouse Genome Survey Microarray system (Applied Biosystems, Foster City, Calif.; P/N 4345065). That system includes reagents, hybridization controls, and reference nucleic acids that can be used to detect selective hybridization of a reference nucleic acid to a probe (i.e., a portion of a mouse cDNA) immobilized on a microarray.
  • a probe i.e., a portion of a mouse cDNA
  • a nucleic acid binding polypeptide is added to the hybridization solution at a concentration of about 50 to 250 ug/m L.
  • the hybridization time is from about 1 to 30 minutes at a temperature of about 45° C. to 75° C.
  • the arrays are washed, and hybridization is detected using the Chemiluminescence Detection Kit (Applied Biosystems, Foster City, Calif., P/N 4342142) according to the manufacturer's instructions.
  • the arrays are analyzed using the Applied Biosystems 1700 Chemiluminescent Microarray Analyzer (Applied Biosystems, Foster City, Calif., P/N 4338036).
  • a fusion protein comprising SEQ ID NO:1 joined to the C-terminus of full length Pfu DNA polymerase was constructed as follows.
  • An NdeI-XhoI restriction fragment comprising the coding sequence of full length Pfu DNA polymerase in frame with SEQ ID NO:13 was cloned into the NdeI and XhoI sites of the pET16b vector (Novagen, Milwaukee, Wis.) using standard recombinant methods.
  • the resulting recombinant vector encodes a fusion protein comprising SEQ ID NO:1 joined to the C-terminus of Pfu DNA polymerase by a Gly-Thr-Gly-Gly-Gly-Gly (SEQ ID NO: 49) peptide linker.
  • a 10 ⁇ His affinity tag (SEQ ID NO: 48) is present at the N-terminus of the fusion protein.
  • the polynucleotide sequence encoding the fusion protein is shown in SEQ ID NO:22.
  • the amino acid sequence of the fusion protein is shown in SEQ ID NO:23.
  • the recombinant vector pDS2r was transformed into competent E. coli host cells. Host cells comprising pDS2r were induced to express the fusion protein of SEQ ID NO:23 by adding IPTG to the media in which the host cells were grown. The fusion protein was isolated from the host cells by affinity chromatography using nickel-NTA resin.
  • the polyhistidine tag is removed from the fusion protein of SEQ ID NO:23 by treatment with Factor Xa to yield the fusion protein shown in SEQ ID NO:24.
  • a fusion protein comprising SEQ ID NO:6 joined to the C-terminus of full length Pfu DNA polymerase was constructed as follows: An NdeI-XhoI restriction fragment comprising the coding sequence of full length Pfu DNA polymerase in frame with SEQ ID NO:19 was cloned into the NdeI and XhoI sites of the pET16b vector using standard recombinant methods. The resulting recombinant vector (pDS1r) encodes a fusion protein comprising SEQ ID NO:6 joined to the C-terminus of Pfu DNA polymerase by a Gly-Thr-Gly-Gly-Gly-Gly (SEQ ID NO: 49) peptide linker.
  • a 10 ⁇ His affinity tag (SEQ ID NO: 48) is present at the N-terminus of the fusion protein.
  • the polynucleotide sequence encoding the fusion protein is shown in SEQ ID NO:25.
  • the amino acid sequence of the fusion protein is shown in SEQ ID NO:26.
  • the recombinant vector pDS1 r was transformed into competent E. coli host cells. Host cells comprising pDS1 r were induced to express the fusion protein of SEQ ID NO:26 by adding IPTG to the media in which the host cells were grown. The fusion protein was isolated from the host cells by affinity chromatography using nickel-NTA resin.
  • the polyhistidine tag is removed from the fusion protein of SEQ ID NO:26 by treatment with Factor Xa to yield the fusion protein shown in SEQ ID NO:27.
  • a fusion protein comprising SEQ ID NO:1 joined to the N-terminus of Taq DNA polymerase was constructed as follows. SEQ ID NO:13 was cloned in frame at the 5′ end of a polynucleotide encoding Taq DNA polymerase in the pET16b vector. The resulting recombinant vector (pDS17-7) encodes a fusion protein comprising SEQ ID NO:1 joined to the N-terminus of Taq DNA polymerase by a Gly-Gly-Val-Thr-Ser (SEQ ID NO: 50) peptide linker.
  • a 10 ⁇ His affinity tag (SEQ ID NO: 48) is present at the N-terminus of the fusion protein.
  • the polynucleotide sequence encoding the fusion protein is shown in SEQ ID NO:33.
  • the amino acid sequence of the fusion protein is shown in SEQ ID NO:34.
  • the recombinant vector pDS17-7 was transformed into competent host cells.
  • the fusion protein is induced in the host cells using IPTG.
  • the fusion protein is isolated from the host cells by affinity chromatography using nickel-NTA resin.
  • the polyhistidine tag is removed from the fusion protein of SEQ ID NO:34 by treatment with Factor Xa to yield the fusion protein shown in SEQ ID NO:35.
  • a fusion protein comprising SEQ ID NO:6 joined to the N-terminus of Taq DNA polymerase was constructed as follows. SEQ ID NO:19 was cloned in frame at the 5′ end of a polynucleotide encoding Taq DNA polymerase in the pET16b vector. The resulting recombinant vector (pDS16-3) encodes a fusion protein comprising SEQ ID NO:6 joined to Taq DNA polymerase by a Gly-Gly-Val-Thr-Ser (SEQ ID NO: 50) peptide linker. A10 ⁇ His affinity tag (SEQ ID NO: 48) is present at the N-terminus of the fusion protein.
  • the polynucleotide sequence encoding the fusion protein is shown in SEQ ID NO:36.
  • the amino acid sequence of the fusion protein is shown in SEQ ID NO:37.
  • the recombinant vector pDS16-3 was transformed into competent host cells.
  • fusion protein is induced in the host cells using IPTG.
  • the fusion protein is isolated from the host cells by affinity chromatography using nickel-NTA resin.
  • the polyhistidine tag is removed from the fusion protein of SEQ ID NO:37 by treatment with Factor Xa to yield the fusion protein shown in SEQ ID NO:38.
  • a fusion protein comprising SEQ ID NO:1 joined to the N-terminus of a Stoffel fragment of Taq DNA polymerase (amino acid residues 291-832 of SEQ ID NO:32) was constructed as follows. SEQ ID NO:13 was cloned in frame at the 5′ end of a polynucleotide encoding the Stoffel fragment in the pET16b vector. The resulting recombinant vector (pDS25-7) encodes a fusion protein comprising SEQ ID NO:1 joined to the N-terminus of the Stoffel fragment by a Gly-Gly-Val-Thr-Ser (SEQ ID NO: 50) peptide linker.
  • a 10 ⁇ His affinity tag (SEQ ID NO: 48) is present at the N-terminus of the fusion protein.
  • the polynucleotide sequence encoding the fusion protein is shown in SEQ ID NO:39.
  • the amino acid sequence of the fusion protein is shown in SEQ ID NO:40.
  • the recombinant vector pDS25-7 was transformed into competent host cells.
  • the fusion protein is induced in the host cells using IPTG.
  • the fusion protein is isolated from the host cells by affinity chromatography using nickel-NTA resin.
  • the polyhistidine tag is removed from the fusion protein of SEQ ID NO:40 by treatment with Factor Xa to yield the fusion protein shown in SEQ ID NO:41.
  • a fusion protein comprising SEQ ID NO:6 joined to the N-terminus of a Stoffel fragment of Taq DNA polymerase (amino acid residues 291-832 of SEQ ID NO:32) was constructed as follows. SEQ ID NO:19 was cloned in frame at the 5′ end of a polynucleotide encoding the Stoffel fragment in the pET16b vector. The resulting recombinant vector (pDS24-4) encodes a fusion protein comprising SEQ ID NO:6 joined to the N-terminus of the Stoffel fragment by a Gly-Gly-Val-Thr-Ser (SEQ ID NO: 50) peptide linker.
  • a 10 ⁇ His affinity tag (SEQ ID NO: 48) is present at the N-terminus of the fusion protein.
  • the polynucleotide sequence encoding the fusion protein is shown in SEQ ID NO:42.
  • the amino acid sequence of the fusion protein is shown in SEQ ID NO:43.
  • the recombinant vector pDS24-4 was transformed into competent host cells.
  • the fusion protein is induced in the host cells using IPTG.
  • the fusion protein is isolated from the host cells by affinity chromatography using nickel-NTA resin.
  • the polyhistidine tag is removed from the fusion protein of SEQ ID NO:43 by treatment with Factor Xa to yield the fusion protein shown in SEQ ID NO:44.
  • a 5′ FAM-labeled primer of sequence 5′ gttttcccagtcacgacgttgtaaacgacggcc 3′ (SEQ ID NO:29) is added to single stranded M13mp18 DNA in a mixture comprising 10 mM Tris-HCl pH 8.8, 50 mM KCl, 2.5 mM MgCl 2 , 250 ⁇ m dNTPs, and 0.1% Triton X-100.
  • the concentrations of the primer and M13mp18 template are 50 nM and 80 nM, respectively.
  • the primer is annealed to the single stranded M13mp18 DNA template by heating the mixture to 96° C. for 5 minutes, cooling to 72° C. at 0.1° C. per second, incubating at 72° C. for 10 minutes, and cooling to 4° C. at 0.1° C. per second.
  • a reaction composition is prepared in which a fusion protein comprising Taq DNA polymerase is added at a molar concentration of about 1:4000 (fusion protein:template) to 20 ⁇ l of the above mixture.
  • a control reaction composition is prepared in which Taq DNA polymerase is added at a molar concentration of about 1:4000 (DNA polymerase:template) to 20 ⁇ l of the above mixture.
  • DNA synthesis is initiated in the reaction compositions by incubating them at 72° C. Samples from each reaction are taken at various time points. The samples are diluted in gel loading dye, and the primer extension products are analyzed by denaturing polyacrylamide gel electrophoresis using an ABI 377 DNA Sequencer.
  • the median product length is determined based on the integration of all detectable primer extension products. When the median product length does not change with an increase in reaction time or a decrease in polymerase concentration, that length is used as a measure of processivity.
  • a 5′ FAM-labeled primer of sequence 5′ gttttcccagtcacgacgttgtaaacgacggcc 3′ (SEQ ID NO:29) is added to single stranded M13mp18 DNA in a mixture comprising 10 mM Tris-HCl pH 8.8, 50 mM KCl, 2.5 mM MgCl 2 , 250 ⁇ m dNTPs, and 0.1% Triton X-100.
  • the concentrations of the primer and M13mp18 template are 50 nM and 80 nM, respectively.
  • the primer is annealed to the single stranded M13mp18 DNA template by heating the mixture to 90° C. for 5 minutes, cooling to 72° C. at 0.1° C. per second, incubating at 72° C. for 10 minutes, and cooling to 4° C. at 0.1° C. per second.
  • a reaction composition is prepared in which a fusion protein comprising Pfu DNA polymerase is added at a molar concentration of about 1:4000 (fusion protein:template) to 20 ⁇ l of the above mixture.
  • a control reaction composition is prepared in which Pfu DNA polymerase is added at a molar concentration of about 1:4000 (DNA polymerase:template) to 20 ⁇ l of the above mixture.
  • DNA synthesis is initiated in the reaction compositions by incubating them at 72° C. Samples from each reaction are taken at various time points. The samples are diluted in gel loading dye, and the primer extension products are analyzed by denaturing polyacrylamide gel electrophoresis using an ABI 377 DNA sequencer.
  • the median product length is determined based on the integration of all detectable primer extension products. When the median product length does not change with an increase in reaction time or a decrease in polymerase concentration, that length is used as a measure of processivity.
  • the above assay may be modified so as to assess the processivity of a fusion protein comprising a DNA polymerase other than Taq or Pfu.
  • thermostable DNA polymerase e.g., Taq or Pfu
  • An exemplary PCR reaction is prepared which contains PCR buffer (1 ⁇ ), dNTPs (200 ⁇ M each), template DNA (250 ng), forward and reverse primers (0.25 ⁇ M each) and fusion protein (about 0.5 to 2.5 units) in a final volume of 50 ⁇ l.
  • thermostable DNA polymerase alone is used in place of the fusion protein.
  • the primers used in the PCR reaction are tPAF7 (5′-ggaagtacagctcagagttctgcagcacccctgc-3′ (SEQ ID NO:45)) and tPAR10 (5′-gatgcgaaactgaggctggctgtactgtctc-3′ (SEQ ID NO:46)).
  • the template DNA is human genomic DNA (Roche, Indianapolis, Ind., P/N 1-691-112).
  • the primers tPAF7 and tPAR10 amplify a product of approximately 5 kb from human genomic DNA.
  • the fusion protein being used in the PCR reaction comprises Pfu DNA polymerase
  • the standard PCR buffer for Pfu (Stratagene; La. Jolla, Calif.) is used, except that the KCl concentration is elevated.
  • the final working concentration (1 ⁇ ) of the buffer thus contains 20 mM Tris, pH 8:8; 10 mM (NH 4 ) 2 SO 4 , 0.1% Triton X-100, 2 mM MgSO 4 , 100 ⁇ g/mL BSA and 60 mM KCl.
  • the standard PCR buffer for Taq (Applied Biosystems, Foster City, Calif.) is used. Cycling is performed as follows:

Abstract

Polynucleotides having nucleic acid binding activity are provided. Methods of stabilizing a nucleic acid duplex are provided. Methods of promoting the annealing of complementary nucleic acid strands are provided. Methods of increasing the processivity of a DNA polymerase are provided. Methods of enhancing the activity of a nucleic acid modification enzyme are provided. Fusion proteins are provided. Methods of using fusion proteins are provided. Kits are provided.

Description

  • This application is a continuation application of U.S. patent application Ser. No. 13/619,099, filed Sep. 9, 2014, which is a continuation application of U.S. patent application Ser. No. 11/327,195, filed Jan. 6, 2006, now U.S. Pat. No. 8,349,600, which claims the benefit of priority to U.S. Provisional Application No. 60/641,987, filed Jan. 6, 2005.
  • I. FIELD
  • Polypeptides having nucleic acid binding activity are provided. Methods of using polypeptides having nucleic acid binding activity are provided. Fusion proteins and methods of using fusion proteins are provided.
  • II. INTRODUCTION
  • Polypeptides with nucleic acid binding activity are present in lower organisms, such as archaea, and higher organisms, such as eukaryotes. See, e.g., Pereira et al. (1997) Proc. Nat'l Acad. Sci. USA 94:12633-12637; and Motz et at. (2002) J. Biol. Chem. 277: 16179-16188. Polypeptides with nucleic acid binding activity have various functions. For example, certain polypeptides with nucleic acid binding activity, such as histones and histone-like proteins, are involved in the packaging of chromatin into higher order structures. See, e.g., Pereira et at. (1997) Proc. Nat'l Acad. Sci. USA 94:12633:12637. Certain other polypeptides with nucleic acid binding activity may play a role as processivity factors in DNA replication. See, e.g., Motz et al. (2002) J. Biol. Chem. 277:16179-16188.
  • III. SUMMARY
  • In certain embodiments, an isolated polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO:1 is provided, wherein the polypeptide has nucleic acid binding activity. In certain embodiments, the isolated polypeptide comprises the amino acid sequence of SEQ ID NO:1 or a fragment thereof having nucleic acid binding activity.
  • In certain embodiments, an isolated polynucleotide is provided, wherein the isolated polynucleotide comprises a polynucleotide sequence encoding a polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO: 1, wherein the polypeptide has nucleic acid binding activity. In certain embodiments, the isolated polynucleotide comprises the polynucleotide sequence of SEQ ID NO:2. In certain embodiments, the isolated polynucleotide comprises the polynucleotide sequence of SEQ ID NO:3. In certain embodiments, a vector comprising the isolated polynucleotide is provided. In certain embodiments, a host cell comprising the vector is provided. In certain embodiments, a method of producing a polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO:1, wherein the polypeptide has nucleic acid binding activity, is provided, wherein the method comprises culturing, the host cell under conditions suitable for expression of the polypeptide and isolating the polypeptide so expressed.
  • In certain embodiments, an isolated polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO:4 is provided, wherein the polypeptide has nucleic acid binding activity. In certain embodiments, the isolated polypeptide comprises the amino acid sequence of SEQ ID NO:4 or a fragment thereof having nucleic acid binding activity.
  • In certain embodiments, an isolated polynucleotide is provided, wherein the isolated polynucleotide comprises a polynucleotide sequence encoding a polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO:4, wherein the polypeptide has nucleic acid binding activity. In certain embodiments, the isolated polynucleotide comprises, the polynucleotide sequence of SEQ ID NO:5. In certain embodiments, a vector comprising the isolated polynucleotide is provided. In certain embodiments, a host cell comprising the vector is provided. In certain embodiments, a method of producing a polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO:4, wherein the polypeptide has nucleic acid binding activity, is provided, wherein the method comprises culturing the host cell under conditions suitable for expression of the polypeptide and isolating the polypeptide so expressed.
  • In certain embodiments, an isolated polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO:6 is provided, wherein the polypeptide has nucleic acid binding activity. In certain embodiments, the isolated polypeptide comprises the amino acid sequence of SEQ ID NO:6 or a fragment thereof having nucleic acid binding activity.
  • In certain embodiments, an isolated polynucleotide is provided, wherein the isolated polynucleotide comprises a polynucleotide sequence encoding a polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO:6, wherein the polypeptide has nucleic acid binding activity. In certain embodiments, the isolated polynucleotide comprises, the polynucleotide sequence of SEQ ID NO:7. In certain embodiments, a vector comprising the isolated polynucleotide is provided. In certain embodiments, a host cell comprising the vector is provided. In certain embodiments, a method of producing a polypeptide comprising an amino acid sequence having at least 80% identity to SEQ. ID NO:6, wherein the polypeptide has nucleic acid binding activity, is provided, wherein the method comprises culturing the host cell under conditions suitable for expression of the polypeptide and isolating the polypeptide so expressed.
  • In certain embodiments, an isolated polypeptide comprising the amino acid sequence of SEQ ID NO:28 is provided, wherein the polypeptide has:
  • nucleic acid binding activity.
  • In certain embodiments, a fusion protein is provided, wherein the fusion protein comprises, a polypeptide comprising the amino acid sequence of SEQ ID NO:28, wherein the polypeptide has nucleic acid binding activity, and a nucleic acid polymerase.
  • In certain embodiments, a fusion protein is provided, wherein the fusion protein comprises an amino acid sequence, of a Crenarchaeal nucleic acid binding polypeptide or a fragment thereof having nucleic acid binding activity, and a nucleic acid polymerase.
  • In certain embodiments, a fusion protein is provided, wherein the fusion protein comprises an amino acid sequence having at least 80% identity to SEQ ID NO:1, wherein the polypeptide has nucleic acid binding activity, and a nucleic acid polymerase. In certain embodiments, the nucleic acid polymerase is a thermostable DNA polymerase. In certain embodiments, the polypeptide comprises the amino acid sequence of SEQ ID NO:1 or a fragment of SEQ ID NO:1 that has nucleic acid binding activity.
  • In certain embodiments, a fusion protein is provided, wherein the fusion protein comprises an amino acid sequence having at least 80% identity to SEQ ID NO:4, wherein the polypeptide has nucleic acid binding activity, and a nucleic acid polymerase. In certain embodiments, the nucleic acid polymerase is a thermostable DNA polymerase. In certain embodiments, the polypeptide comprises the amino acid sequence of SEQ ID NO:4 or a fragment of SEQ ID NO:4 that has nucleic acid binding activity.
  • In certain embodiments, a fusion protein is provided, wherein the fusion protein comprises an amino acid sequence having at least 80% identity to SEQ ID NO:6, wherein the polypeptide has nucleic acid binding activity, and a nucleic acid polymerase. In certain embodiments, the nucleic acid polymerase is a thermostable DNA polymerase. In certain embodiments, the polypeptide comprises the amino acid sequence of SEQ ID NO:6 or a fragment of SEQ ID NO:6 that has nucleic acid binding activity.
  • In certain embodiments, a fusion protein comprising an amino acid sequence selected from SEQ ID NOs:23, 24, 26; 27, 34, 35, 37, 38, 40, 41, 43, and 44 is provided.
  • In certain embodiments, a method of stabilizing a nucleic acid duplex is provided, wherein the method comprises combining the nucleic acid duplex with a polypeptide comprising the amino acid sequence of SEQ ID NO:28, wherein the polypeptide has nucleic acid binding activity.
  • In certain embodiments, a method of stabilizing a nucleic acid duplex is provided, wherein the method comprises combining the nucleic acid duplex with a polypeptide comprising an amino acid sequence of a Crenarchaeal nucleic acid binding polypeptide or a fragment thereof having nucleic acid binding activity.
  • In certain embodiments, a method of stabilizing a nucleic acid duplex is provided, wherein the method comprises combining the nucleic acid duplex with a polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO: 1, wherein the polypeptide has nucleic acid binding activity. In certain embodiments, the polypeptide comprises the amino acid sequence of SEQ ID NO:1 or a fragment of SEQ ID NO:1 that has nucleic acid binding activity.
  • In certain embodiments, a method of stabilizing a nucleic acid duplex is provided, wherein the method comprises combining the nucleic acid duplex with a polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO:4, wherein the polypeptide has nucleic acid binding activity. In certain embodiments, the polypeptide comprises the amino acid sequence of SEQ ID NO:4 or a fragment of SEQ ID NO:4 that has nucleic acid binding activity.
  • In certain embodiments, a method stabilizing a nucleic acid duplex is provided, wherein the method comprises combining the nucleic acid duplex with a polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO:6, wherein the polypeptide has nucleic acid binding activity. In certain embodiments, the polypeptide comprises the amino acid sequence of SEQ ID NO:6 or a fragment of SEQ ID NO:6 that has nucleic acid binding activity.
  • In certain embodiments, a method of promoting the annealing of complementary nucleic acid strands is provided, wherein the method comprises combining the complementary nucleic acid strands with a polypeptide comprising the amino acid sequence of SEQ ID NO:28, wherein the polypeptide has nucleic acid binding activity.
  • In certain embodiments, a method of promoting the annealing of complementary nucleic acid strands is provided, wherein the method comprises combining the complementary nucleic acid strands with a polypeptide comprising an amino acid sequence of a Crenarchaeal nucleic acid binding polypeptide or a fragment thereof having nucleic acid binding activity.
  • In certain embodiments, a method of promoting the annealing of complementary nucleic acid strands is provided, wherein the method comprises combining the complementary nucleic acid strands with a polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO:1, wherein the polypeptide has nucleic acid binding activity. In certain embodiments, the polypeptide comprises the amino acid sequence of SEQ ID NO:1 or a fragment of SEQ ID NO:1 that has nucleic acid binding activity.
  • In certain embodiments, a method of promoting the annealing of complementary nucleic acid strands is provided, wherein the method comprises combining the complementary nucleic acid strands with a polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO:4, wherein the polypeptide has nucleic acid binding activity. In certain embodiments, the polypeptide comprises the amino acid sequence of SEQ ID NO:4 or a fragment of SEQ ID NO:4 that has nucleic acid binding activity.
  • In certain embodiments, a method of promoting the annealing of complementary nucleic acid strands is provided, wherein the method comprises combining the complementary nucleic acid strands with a polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO:6, wherein the polypeptide has nucleic acid binding activity. In certain embodiments, the polypeptide comprises the amino acid sequence of SEQ ID NO:6 or a fragment of SEQ ID NO:6 having nucleic acid binding activity.
  • In certain embodiments, a method of increasing the processivity of a DNA polymerase is provided, wherein the method comprises combining a reaction mixture comprising the DNA polymerase with a polypeptide comprising the amino acid sequence of SEQ ID NO:28, wherein the polypeptide has nucleic acid binding activity.
  • In certain embodiments, a method of increasing the processivity of a DNA polymerase is provided, wherein the method comprises combining a reaction mixture comprising the DNA polymerase with a polypeptide comprising an amino acid sequence of a Crenarchaeal nucleic acid binding polypeptide or fragment thereof having nucleic acid binding activity.
  • In certain embodiments, a method of increasing the processivity of a DNA polymerase is provided, wherein the method comprises combining a reaction mixture comprising the DNA polymerase with a polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO:1, wherein the polypeptide has nucleic acid binding activity. In certain embodiments, the polypeptide comprises the amino acid sequence of SEQ ID NO:1 or a fragment of SEQ ID NO:1 that has nucleic acid binding activity.
  • In certain embodiments, a method of increasing the processivity of a DNA polymerase is provided, wherein the method comprises combining a reaction mixture comprising the DNA polymerase with a polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO:4, wherein the polypeptide has nucleic acid binding activity. In certain embodiments, the polypeptide comprises the amino acid sequence of SEQ ID NO:4 or a fragment of SEQ ID NO:4 that has nucleic acid binding activity.
  • In certain embodiments; a method of increasing the processivity of a DNA polymerase is provided, wherein the method comprises combining a reaction mixture comprising the DNA polymerase with a polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO:6, wherein the polypeptide has nucleic acid binding activity. In certain embodiments, the polypeptide comprises the amino acid sequence of SEQ ID NO:6 or a fragment of SEQ ID NO:6 that has nucleic acid binding activity.
  • In certain embodiments, a kit is provided, wherein the kit comprises a polypeptide comprising the amino acid sequence of SEQ ID NO:28, wherein the polypeptide has nucleic acid binding activity, and a nucleic acid modification enzyme.
  • In certain embodiments, a kit is provided, wherein the kit comprises a polypeptide comprising an amino acid sequence of a Crenarchaeal nucleic acid binding polypeptide or a fragment thereof having nucleic acid binding activity, and a nucleic acid modification enzyme.
  • In certain embodiments, a kit is provided, wherein the kit comprises a polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO:1, wherein the polypeptide has nucleic acid binding activity, and a nucleic acid modification enzyme. In certain embodiments, the polypeptide comprises the amino acid sequence of SEQ ID NO:1 or a fragment of SEQ ID NO:1 that has nucleic acid binding activity.
  • In certain embodiments, a kit is provided, wherein the kit comprises a polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO:4, wherein the polypeptide has nucleic acid binding activity, and a nucleic acid modification enzyme. In certain embodiments, the polypeptide comprises the amino acid sequence of SEQ ID NO:4 or a fragment of SEQ ID NO:4 that has nucleic acid binding activity.
  • In certain embodiments, a kit is provided, wherein the kit comprises a polypeptide comprising an amino acid sequence having at least 80% identity to SEQ ID NO:6, wherein the polypeptide has nucleic acid binding activity, and a nucleic acid modification enzyme. In certain embodiments, the polypeptide comprises the amino acid sequence of SEQ ID NO:6 or a fragment thereof having nucleic acid binding activity.
  • In certain embodiments of any of the above kits, the nucleic acid modification enzyme is a thermostable DNA polymerase.
  • In certain embodiments, a kit is provided, wherein the kit comprises any of the above fusion proteins and deoxyribonucleotides.
  • IV. DETAILED DESCRIPTION OF VARIOUS EMBODIMENTS
  • In this application, the use of the singular includes the plural unless specifically stated otherwise. In this application, the word “a” or “an” means “at least one” unless specifically stated otherwise. In this application, the use of “or” means “and/or” unless stated otherwise. Furthermore, the use of the term “including,” as, well as other forms, such as “includes” and “included,” is not limiting. Also, terms such as “element” or “component” encompass both elements or components comprising one unit and elements or components that comprise more than one unit unless specifically stated otherwise.
  • The section headings used herein are for organizational purposes only and are not to be construed as limiting the subject matter described. All documents, or portions of documents, cited in this application, including but not limited to patents, patent applications, articles, books, and treatises are hereby expressly incorporated by reference in their entirety for any purpose. In the event that one or more of the incorporated literature and similar materials defines a term that contradicts that term's definition in this application, this application controls.
  • A. Certain Definitions
  • It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the invention, as claimed.
  • “Crenarchaeal nucleic acid binding polypeptide” refers to a naturally occurring Crenarchaeal polypeptide that has a molecular weight of about 6 to 11 kilodaltons and a predicted isoelectric point of about 9 to 11; that comprises less than or equal to 4 arginine residues and less than or equal to 15 lysine residues; that has nucleic acid binding activity; and that has an amino acid sequence that is less than 50% identical to the amino acid sequence of Sso7d (SEQ ID NO:30). The Crenarchaea include, but are not limited to, members of the genus Pyrobaculum, Thermoproteus, Thermocladium, Caldivirga, Thermofilum, Staphylothermus, Ignicocus, Aeropyrum, Pyrodictium, Pyrolobus, Sulfolobus, and Metallosphaera. See, e.g., Fitz-Gibbon et al. (2002) Proc. Nat'l Acad. Sci. USA 99:984-989.
  • “Nucleic acid binding activity” refers to the activity of a polypeptide in binding nucleic acid in at least one of the following two band-shift assays. In the first assay (based on the assay of Guagliardi et al. (1997) J. Mol. Biol. 267:841-848), double stranded nucleic acid (the 452-bp HindIII-EcoRV fragment from the S. solfataricus lacS gene) is labeled with 32P to a specific activity of at least about 2.5×107 cpm/ug (or at least about 4000 cpm/fmol) using standard methods. See, e.g., Sambrook et at (2001) Molecular Cloning: A Laboratory Manual (3rd ed., Cold Spring Harbor. Laboratory Press, NY),at 9.63-9.75 (describing end-labeling of nucleic acids). A reaction mixture is prepared containing at least about 0.5 μg of the polypeptide in about 10 μl of binding buffer (50 mM sodium phosphate buffer (pH 8.0), 10% glycerol, 25 mM KCl, 25 mM MgCl2). The reaction mixture is heated to 37° C. for ten minutes. About 1×104 to 5×104 cpm(or about 0.5-2 ng) of the labeled double stranded nucleic acid is added to the reaction mixture and incubated for an additional ten minutes. The reaction mixture is loaded onto a native polyacrylamide gel in 0.5× Tris-borate buffer. The reaction mixture is subjected to electrophoresis at room temperature. The gel is dried and subjected to autoradiography using standard methods. Any detectable decrease in the mobility of the labeled double stranded nucleic acid indicates formation of a binding complex between the polypeptide and the double stranded nucleic acid. Such nucleic acid binding activity may be quantified using standard densitometric methods to measure the amount of radioactivity in the binding complex relative to the total amount of radioactivity in the initial reaction mixture.
  • In the second assay (based on the assay of Mai et al. (1998) J. Bacteriol. 180:2560-2563), about 0.5 μg each of negatively supercoiled circular pBluescript KS(−) plasmid and nicked circular pBluescript KS(−) plasmid (Stratagene, La Jolla, Calif.) are mixed with a polypeptide at a polypeptide/DNA mass ratio of about ≧2.6. The mixture is incubated for 10 minutes at 40° C. The mixture is subjected to 0.8% agarose gel electrophoresis. DNA is visualized using an appropriate dye. Any detectable decrease in the mobility of the negatively supercoiled circular plasmid and/or nicked circular plasmid indicates formation of a binding complex between the polypeptide and the plasmid.
  • “Fusion protein” refers to a protein comprising two or more domains joined either covalently or noncovalently, wherein two or more of the domains do not naturally occur in a single protein.
  • “Nucleic acid polymerase” refers to a polypeptide that catalyzes the synthesis of a polynucleotide using an existing polynucleotide as a template.
  • “Polymerase activity” refers to the activity of a nucleic acid polymerase in catalyzing the template-directed synthesis of a new polynucleotide. Polymerase activity is measured using the following assay, which is based on that of Lawyer et al. (1989) J. Biol. Chem. 264:6427-647. Serial dilutions of polymerase are prepared in dilution buffer (20 mM Tris Cl, pH 8.0, 50 mM KCl, 0.5% NP 40, and 0.5% Tween-20). For each dilution, 5 μl is removed and added to 45 μl of a reaction mixture containing 25 mM TAPS (pH 9.25), 50 mM KCl, 2 mM MgCl2, 0.2 mM dATP, 0.2 mM dGTP, 0.2 mM dTTP, 0.1 mM dCTP, 12.5 μg activated DNA, 100 μM [α-32P] dCTP (0.05 μCi/nmol) and sterile deionized water. The reaction mixtures are incubated at 37° C. (or 74° C. for thermostable DNA polymerases) for 10 minutes and then stopped by immediately cooling the reaction to 4° C. and adding 10 μl of ice-cold 60 mM EDTA. A 25 μl aliquot is removed from each reaction mixture. Unincorporated radioactively labeled dCTP is removed from each aliquot by gel filtration (Centri-Sep, Princeton Separations, Adelphia, N.J.). The column eluate is mixed with scintillation fluid (1 ml). Radioactivity in the column eluate is quantified with a scintillation counter to determine the amount of product synthesized by the polymerase. One unit of polymerase activity is defined as the amount of polymerase necessary to synthesize 10 nmole of product in 30 minutes.
  • “DNA Polymerase” refers to a nucleic acid polymerase that catalyzes the template-directed synthesis of DNA.
  • “Processivity” refers to the extent of polymerization by a nucleic acid polymerase during a single contact between the polymerase and its template. The extent of polymerization refers to the number of nucleotides added by the polymerase during a single contact between the polymerase and its template.
  • “Thermostable DNA polymerase” refers to a DNA polymerase that, at a temperature higher than 37° C., retains its ability to add at least one nucleotide onto the 3′ end of a primer or primer extension product that is annealed to a target nucleic acid sequence. In certain embodiments, a thermostable DNA polymerase remains active at a temperature greater than about 37° C. In certain embodiments, a thermostable DNA polymerase remains active at a temperature greater than about 42° C. In certain embodiments, a thermostable DNA polymerase remains active at a temperature greater than about 50° C. In certain embodiments, a thermostable DNA polymerase remains active at a temperature greater than about 60° C. In certain embodiments, a thermostable DNA polymerase remains active at a temperature greater than about 70° C. In certain embodiments, a thermostable DNA polymerase remains active at a temperature greater than about 80° C. In certain embodiments, a thermostable polymerase remains active at a temperature greater than about 90° C.
  • “Percent identity” or “% identity,” with reference to nucleic acid sequences, refers to the percentage of identical nucleotides between at least two polynucleotide sequences aligned using the Basic Local Alignment Search Tool (BLAST) engine. See Tatusova et al. (1999) FEMS Microbiol Lett. 174:247-250. The BLAST engine (version 2:2.10) is provided to the public by the National Center for Biotechnology Information (NCBI), Bethesda, Md. To align two polynucleotide sequences, the “Blast 2 Sequences” tool is used, which employs the “blastn” program with parameters set at default values as follows:
  • Matrix: not applicable
  • Reward for match: 1
  • Penalty for mismatch: −2
  • Open gap: 5 penalties
  • Extension gap: 2 penalties
  • Gap_x dropoff: 50
  • Expect: 10.0
  • Word size: 11
  • Filter: on
  • “Percent identity” or “% identity,” with reference to polypeptide sequences, refers to the percentage of identical amino acids between at least two polypeptide sequences aligned using the Basic Local Alignment Search Tool (BLAST) engine. See Tatusova et at. (1999) FEMS Microbiol Lett. 174:247-250. The BLAST engine (version 2.2.10) is provided to the public by the National Center for Biotechnology Information (NCBI), Bethesda, Md. To align two polypeptide sequences, the “Blast 2 Sequences” tool is used, which employs the “blastp” program with parameters set at default value as follows:
  • Matrix: BLOSUM62
  • Open gap: 11 penalties
  • Extension gap: 1 penalty
  • Gap_x dropoff: 50
  • Expect: 10.0
  • Word size: 3
  • Filter: on
  • The terms “polypeptide,” “peptide,” and “protein” are used interchangeably herein to refer to a polymer of amino acid residues. The terms apply to amino acid polymers containing naturally occurring amino acids as well as amino acid polymers in which one or more amino acid residues is an artificial chemical analogue of a corresponding naturally occurring amino acid. The amino acid polymers can be of any length.
  • A “fragment” of a reference polypeptide refers to a contiguous stretch of amino acids from any portion of the reference polypeptide. A fragment may be of any length that is less than the length of the reference polypeptide.
  • A “variant” of a reference polypeptide refers to a polypeptide having one or more amino acid substitutions, deletions, or insertions relative to the reference polypeptide. Exemplary conservative substitutions include, but are not limited to, those set forth below:
  • TABLE 1
    Exemplary Amino Acid Substitutions
    Original Residues Exemplary Substitutions
    Ala Val, Leu, Ile
    Arg Lys, Gln, Asn
    Asn Gln
    Asp Glu
    Cys Ser, Ala
    Gln Asn
    Glu Asp
    Gly Pro, Ala
    His Asn, Gln, Lys, Arg
    Ile Leu, Val, Met, Ala, Phe, Norleucine
    Leu Norleucine, Ile, Val, Met, Ala, Phe
    Lys Arg, 1,4 Diamino-butyric Acid, Gln, Asn
    Met Leu, Phe, Ile
    Phe Leu, Val, Ile, Ala, Tyr
    Pro Ala
    Ser Thr, Ala, Cys
    Thr Ser
    Trp Tyr, Phe
    Tyr Trp, Phe, Thr, Ser
    Val Ile, Met, Leu, Phe, Ala, Norleucine
  • “Nucleic acid modification enzyme” refers to an enzymatically active polypeptide that acts on a nucleic acid substrate. Nucleic acid modification enzymes include, but are not limited to, nucleic acid polymerases (such as DNA polymerases and RNA polymerases), nucleases (including endonucleases, such as restriction endonucleases, and exonucleases, such as 3′ or 5′ exonucleases), gyrases, topoisomerases, methylases, and ligases.
  • “Melting temperature” or “Tm” refers to the temperature at which 50% of the base pairs in a double-stranded nucleic acid have denatured. “Predicted Tm” refers to the Tm calculated for a nucleic acid using the following equation:

  • Tm=81.5° C.+16.6 log10 [M +]+0.41(%[G+C])
  • where [M+] is the monovalent cation concentration for M+≦0.5 M. See Sambrook et al. (2001) Molecular Cloning: A Laboratory Manual (3rd ed., Cold Spring Harbor Laboratory Press, NY) at 10.47.
  • The term “nucleotide base,” as used herein, refers to a substituted or unsubstituted aromatic ring or rings. In certain embodiments, the aromatic ring or rings contain at least one nitrogen atom. In certain embodiments, the nucleotide base is capable of forming Watson-Crick and/or Hoogsteen hydrogen bonds with an appropriately complementary nucleotide base. Exemplary nucleotide bases and analogs thereof include, but are not limited to, naturally occurring nucleotide bases adenine, guanine, cytosine, 6 methyl-cytosine, uracil, thymine, and analogs of the naturally occurring nucleotide bases, e.g.,7-deazaadenine, 7-deazaguanine, 7-deaza-8-azaguanine, 7-deaza-8-azaadenine, N6-Δ2-isopentenyladenine (6iA), N6-Δ2-isopentenyl-2-methylthioadenine (2ms6iA), N2-dimethylguanine (dmG), 7-methylguanine (7mG), inosine, nebularine, 2-aminopurine, 2-amino-6-chloropurine, 2,6-diaminopurine, hypoxanthine, pseudouridine, pseudocytosine, pseudoisocytosine, 5-propynylcytosine, isocytosine, isoguanine,7-deazaguanine, 2-thiopyrimidine, 6-thioguanine, 4-thiothymine, 4-thiouracil, O6-methylguanine, N6-methyladenine, O4-methylthymine, 5,6-dihydrothymine, 5,6-dihydrouracil, pyrazolo[3,4-D]pyrimidines (see, e.g., U.S. Pat. Nos. 6,143,877 and 6,127,121 and PCT published application WO 01/38584), ethenoadenine, indoles such as nitroindole and 4-methylindole, and pyrroles such as nitropyrrole. Certain exemplary nucleotide bases can be found, e.g., in Fasman (1989) Practical Handbook of Biochemistry and Molecular Biology, pages 385-394, (CRC Press, Boca Raton, Fla.) and the references cited therein.
  • The term “nucleotide,” as used herein, refers to a compound comprising a nucleotide base linked to the C-1′ carbon of a sugar, such as ribose, arabinose, xylose, and pyranose, and sugar analogs thereof. The term nucleotide also encompasses nucleotide analogs. The sugar may be substituted or unsubstituted. Substituted ribose sugars include, but are not limited to, those riboses in which one or more of the carbon atoms, for example the 2′-carbon atom, is substituted with one or more of the same or different Cl, F, —R, —OR, —NR2 or halogen groups, where each R is independently H, C1-C6 alkyl or C5-C14 aryl. Exemplary riboses include, but are not limited to, 2′-(C1-C6)alkoxyribose, 2′-(C5 -C14)aryloxyribose, 2′,3′-didehydroribose, 2′-deoxy-3′-haloribose, 2′-deoxy-3′-fluororibose, 2′-deoxy-3′-chlororibose, 2′-deoxy-3′-aminoribose, 2′-deoxy-3′-(C1-C6)alkylribose, 2′-deoxy-3′-(C1-C6)alkoxyribose and 2′-deoxy-3′-(C5-C14)aryloxyribose, ribose, 2′-deoxyribose, 2′,3′-dideoxyribose, 2′-haloribose, 2′-fluororibose, 2′-chlororibose, and 2′-alkylribose, e.g., 2′-O-methyl, 4′-α-anomeric nucleotides, 1′-α-anonmeric nucleotides, 2′-4′- and 3′-4′-linked and other “locked” or “LNA”, bicyclic sugar modifications (see, e.g., PCT published application nos. WO 98/22489, WO 98/39352;, and WO 99/14226). Exemplary LNA sugar analogs within a polynucleotide include, but are not limited to, the structures:
  • Figure US20160122393A1-20160505-C00001
  • where B is any nucleotide base.
  • Modifications at the 2′- or 3′-position of ribose include, but are not limited to, hydrogen, hydroxy, methoxy, ethoxy, allyloxy, isopropoxy, butoxy, isobutoxy, methoxyethyl, alkoxy, phenoxy, azido, amino, alkylamino, fluoro, chloro and bromo. Nucleotides include, but are not limited to, the natural D optical isomer, as well as the L optical isomer forms (see, e.g., Garbesi (1993) Nucl. Acids Res. 21 :4159-65; Fujimori (1990) J. Amer. Chem. Soc. 112:7435; Urata, (1993) Nucleic Acids Symposium Ser. No. 29:69-70). When the nucleotide base is purine, e.g. A or G, the ribose sugar is attached to the N9-position of the nucleotide base. When the nucleotide base is pyrimidine; e.g., C, T or U, the pentose sugar is attached to the N1-position of the nucleotide base, except for pseudouridines, in which the pentose sugar is attached to the C5 position of the uracil nucleotide base (see, e.g., Kornberg and Baker, (1992) DNA Replication, 2nd Ed., Freeman, San Francisco, Calif.).
  • One or more of the pentose carbons of a nucleotide may be substituted with a phosphate ester having the formula:
  • Figure US20160122393A1-20160505-C00002
  • where α is an integer from 0 to 4. In certain embodiments, α is 2 and the phosphate ester is attached to the 3′- or 5′-carbon of the pentose. In certain embodiments, the nucleotides are those in which the nucleotide base is a purine, a 7-deazapurine, a pyrimidine, or an analog thereof. “Nucleotide 5′-triphosphate” refers to a nucleotide with a triphosphate ester group at the 5′ position, and is sometimes denoted as “NTP”, or “dNTP” and “ddNTP” to particularly point out the structural features of the ribose sugar. The triphosphate ester group may include sulfur substitutions for the various oxygens, e.g. α-thio-nucleotide 5′-triphosphates. For a review of nucleotide chemistry, see: Shabarova, Z. and Bogdanov, A. Advanced Organic Chemistry of Nucleic Acids, VCH, New York, 1994.
  • The term “nucleotide analog,” as used herein, refers to embodiments in which the pentose sugar and/or the nucleotide base and/or one or more of the phosphate esters of a nucleotide may be replaced with its respective analog. In certain embodiments, exemplary pentose sugar analogs are those described above. In certain embodiments, the nucleotide analogs have a nucleotide base analog as described above. In certain embodiments, exemplary phosphate ester analogs include, but are not limited to, alkylphsosphonates, methylphosphonates, phosphoramidates, phosphotriesters, phosphorothioates, phosphorodithioates, phosphoroselenoates, phosphorodiselenoates, phosphoroanilothioates, phosphoroanilidates, phosphoroamidates, boronophosphates, etc., and may include associated counterions.
  • Also included within the definition of “nucleotide analog” are nucleotide analog monomers that can be polymerized into polynucleotide analogs in which the DNA/RNA phosphate ester and/or sugar phosphate ester backbone is replaced with a different type of internucleotide linkage. Exemplary polynucleotide analogs include, but are not limited to, peptide nucleic acids, in which the sugar phosphate backbone of the polynucleotide is replaced by a peptide backbone.
  • As used herein, the terms “polynucleotide,” “oligonucleotide,” and “nucleic acid” are used interchangeably and mean single-stranded and double-stranded polymers of nucleotide monomers, including 2′-deoxyribonucleotides (DNA) and ribonucleotides (RNA) linked by internucleotide phosphodiester bond linkages, or internucleotide analogs, and associated counter ions, e.g., H+, NH4 +, trialkylammonium, Mg2+, Na+ and the like. A nucleic acid may be composed entirely of deoxyribonucleotides, entirely of ribonucleotides, or chimeric mixtures thereof. The nucleotide monomer units may comprise any of the nucleotides described herein, including, but not limited to, naturally occurring nucleotides and nucleotide analogs. Nucleic acids typically range in size from a few monomeric units, e.g. 5-40 when they are sometimes referred to in the art as oligonucleotides, to several thousands of monomeric nucleotide units. Unless denoted otherwise, whenever a nucleic acid sequence is represented, it will be, understood that the nucleotides are in 5′ to 3′ order from left to right and that “A” denotes deoxyadenosine or an analog thereof, “C” denotes deoxycytidine or an analog thereof, “G” denotes deoxyguanosine or an analog thereof, “T” denotes thymidine or an analog thereof, and “U” denotes uridine or an analog thereof, unless otherwise noted.
  • Nucleic acids include, but are not limited to, genomic DNA, cDNA, hnRNA, mRNA, rRNA, tRNA, fragmented nucleic acid, nucleic acid obtained from subcellular organelles such as mitochondria or chloroplasts, and nucleic acid obtained from microorganisms or DNA or RNA viruses that may be present on or in a biological sample. Nucleic acids include, but are not limited to, synthetic or in vitro transcription products.
  • Nucleic acids may be composed of a single type of sugar moiety, e.g., as in the case of RNA and DNA, or mixtures of different sugar moieties, e.g., as in the case of RNA/DNA chimeras. In certain embodiments, nucleic acids are ribopolynucleotides and 2′-deoxyribopolynucleotides according to the structural formulae below:
  • Figure US20160122393A1-20160505-C00003
  • wherein each B is independently the base moiety of a nucleotide, e.g., a purine, a 7-deazapurine, a pyrimidine, or an analog nucleotide; each m defines the length of the respective nucleic acid and can range from zero to thousands, tens of thousands, or even more; each R is independently selected from the group comprising hydrogen, halogen, —R″, —OR″, and —NR″H″, where each R″ is independently (C1-C6) alkyl or (C5 -C14) aryl, or two adjacent Rs are taken together to form a bond such that the ribose sugar is 2′,3′-didehydroribose; and each R′ is independently hydroxyl or
  • Figure US20160122393A1-20160505-C00004
  • where α is zero, one or two.
  • In certain embodiments of the ribopolynucleotides and 2′-deoxyribopolynucleotides illustrated above, the nucleotide bases B are covalently attached to the C1′ carbon of the sugar moiety as previously described.
  • The terms “nucleic acid,” “polynucleotide,” and “oligonucleotide” may also include nucleic acid analogs, polynucleotide analogs and oligonucleotide analogs. The terms “nucleic acid analog”, “polynucleotide analog” and “oligonucleotide analog” are used interchangeably and, as used herein, refer to a nucleic acid that contains at least one nucleotide analog and/or at least one phosphate ester analog and/or at least one pentose sugar analog. Also included within the definition of nucleic acid analogs are nucleic acids in which the phosphate ester and/or sugar phosphate ester linkages are replaced with other types of linkages, such as N-(2-aminoethyl)-glycine amides and other amides (see, e.g., Nielsen et al.; 1991, Science 254:1497-1500; WO:92/20702; U.S. Pat. No. 5,719,262; U.S. Pat. No. 5,698,685;); morpholinos (see, e.g., U.S. Pat. No. 5,698,685; U.S. Pat. No. 5,378,841; U.S. Pat. No. 5,185,144); carbamates (see, e.g., Stirchak & Summerton, 1987, J. Org. Chem. 52: 4202); methylene(methylimino) (see, e.g., Vasseur et al., 1992, J. Am. Chem. Soc. 114:4006); 3′-thioformacetals (see, e.g., Jones at al.,1993, J. Org. Chem. 58:2983); sulfamates (see, e.g., U.S. Pat. No. 5,470,967); 2-aminoethylglycine, commonly referred to as PNA (see, e.g., Buchardt, WO 92/20702; Nielsen (1991) Science 254:1497-1500); and others (see, e.g., U.S. Pat. No. 5,817,781; Frier & Altman,1997, Nucl. Acids Res. 25:4429 and the references cited therein). Phosphate ester analogs include, but are not limited to, (i) C1-C4 alkylphosphonate, e.g. methylphosphonate; (ii) phosphoramidate; (iii) C1-C6 alkyl-phosphotriester; (iv) phosphorothioate; and (v) phosphorodithioate.
  • In this application, a statement that one sequence is the same as or is complementary, to another sequence encompasses situations where both of the sequences are completely the same or complementary to one another, and situations where only a portion of one of the sequences is the same as, or is complementary to, a portion or the entirety of the other sequence. Here, the term “sequence” encompasses, but is not limited to, nucleic acid sequences, polynucleotides, oligonucleotides, probes, and primers.
  • In this application, a statement that one sequence is complementary to another sequence encompasses situations in which the two sequences have mismatches. Here, the term “sequence” encompasses, but is not limited to, nucleic acid sequences, polynucleotides, oligonucleotides, probes, and primers. Despite the mismatches, the two sequences should selectively hybridize to one another under appropriate conditions.
  • The term “selectively hybridize” means that, for particular identical sequences, a substantial portion of the particular identical sequences hybridize to a given desired sequence or sequences, and a substantial portion of the particular identical sequences do not hybridize to other undesired sequences. A “substantial portion of the particular identical sequences” in each instance refers to a portion of the total number of the particular identical sequences, and it does not refer to a portion of an individual particular identical sequence. In certain embodiments, “a substantial portion of the particular identical sequences” means at least 70% of the particular identical sequences. In certain embodiments, “a substantial portion of the particular identical sequences” means at least 80% of the particular identical sequences. In certain embodiments, “a substantial portion of the particular identical sequences means at least 90% of the particular identical sequences. In certain embodiments, “a substantial portion of the particular identical sequences” means at least 95% of the particular identical sequences.
  • In certain embodiments, the number of mismatches that may be present may vary in view of the complexity of the composition. Thus, in certain embodiments, the more complex the composition, the more likely undesired sequences will hybridize. For example, in certain embodiments, with a given number of mismatches, a probe may more likely hybridize to undesired sequences in a composition with the entire genomic DNA than in a composition with fewer DNA sequences, when the same hybridization and wash conditions are employed for both compositions. Thus, that given number of mismatches may be appropriate for the composition with fewer DNA sequences, but fewer mismatches may be more optimal for the composition with the entire genomic DNA.
  • In certain embodiments, sequences are complementary if they have no more than 20% mismatched nucleotides. In certain embodiments, sequences are complementary if they have no more than 15% mismatched nucleotides. In certain embodiments, sequences are complementary if they have no more than10% mismatched nucleotides. In certain embodiments, sequences are complementary if they have no more than 5% mismatched nucleotides.
  • In this application, a statement that one sequence hybridizes or binds to another sequence encompasses situations where the entirety of both of the sequences hybridize or bind to one another, and situations where only a portion of one or both of the sequences hybridizes or binds to the entire other sequence or to a portion of the other sequence. Here, the term “sequence” encompasses, but is not limited to, nucleic acid sequences, polynucleotides, oligonucleotides, probes, and primers.
  • A “target,” “target nucleic acid,” or “target polynucleotide” refers to a nucleic acid comprising a sequence that can be distinguished by a probe or primer. In certain embodiments, a target polynucleotide is naturally occurring. In certain embodiments a target polynucleotide comprises synthetic molecules.
  • The term “primer” refers to a polynucleotide that anneals to a target polynucleotide and allows the synthesis from its 3′ end of a sequence complementary to the target polynucleotide.
  • The term “primer extension reaction” refers to a reaction in which a polymerase catalyzes the template-directed synthesis of a nucleic acid from the 3′ end of a primer. The term “primer extension product” refers to the resultant nucleic acid. A non-limiting exemplary primer extension reaction is the polymerase chain reaction (PCR).
  • The term “probe” comprises a polynucleotide that comprises a specific portion designed to hybridize in a sequence-specific manner with a complementary region of a specific nucleic acid sequence, e.g., a target polynucleotide. In certain embodiments, the specific portion of the probe may be specific for a particular sequence, or alternatively, may be degenerate, e.g., specific for a set of sequences.
  • The terms “annealing” and “hybridization” are used interchangeably and mean the base-pairing interaction of one nucleic acid with another nucleic acid that results in the formation of a duplex, triplex, or other higher-ordered structure. In certain embodiments, the primary interaction is base specific, e.g., A/T and G/C, by Watson/Crick and Hoogsteen-type hydrogen bonding. In certain embodiments, base-stacking and hydrophobic interactions may also contribute to duplex stability.
  • A “nucleic acid duplex” refers to any nucleic acid or portion of a nucleic acid that exists in double stranded form. Double stranded nucleic acid includes homoduplexes, such as double stranded DNA, and heteroduplexes, such as DNA:RNA or DNA:PNA heteroduplexes. Double stranded nucleic acid also includes, for example, a region of a nucleic acid strand to which a primer or probe has annealed.
  • B. Certain Nucleic Acid Binding Polypeptides from Sulfolobus
  • Certain small, basic DNA binding polypeptides from the hyperthermophilic archaeotes Sulfolobus solfataricus and Sulfolobus acidocaldarius have been discovered. These polypeptides, which include Sso7d and Sac7d, bind DNA in a sequence non-specific manner. See Gao et al. (1998) Nature Struct. Biol. 5:782-786; Robinson et al. (1998) Nature 392:202-205; McAfee et al. (1995) Biochem. 34:10063-10077; and Baumann et al. (1994) Nature Struct. Biol. 1:808-819. Sso7d and Sac7d are of relatively low molecular weight (about 7 kDa) and are rich in lysine residues. Id. Certain lysine residues are believed to be involved in DNA binding. See Gao et al. (1998) Nature Struct. Biol. 5:782-786. Both protect double stranded DNA from thermal denaturation by increasing its melting temperature (Tm) by about 40° C. Id.; Robinson et al. (1998) Nature 392:202-205. Sso7d also promotes the annealing of complementary DNA strands at temperatures exceeding the predicted Tm of the resulting duplex. See Guagliardi et al. (1997) J. Mol. Biol. 267:841-848. Sso7d exhibits a strong preference for DNA strands that are complementary without any mismatches over DNA strands that contain even a single mismatch. See id.; U.S. Patent Application Publication No. US 2003/0022162 A1. It is postulated that small, basic polypeptides such as Sso7d and Sac7d protect the DNA of hyperthermophiles from denaturation and degradation in the hyperthermophilic environment, where temperatures approach or exceed 100° C. See Guagliardi et al. (1997) J. Mol. Biol. 267:841-848.
  • C. Certain Exemplary Components
  • In certain embodiments, an isolated polypeptide having nucleic acid binding activity is provided. A polypeptide having nucleic acid binding activity is also referred to herein as a “nucleic acid binding polypeptide.” In certain embodiments, an isolated nucleic acid binding polypeptide has a relatively low molecular weight and basic isoelectric point. In certain embodiments, a nucleic acid binding polypeptide stabilizes a double stranded nucleic acid from denaturation or promotes the annealing of complementary nucleic acid strands. In certain embodiments, a fusion protein comprising a nucleic acid binding polypeptide is provided. In certain such embodiments, a fusion protein comprises a nucleic acid binding polypeptide joined to a nucleic acid modification enzyme, such as a DNA polymerase.
  • 1. Nucleic Acid Binding Polypeptides
  • In certain embodiments, an isolated nucleic acid binding polypeptide comprises a Crenarchaeal nucleic acid binding polypeptide. In certain embodiments, a Crenarchaeal nucleic acid binding polypeptide comprises a naturally occurring polypeptide from the crenarchaeon Pyrobaculum aerophilum. In certain embodiments, a Crenarchaeal nucleic acid binding polypeptide comprises the amino acid sequence of SEQ ID NO:1; which can be found at GenBank accession numbers AAL64739 and AAL64814. SEQ ID NO:1 is encoded by the open reading frames “PAE3192” (SEQ ID NO:2) and “PAE3289” (SEQ ID NO:3) which are unannotated open reading frames identified in the complete genome sequence of P. aerophilum. See GenBank accession no. AE009441.
  • In certain embodiments, a Crenarchaeal nucleic acid binding polypeptide comprises the amino acid sequence of SEQ ID NO:4, which can be found at GenBank accession number AAL62754. SEQ ID NO:4 is encoded by the open reading frame “PAE0384” (SEQ ID NO:5), which is an unannotated open reading frame identified in the complete genome sequence of P. aerophilum. See GenBank accession no. AE009441.
  • SEQ ID NOs:1 and 4 are low molecular weight, basic proteins of 57 and 56 amino acids in length, respectively, with a predicted isoelectric point of about 10.5. SEQ ID.NO:1 contains 12 lysine residues and 2 arginine residues. SEQ ID NO:4 contains 11 lysine residues and 2 arginine residues. SEQ ID NOs:1 and 4 are about 97% identical to each other. SEQ ID NOs:1 and 4 are similar in size and charge to Sso7d, but they are not significantly identical to the amino acid sequence of Sso7d. Additionally, SEQ ID NO:1 contains a “KKOK” motif (SEQ ID NO: 47) near its N-terminus (residues 3 to 6 of SEQ ID NO:1). This motif resembles the “KQKK” motif (SEQ ID NO: 47) found at the C-terminus of Sso7d (residues 61-64 of SEQ ID NO:30). The location of these motifs at opposite termini of SEQ ID NO:1 and Sso7d may have resulted from gene rearrangements during the divergence of the different Crenarchaeal species. The KQKK motif (SEQ ID NO: 47) of Sso7d is discussed in Shehi et al. (2003) Biochem. 42:8362-8368.
  • In certain embodiments, an isolated nucleic acid binding polypeptide comprises a fragment of SEQ ID NO:1 that has at least one activity of a polypeptide comprising SEQ ID NO:1. Exemplary activities include, but are not limited to, the ability of SEQ ID NO:1 to bind nucleic acid, stabilize nucleic acid duplexes, promote annealing of complementary nucleic acid strands, increase the Tm of primers, enhance the activity of a nucleic acid modification enzyme, and increase the processivity of a polymerase. In certain embodiments, a fragment of SEQ ID NO:1 lacks N-terminal amino acids of SEQ ID NO:1. In certain such embodiments, the fragment of SEQ ID NO:1 lacks up to the first 12 N-terminal amino acids of SEQ ID NO:1. In certain embodiments, a fragment of SEQ ID NO:1 lacks C-terminal amino acids of SEQ ID NO:1. In certain such embodiments, the fragment of SEQ ID NO:1 lacks up to the last 12 C-terminal amino acids of SEQ ID NO:1. In certain embodiments, a fragment of SEQ ID NO:1 has a predicted isoelectric point of about 9-11.
  • In certain embodiments, an isolated nucleic acid binding polypeptide comprises a variant of a polypeptide comprising SEQ ID NO:1 that has at least one activity of a polypeptide comprising SEQ ID NO:1. In certain embodiments, a variant of a polypeptide comprising SEQ ID NO:1 comprises an amino acid sequence having from about 60% to about 99% identity to SEQ ID NO:1. For example, in certain embodiments, a variant of a polypeptide comprising SEQ ID NO:1 comprises an amino acid sequence having at least 60%, 65%, 70%,75%, 80%, 85%, 90%, 95%, 97%, 98%, or 99% identity to SEQ ID NO:1. In certain such embodiments, the lysine and arginine residues of SEQ ID NO:1 are not substituted or deleted in the variant.
  • In certain embodiments, an isolated nucleic acid binding polypeptide comprises a fragment of SEQ ID NO:4 that has at least one activity of a polypeptide comprising SEQ ID NO:4. Exemplary activities include, but are not limited to, the ability of SEQ ID NO:4 to bind nucleic acid, stabilize nucleic acid duplexes, promote annealing of complementary nucleic acid strands, increase the Tm of primers, enhance the activity of a nucleic acid modification enzyme, and increase the processivity of a polymerase. In certain embodiments, a fragment of SEQ ID NO:4 lacks N-terminal amino acids of SEQ ID NO:4. In certain such embodiments, the fragment of SEQ ID NO:4 lacks up to the first 12 N-terminal amino acids of SEQ ID NO:4. In certain embodiments, a fragment of SEQ ID NO:4 lacks C-terminal amino acids of SEQ ID NO:4. In certain such embodiments, the fragment of SEQ ID NO:4 lacks up to the last 12 C-terminal amino acids of SEQ ID NO:4. In certain embodiments, a fragment of SEQ ID NO:4 has a predicted isoelectric point of about 9-11.
  • In certain embodiments, an isolated nucleic acid binding polypeptide comprises a variant of a polypeptide comprising SEQ ID NO:4 that has at least one activity of a polypeptide comprising SEQ ID NO:4. In certain embodiments, a variant of a polypeptide comprising SEQ ID NO:4 comprises an amino acid sequence having from about 60% to about 99% identity to SEQ ID NO:4. For example, in certain embodiments, a variant of a polypeptide comprising SEQ ID NO:4 comprises an amino acid sequence having at least 60%, 65%, 70%, 75%, 80%,85%, 90%, 95%, 97%, 98%, or 99% identity to SEQ ID NO:4. In certain such embodiments, the lysine and arginine residues of SEQ ID NO:4 are not substituted or deleted in the variant.
  • In certain embodiments, a Crenarchaeal nucleic acid binding polypeptide comprises a naturally occurring polypeptide from the crenarchaeon Aeropyrum pernix. In certain embodiments, a Crenarchaeal nucleic acid binding polypeptide comprises the amino acid sequence of SEQ ID NO:6. SEQ ID NO:6 is 55 amino acids in length with a predicted isoelectric point of about 10.5. It contains 13 lysine residues and 3 arginine residues. SEQ ID NO:6 is similar in size and charge to Sso7d, but is not significantly identical to the amino acid sequence of Sso7d.
  • In certain embodiments, an isolated nucleic acid binding polypeptide comprises a fragment of SEQ ID NO:6 that has at least one activity of a polypeptide comprising SEQ ID NO:6. Exemplary activities include, but are not limited to, the ability of SEQ ID NO:6 to bind nucleic acid, stabilize nucleic acid duplexes, promote annealing of complementary nucleic acid strands, increase the Tm of primers, enhance the activity of a nucleic acid modification enzyme, and increase the processivity of a polymerase. In certain embodiments, a fragment of SEQ ID NO:6 lacks N-terminal amino acids of SEQ ID NO:6. In certain such embodiments, the fragment of SEQ ID NO:6 lacks up to the first 12 N-terminal amino acids of SEQ ID NO:6. In certain embodiments, a fragment of SEQ ID NO:6 lacks C-terminal amino acids of SEQ ID NO:6. In certain such embodiments, the fragment of SEQ ID NO:6 lacks up to the last 12 C-terminal amino acids of SEQ ID NO:6. In certain embodiments, a fragment of SEQ ID NO:6 has a predicted isoelectric point of about 9-11.
  • In certain embodiments, an isolated nucleic acid binding polypeptide comprises a variant of a polypeptide comprising SEQ ID NO:6 that has at least one activity of a polypeptide comprising SEQ ID NO:6. In certain embodiments, a variant of a polypeptide comprising SEQ ID NO:6 comprises an amino acid sequence having from about 60% to about 99% identity to SEQ ID NO:6. For example, in certain embodiments, a variant of a polypeptide comprising SEQ ID NO:6 comprises an amino acid sequence having at least 60%, 65%,70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, or 99% identity to SEQ ID NO:6. In certain such embodiments, the lysine and arginine residues of SEQ ID NO:6 are not substituted or deleted in the variant.
  • In certain embodiments, an isolated polynucleotide comprising a nucleic acid sequence encoding SEQ ID NO:6 is provided. In certain embodiments, an isolated polynucleotide comprises a nucleic acid sequence encoding a fragment of SEQ ID NO:6 having at least one activity of a polypeptide comprising SEQ ID NO:6. In certain embodiments, an isolated polynucleotide comprises a nucleic acid sequence encoding a variant of a polypeptide comprising SEQ ID NO:6 having at least one activity of a polypeptide comprising SEQ ID NO:6.
  • In certain embodiments, an isolated polynucleotide comprising SEQ ID NO:7 is provided. In certain embodiments, an isolated polynucleotide comprises a fragment of SEQ ID NO:7 that encodes a polypeptide having at least one activity of a polypeptide comprising SEQ ID NO:6. In certain embodiments, an isolated polynucleotide comprises a variant of a polynucleotide comprising SEQ ID NO:7 that encodes a polypeptide having at least one activity of a polypeptide comprising SEQ ID NO:6. In certain embodiments, a variant of a polynucleotide comprising SEQ ID NO:7 comprises a nucleic acid sequence having from about 60% to about 99% identity to SEQ ID NO:7 and that encodes a polypeptide having at least one activity of a polypeptide comprising SEQ ID NO:6. For example, in certain embodiments, a variant of a polynucleotide comprising SEQ ID NO:7 comprises a nucleic acid sequence having at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98% or 99% identity to SEQ ID NO:7 and that encodes a polypeptide having at least one activity of a polypeptide comprising SEQ ID NO:6.
  • In certain embodiments, the length of an isolated polynucleotide is any number of nucleotides less than or equal to 10,000. For example, in certain embodiments, an isolated polynucleotide is less than or equal to 10,000, 9000, 8000, 7000, 6000, 5000, 4000, 3000, 2000, 1000, or 500 nucleotides in length. In certain embodiments, the length of an isolated polynucleotide does not include vector sequences.
  • In certain embodiments, a fragment or variant of a Crenarchaeal nucleic acid binding polypeptide has nucleic acid binding activity that is less than the nucleic acid binding activity of the Crenarchaeal nucleic acid binding polypeptide. In certain embodiments, a fragment or variant of a Crenarchaeal nucleic acid binding polypeptide has from about 10-20%, about 20-30%, about 30-40%, about 40-50%, about 50-60%, about 60-70%, about 70-80%, about 80-90%, or about 90-95% of the nucleic acid binding activity of the Crenarchaeal nucleic acid binding polypeptide.
  • The first of the two alignments below demonstrates that SEQ ID NOs:1 and 6 have 60% identity and 74% similarity as determined by the “Blast 2 Sequence” blastp program set at default parameters. (In calculating percent similarity, the blastp program includes both identical and similar amino acids. Similar amino acids are indicated by “+” signs in the alignments below.) The second of the two alignments below demonstrates that SEQ ID NOs:4 and 6 have 59% identity and 72% similarity as determined by the “Blast 2 Sequence” blastp program set at default parameters. In certain embodiments, one or more amino acids that are not conserved in at least one of the alignments below (i.e., amino acids that are not identical or similar) are substituted or deleted to create variants of polypeptides comprising SEQ ID NO:1, SEQ ID NO:4, or SEQ ID NO:6.
  • SEQ ID NO: 1: 1 MSKKQKLKFYDIKAKQAFETDQYEVIEKQTARGPMMFAVAKSPYTGIKVYRLLGKKK 57
      M KK+K+KF+D+ AK+ +ETD YEV  K+TRG    FA AKSPYTG   YR+LGK
    SEQ ID NO: 6: 1 MPKKEKIKFFDLVAKKYYETDNYEVEIKETKRGKFRFAKAKSPYTGKIFYRVLGKA  55
    SEQ ID NO: 4: 1  MAKQKLKFYDIKAKQSFETDKYEVIEKETARGPMLFAVATSPYTGIKVYRLLGKKK 56
         K+K+KF+D+ AK+ +ETD YEV  KETRG    FA A SPYTG   YR+LGK
    SEQ ID NO: 6: 1 MPKKEKIKFFDLVAKKYYETDNYEVEIKETKRGKFRFAKAKSPYTGKIFYRVLGKA  55
  • Based on the above alignments, a consensus sequence for a nucleic acid binding polypeptide is provided as follows:
  • SEQ ID NO: 28
    5′ KXKXKFXDXXAKXXXETDXYEVXXKXTXRGXXXFAXAKSPYTGXXXY
    RXLGK3′
  • In the above consensus sequence, “X” is any amino acid. In certain embodiments, an isolated nucleic acid binding polypeptide comprises an amino acid sequence that conforms to that consensus sequence. In certain such embodiments, the isolated nucleic acid binding polypeptide has at least one activity of a polypeptide comprising SEQ ID NO:1, SEQ ID NO:4, or SEQ ID NO:6.
  • In certain embodiments, a polynucleotide encoding a nucleic acid binding polypeptide is obtained by the polymerase chain reaction (PCR). Certain methods employing PCR are known to those skilled in the art. See, e.g., Sambrook et al. (2001) Molecular Cloning: A Laboratory Manual, Chapter 8 (3rd ed., Cold Spring Harbor Laboratory Press, NY). In certain embodiments, a polynucleotide comprising all or a portion of the coding sequence of a nucleic acid binding polypeptide is amplified using appropriate primers. In certain embodiments, restriction enzyme sites are included in the primers to facilitate cloning of the amplification product into an appropriate expression vector. In certain embodiments, the polynucleotide is amplified from genomic DNA or from cDNA of a crenarchaeote. The complete genome sequence of certain crenarchaeotes is published and may be used in designing primers for PCR. See, e.g., Fitz-Gibbon et al. (2002) Proc. Nat'l Acad. Sci. USA 99:984-989; Kawarabayasi (1999) DNA Research Supp:145-152; and She et al (2001) Proc. Nat'l Acad. Sci. USA 98:7835-7840.
  • In certain embodiments, a polynucleotide encoding a nucleic acid binding polypeptide is obtained by synthesizing individual oligonucleotides which are ligated end-to-end in vitro, with the resulting ligation product comprising the coding sequence of a nucleic acid binding polypeptide . In certain embodiments, the ligation product is amplified by PCR. In certain embodiments, the oligonucleotides overlap in sequence, and are extended by PCR, resulting in a PCR product comprising the coding sequence of a nucleic acid binding polypeptide. See, e.g., Stemmer et al. (1995) Gene 164:49-53; Gronlund et at (2003) J. Biol. Chem. 278:40144-40151. In certain embodiments, the PCR product is cloned into an appropriate expression vector.
  • In certain embodiments, a polynucleotide encoding a nucleic acid binding polypeptide is cloned into a suitable vector. In certain such embodiments, the vector is transferred (e.g., transformed or transfected) into a host cell. In certain embodiments, a polynucleotide encoding a nucleic acid binding polypeptide is cloned into an expression vector and, in certain embodiments, expressed in a suitable host cell. Certain exemplary expression vectors are available for use in certain host cells including, but not limited to, prokaryotes, yeast cells, insect cells, plant cells, and mammalian cells. See, e.g., Ausubel et al. (1991) Current Protocols in Molecular Biology, Chapter 16, John Wiley & Sons, New York. Certain expression vectors for the inducible expression of recombinant proteins in prokaryotes are known to those skilled in the art. For example, in certain embodiments, a polynucleotide encoding a nucleic acid binding polypeptide is cloned into an expression vector such that its transcription is under the control of an inducible promoter, such as the T7 bacteriophage promoter, the T5 promoter, or the tac promoter. See, e.g., the pET series of vectors (Invitrogen, Carlsbad, Calif.), the pQE series of vectors (Qiagen, Valencia, Calif.), or the pGEX series of vectors (Amersham Biosciences, Piscataway, N.J.). In certain embodiments, the recombinant expression vector is transformed into bacteria, such as E. coli. In certain embodiments, the expression of the nucleic acid binding polypeptide is induced by culturing the bacteria under certain growth conditions. For example, in certain embodiments, expression of the nucleic acid binding polypeptide is induced by addition of isopropylthio-β-galactoside (IPTG) to the culture medium.
  • In various embodiments of expression vectors, a polynucleotide encoding a tag, such as an affinity tag, is expressed in frame with a polynucleotide encoding a nucleic acid binding polypeptide. In certain embodiments, certain such tags can provide a mechanism for detection or purification of the nucleic acid binding polypeptide. Examples of tags include, but are not limited to, polyhistidine tags, which allow purification using nickel chelating resin, and glutathione S-transferase moieties, which allow purification using glutathione-based chromatography. In certain embodiments, an expression vector further provides a cleavage site between the tag and the nucleic acid binding polypeptide, so that the nucleic acid binding polypeptide may be cleaved from the tag following purification. In certain embodiments, e.g., embodiments using polyhistidine tags, the nucleic acid binding polypeptide is not cleaved from the tag. It has been reported that the presence of a polyhistidine tag on a recombinant DNA binding protein may enhance the interaction of the DNA binding protein with DNA. See, e.g., Buning et al (1996) Anal. Biochem. 234:227-230.
  • 2. Fusion Proteins
  • In certain embodiments, fusion proteins are provided. In certain such embodiments, a fusion protein comprises any of the nucleic acid binding polypeptides described above joined to a nucleic acid modification enzyme. In certain such embodiments, the nucleic acid modification enzyme comprises a nucleic acid polymerase. In certain embodiments, the nucleic acid polymerase comprises a DNA polymerase.
  • a) Certain DNA Polymerases for Use in Fusion Proteins
  • Certain DNA polymerases are known to those skilled in the art. For example, DNA polymerases include DNA-dependent polymerases, which use DNA as a template, or RNA-dependent polymerases, such as reverse transcriptase, which use RNA as a template. Currently, DNA-dependent DNA polymerases fall into one of six families (A, B, C, D, X, and Y), with most falling into one of three families (A, B, and C). See, e.g., Ito et al.(1991) Nuc. Acids. Research 19:4045-4057; Braithwaite at al. (1993) Nuc. Acids. Research 21:787-802; Fileé et al. (2002) J. Mol. Evol. 54:763-773; and Albà (2001) Genome Biol. 2:3002.1-3002.4. Certain DNA polymerases may be single-chain polypeptides (e.g., certain family A and B polymerases) or multi-subunit enzymes (e.g., certain family C polymerases) with one of the subunits having polymerase activity. Id. In certain embodiments, a fusion protein comprises a DNA polymerase selected from a family A, B, C, D, X, or Y polymerase.
  • In certain embodiments, a fusion protein comprises a fragment or variant of an A, B, C, D, X, or Y polymerase having polymerase activity. In certain embodiments, a fusion protein comprises a family A DNA polymerase or a fragment or variant thereof having polymerase activity. In certain such embodiments, the family A polymerase is a bacterial family A polymerase, such as a polymerase from the genus Bacillus, Thermus, or Thermotoga. In certain such embodiments, the family A polymerase is Taq DNA polymerase (SEQ ID NO:32) or a fragment or variant thereof having polymerase activity. In certain embodiments, a fusion protein comprises a family B DNA polymerase or a fragment or variant thereof having polymerase activity. In certain such embodiments, the family B polymerase is an archaeal family B polymerase, such as a polymerase from the genus Thermococcus or Pyrococcus. In certain such embodiments, the family B polymerase is Pfu DNA polymerase (SEQ ID NO:31) or a fragment or variant thereof having polymerase activity.
  • In addition to polymerase activity, certain DNA polymerases also possess other activities, such as 3′ to 5′ exonuclease (proofreading) activity or 5′ to 3′ exonuclease activity. See, e.g., Fileé et al. (2002) J. Mol. Evol. 54:763-773; and Pavlov et al. (2004) Trends in Biotech. 22:253-260. In certain such DNA polymerases, polymerase activity and exonuclease activity are carried out by separate domains. The domain structure of certain DNA polymerases is known to those skilled in the art. See, e.g., id.; Albà (2001) Genome Biol. 2:3002.1-3002.4; and Steitz (1999) J. Biol. Chem. 274:17395-17398.
  • In certain embodiments, a fusion protein comprises a “chimeric” DNA polymerase. In certain such embodiments, a chimeric DNA polymerase comprises a domain having polymerase activity from a particular DNA polymerase and a domain having exonuclease activity from a different DNA polymerase. See, e.g., U.S. Pat. Nos. 5,795,762 and 5,466,591.
  • In certain embodiments, a fusion protein comprises a DNA polymerase having both polymerase activity and exonuclease activity. In certain such embodiments, the exonuclease activity is 5′ to 3′ exonuclease activity. In certain such embodiments, the level of 5′ to 3′ exonuclease activity is reduced or eliminated relative to the level of 5′ to 3′ exonuclease activity of a native DNA polymerase. In certain such embodiments, mutation of a DNA polymerase results in reduction or elimination of 5′ to 3′ exonuclease activity. In certain such embodiments, one or more amino acid substitutions result in reduction or elimination of 5′ to 3′ exonuclease activity. Certain such substitutions are known to those skilled in the art. For example, substitution of a conserved glycine in certain thermostable DNA polymerases reduces or eliminates 5′ to 3′ exonuclease activity. See, e.g., U.S. Pat. Nos. 5,795,762 and 5,466,591(describing the G46D substitution in Taq, Tth, and TZ05 DNA polymerases; the G43D substitution in Tsps17 DNA polymerase; and the G37D substitution in Tma and Taf DNA polymerases).
  • In certain embodiments, deletion of one or more amino acids from a DNA polymerase results in the reduction or elimination of 5′ to 3′ exonuclease activity. Certain such deletions are known to those skilled in the art. For example, certain N-terminal deletions of certain thermostable DNA polymerases reduce or eliminate 5′ to 3′ exonuclease activity. Exemplary N-terminal deletions include, but are not limited to, deletion of about the first 35-50 amino acid residues of a thermostable DNA polymerase. See, e.g., U.S. Pat. Nos. 5,795,762 and 5,466,591 (describing deletion of N-terminal amino acid residues up to and including the conserved glycine residues in Taq, Tth, TZ05, Tsps17, Tma, and Taf, described above). Exemplary N-terminal deletions further include, but are not limited to, deletion of about the first 70-80 amino acid residues of a thermostable DNA polymerase. See, e.g., U.S. Pat. Nos. 5,795,762 and 5,466,591 (describing deletion of N-terminal amino acid residues up to and including the following residues: Ala 77 (Taq DNA polymerase), Ala 78 (Tth DNA polymerase), Ala 78 (TZ05 DNA polymerase), Ala 74 (TSPS17 DNA polymerase), Leu 72 (Tma DNA polymerase), and Ile 73 (Taf DNA polymerase)). Exemplary N-terminal deletions further include, but are not limited to, deletion of the first 139 or the first 283 amino acid residues of Tma DNA polymerase. See, e.g., U.S. Pat. Nos. 5,795,762 and 5,466,591.
  • In certain embodiments, a fusion protein comprises a DNA polymerase that lacks an exonuclease domain. In certain such embodiments, the exonuclease domain is a 5′ to 3′ exonuclease domain. Exemplary DNA polymerases that lack a 5′ to 3′ exonuclease domain include, but are not limited to, the large (“Klenow”) fragment of E. coli DNA polymerase I and the “Stoffel” fragment of Taq DNA polymerase, which lacks about the first 289-300 N-terminal amino acids of full-length Taq DNA polymerase. See Lawyer at al. (1989) J. Biol. Chem. 264:6427-6437(describing a “Stoffel” fragment); Vainshtein et al. (1996) Protein Science 5:1785-1792.
  • In certain embodiments, a fusion protein comprises a DNA polymerase having one or more mutations that reduce the ability of the polymerase to discriminate against the incorporation of dideoxynucleotides. Certain exemplary mutations are described, for example, in U.S. Pat. No. 6,333,183; EP 0 745 676 B1; and U.S. Pat. No. 5,614,365. One such exemplary mutation is the F667Y mutation in Taq DNA polymerase. See, e.g., U.S. Pat. No. 5,614,365.
  • In certain embodiments, a fusion protein comprises a DNA polymerase having one or more mutations that reduce the ability of the polymerase to discriminate against the incorporation of fluorescently labeled nucleotides into polynucleotides. In certain embodiments, such “discrimination reduction” mutations occur within the nucleotide label interaction region of a DNA polymerase, which is described, for example, in U.S. Pat. No. 6,265,193. Exemplary discrimination reduction mutations are provided in U.S. Pat. No. 6,265,193. In certain embodiments, a DNA polymerase further comprises one or more mutations in addition to one or more discrimination reduction mutations. In certain embodiments, such mutations include, but are not limited to, mutations that increase or decrease 3′ to 5′ exonuclease activity; increase or decrease 5′ to 3′ exonuclease activity; increase or decrease thermostability; increase or decrease processivity; and increase incorporation of dideoxynucleotides. In certain embodiments, a DNA polymerase comprises one or more discrimination reduction mutations and one or more mutations that decrease 3′ to 5′ exonuclease activity. In certain embodiments, a DNA polymerase comprises one or more discrimination reduction mutations and one or more mutations that increase incorporation of dideoxynucleotides. Such DNA polymerases are described, for example, in U.S. Pat. No. 6,265,193.
  • In certain, embodiments, a fusion protein comprises a thermostable DNA polymerase. In certain embodiments, a thermostable DNA polymerase is a naturally occurring thermostable DNA polymerase. In certain embodiments, a thermostable DNA polymerase is a fragment or variant of a naturally occurring thermostable DNA polymerase that possesses polymerase activity. Exemplary guidance for determining certain such fragments and variants is provided in Pavlov et al. (2004) Trends in Biotech. 22:253-260.
  • Certain exemplary thermostable DNA polymerases are known to those skilled in the art. See, e.g., Sambrook et al. (2001) Molecular Cloning: A Laboratory Manual (3rd ed., Cold Spring Harbor Laboratory Press, NY) at 8.10-8.11. Such thermostable DNA polymerases include, but are not limited to, DNA polymerases from the genus Thermus, Thermococcus, Thermotoga, Bacillus, and Pyrococcus. Certain exemplary thermostable DNA polymerases include, but are not limited to, DNA polymerases from Thermus aquaticus (e.g., Taq DNA polymerase), Thermus brockianus (e.g., Tbr polymerase), Thermus flavus (e.g., Tfl DNA polymerase), Thermus caldophilus, Thermus filiformis, Thermus oshimai, Thermus thermophilus (e.g., Tth DNA polymerase), and Thermus ubiquitus. Certain other thermostable DNA polymerases from Thermus include, but are not limited to, Tsps17 and TZ05. Certain fragments and variants of Taq, Tfl, Tth, Tsps17, and TZ05 DNA polymerases are known to those skilled in the art. See, e.g., Vainshtein et al. (1996) Protein Science 5:1785-1792 (discussing the Taq Stoffel fragment), EP 0 745 676 B1, WO 01/14568, US 2004/0005573 A1, U.S. Pat. No. 5,795,762, and U.S. Pat. No. 5,466,591.
  • In certain embodiments, a thermostable DNA polymerase comprises a DNA polymerase from Thermococcus litoralis (e.g., Tli polymerase), Thermococcus kodakarensis KOD1 (e.g., KOD DNA polymerase), or Thermococcus gorgonarius (e.g., Tgo DNA polymerase). Certain fragments and variants of KOD DNA polymerase are known to those skilled in the art. See, e.g., EP 1 154 017 A1. In certain embodiments, a thermostable DNA polymerase comprises a DNA polymerase from Thermotoga neapolitana (e.g., Tne DNA polymerase) or Thermotoga maritima (e.g., Tma DNA polymerase). See, e.g., US 2003/0092018 A1 and US 2003/0162201 A1. In certain embodiments, a thermostable DNA polymerase comprises a DNA polymerase from Thermosipho africanus (e.g., Taf DNA polymerase). Certain fragments and variants of Tma, Taf, and Tne DNA polymerases are known to those skilled in the art. See, e.g., US 2003/0092018 A1, US 2003/0162201 A1, U.S. Pat. No. 5,795,762; and and U.S. Pat. No. 5,466,591.
  • Certain exemplary thermostable DNA polymerases include, but are not limited to, DNA polymerases from Pyrococcus furiosus (e.g., Pfu DNA polymerase), Pyrococcus woesei (e.g., Pwo polymerase), Pyrococcus spp. GB-D, and Pyrolobus fumarius. See, e.g., U.S. Pat. No. 6,489,150 B1, U.S. Pat. No. 6,673,585 B1, U.S. Pat. No. 5,948,666, U.S. Pat. No. 6,492,511, and EP 0 547 359 B1. Certain fragments and variants of Pfu polymerase are known to those skilled in the art. See, e.g., U.S. Pat. No. 6,333,183 B1. In certain embodiments, a thermostable DNA polymerase comprises a DNA polymerase from Bacillus stearothermophilus or a variant or fragment thereof, such as the “large fragment” of Bst DNA polymerase. In certain embodiments, a thermostable DNA polymerase comprises a DNA polymerase from the thermophilic bacterium designated Tsp JS1. See, e.g., US 2004/0005573 A1. Certain fragments and variants of a thermostable DNA polymerase from Tsp JS1 are known to those skilled in the art. Id.
  • b) Certain Recombinant Methods for Making Fusion Proteins
  • In various embodiments, a fusion protein comprising a nucleic acid binding polypeptide and a nucleic acid modification enzyme is produced using recombinant methods. In certain embodiments, a polynucleotide encoding a nucleic acid binding polypeptide and a polynucleotide encoding a nucleic acid modification enzyme are ligated together in the same reading frame, resulting in a polynucleotide encoding a fusion protein.
  • In certain embodiments, a polynucleotide encoding a nucleic acid binding polypeptide is obtained as described in Part IV: C.1 above.
  • In certain embodiments, a polynucleotide encoding a nucleic acid modification enzyme is obtained by the polymerase chain reaction (PCR). Certain methods employing PCR are known to those skilled in the art. In certain embodiments, a polynucleotide comprising all or a portion of the coding sequence of a nucleic acid modification enzyme is amplified using appropriate primers. In certain embodiments, restriction enzyme sites are included in the primers to facilitate cloning of the amplification product into an appropriate expression vector. In certain embodiments, the polynucleotide encoding a nucleic acid modification enzyme is a polynucleotide encoding a DNA polymerase. Polynucleotide sequences encoding certain DNA polymerases are known to those skilled in the art. See, e.g., Ito et al. (1991) Nuc. Acids. Research 19:4045-4057; Braithwaite et al. (1993) Nuc. Acids. Research 21:787-802; and Fileé et al. (2002) J. Mol. Evol. 54:763-773. In certain embodiments, the polynucleotide encoding a DNA polymerase is a polynucleotide encoding Taq DNA polymerase or a fragment or variant thereof having polymerase activity. In certain embodiments, the polynucleotide encoding a DNA polymerase is a polynucleotide encoding Pfu DNA polymerase or a fragment or variant thereof having polymerase activity.
  • In various embodiments, a polynucleotide encoding a fusion protein is cloned into a suitable vector. In certain such embodiments, the vector is transferred (e.g., transformed or transfected) into a suitable host cell. In certain embodiments, a polynucleotide encoding a fusion protein is cloned into an expression vector and, in certain embodiments, expressed in a suitable host cell. Certain exemplary expression vectors and host cells are known to those skilled in the art, as described in Part IV.C.1 above. In certain embodiments, the fusion protein is isolated from the host cell.
  • In certain embodiments of a fusion protein, a nucleic acid binding polypeptide is joined to the N-terminus of a nucleic acid modification enzyme. In certain embodiments of a fusion protein, a nucleic acid binding polypeptide is joined to the C-terminus of a nucleic acid modification enzyme. In certain embodiments of a fusion protein, a nucleic acid binding polypeptide is disposed internally within a nucleic acid modification enzyme.
  • In certain embodiments of a fusion protein, a nucleic acid binding polypeptide is joined to the N-terminus of a DNA polymerase. In certain embodiments, of a fusion protein, a nucleic acid binding polypeptide is joined to the C-terminus of a DNA polymerase. In certain embodiments of a fusion protein, a nucleic acid binding polypeptide is disposed internally within a DNA polymerase. Certain three dimensional structures of certain DNA polymerases are known to those skilled in the art. See, e.g. Steitz (1999) J. Biol. Chem. 274:17395-17398; Albà (2001) Genome Biol. 2:3002.1-3002.4. Certain DNA polymerases typically have a “hand-like” three dimensional structure comprising “finger,” “palm,” and “thumb” domains. See, e.g., Steitz (1999) J. Biol. Chem. 274:17395-17398; Albà (2001) Genome Biol. 2:3002.1-3002.4. In certain embodiments of a fusion protein, wherein a nucleic acid binding polypeptide is disposed internally within a DNA polymerase, the nucleic acid binding polypeptide occurs within a loop in the “thumb” domain of the DNA polymerase. See, e.g., U.S. Pat. No. 5,972,603, e.g., FIG. 4.
  • In certain embodiments, one skilled in the art can routinely determine whether a DNA polymerase retains polymerase activity in the context of a fusion protein by assaying the fusion protein for polymerase activity.
  • c) Certain Other Methods for Making Fusion Proteins
  • In certain embodiments, a nucleic acid binding polypeptide is joined to a nucleic acid modification enzyme by chemical methods. In certain such embodiments, the nucleic acid modification enzyme is a DNA polymerase. In certain embodiments, a nucleic acid binding polypeptide is joined to a nucleic acid modification enzyme by a chemical coupling agent. Certain such methods are known to those skilled in the art. See, e.g., Hermanson, ed., Bioconiuqate Techniques (Academic Press 1996).
  • d) Certain Linkers
  • In certain embodiments, a nucleic acid binding polypeptide is joined to a nucleic acid modification enzyme by a linker. In certain embodiments, a linker is a peptide, which is joined by peptide bonds to a nucleic acid binding polypeptide and to a nucleic acid modification enzyme. In certain embodiments, a linker is engineered into a fusion protein by standard recombinant methods. For example, in certain embodiments, a polynucleotide encoding a fusion protein is constructed, wherein a polynucleotide encoding a linker is in frame with and disposed between a polynucleotide encoding a nucleic acid modification enzyme and a polynucleotide encoding a nucleic acid binding polypeptide.
  • In certain embodiments, a linker is any whole number of amino acids less than or equal to 25. In certain embodiments, a linker does not form an α-helix or β-strand. In certain such embodiments, a linker forms an extended, or “loop,” conformation. In certain embodiments, a linker sequence comprises one or more glycine residues. In certain embodiments, a suitable linker sequence is determined using the LINKER program. See, e.g., Crasto et al. (2000) Protein Eng. 13:309-312.
  • Other exemplary linkers include, but are not limited to, carbohydrate linkers, lipid linkers, fatty acid linkers, and polymeric linkers. Exemplary polymeric linkers include, but are not limited to, polyether linkers, such as polyethylene glycol (PEG).
  • D. Certain Exemplary Methods Using Nucleic Acid Binding Polypeptides
  • 1. Stabilize Nucleic Acid Duplexes
  • In certain embodiments, one or more nucleic acid binding polypeptides are used to stabilize a nucleic acid duplex from denaturation at temperatures above the Tm of the nucleic acid duplex, thereby effectively increasing the Tm of the nucleic acid duplex. In certain such embodiments, one or more nucleic acid binding polypeptides are combined with a nucleic acid duplex. In certain such embodiments, the ratio of the concentration of a nucleic acid binding polypeptide to the concentration of the nucleic acid duplex (in nucleotides) is at least about 1:25, 1:10, 1:5, 1:3, 1:1, or any ratio wherein the concentration of the nucleic acid binding polypeptide exceeds that of the nucleic acid duplex.
  • 2. Anneal Complementary Nucleic Acid Strands
  • In certain embodiments, one or more nucleic acid binding polypeptides are used to promote the annealing of complementary nucleic acid strands. In certain embodiments, annealing takes place with greater rapidity and specificity in the presence of a nucleic acid binding polypeptide than in the absence of a nucleic acid binding polypeptide. In certain embodiments, complementary nucleic acid strands are allowed to anneal in a composition comprising one or more nucleic acid binding polypeptides. In certain such embodiments, a nucleic acid binding polypeptide is present at any concentration from about 1 μg/ml to about 500 μg/ml. In certain embodiments, one or more nucleic acid binding polypeptides are used to favor the annealing of nucleic acid strands that are complementary without mismatches over the annealing of nucleic acid strands that are complementary with mismatches.
  • In certain embodiments, nucleic acid binding polypeptides are used in hybridization-based detection assays or primer extension assays in which a probe or primer is annealed to a target nucleic acid sequence. Certain examples of the use of nucleic acid binding polypeptides in certain such assays are provided below.
  • a) Hybridization-Based Detection Assays
  • In certain embodiments, one or more nucleic acid binding polypeptides are used to increase the efficiency, e.g., the speed and specificity, of a hybridization-based detection assay. Exemplary hybridization-based detection assays include, but, are not limited to, assays in which target nucleic acid is immobilized on a solid support and exposed to a labeled probe (see, e.g., Sambrook et al. (2001) Molecular Cloning: A Laboratory Manual (3rd ed., Cold Spring Harbor Laboratory Press, NY), e.g., at 6.33-6.58 (describing “Southern” hybridizations). In certain embodiments, exemplary hybridization-based detection assays include microarray-based assays in which target nucleic acid is labeled and exposed to a plurality of polynucleotides immobilized on a solid support. See id. Appendix 10. An example of the use of the nucleic acid binding polypeptide Sso7d in a microarray-based detection assay is described, e.g., in Hatakeyama, US 2003/0022162 A1.
  • In certain hybridization-based detection assays, a nucleic acid probe is exposed to a mixture of nucleic acids. Within that mixture is a target nucleic acid, which, comprises a sequence that is complementary to the probe. The probe specifically anneals to the target nucleic acid to form a hybridization complex under certain conditions, e.g., conditions in which the probe is exposed to the target nucleic acid for an appropriate length of time and at an annealing temperature below that of the predicted Tm of the probe.
  • In certain embodiments, one or more nucleic acid binding polypeptides are used to increase the Tm of a probe, thereby increasing the temperature at which the annealing may be carried out. In certain such embodiments, the annealing is carried out in the presence of one or more nucleic acid binding polypeptides. In certain such embodiments, the annealing takes place at any temperature from 10° C. below to 40° C. above the predicted Tm of the probe. In certain such embodiments, the annealing takes place at a temperature up to 40° C. above the predicted Tm of the probe. In certain embodiments in which a probe is an oligonucleotide of about 15-35 nucleotides, annealing takes place in the presence of one or more nucleic acid binding polypeptides at any temperature between 40° C. and 85° C.
  • In certain embodiments, one or more nucleic acid binding polypeptides are used to increase the Tm of a probe, thereby allowing the use of shorter probes. In certain such embodiments, the annealing is carried out in the presence of one or more nucleic acid binding polypeptides. In certain such embodiments, a probe is of any length between 12 and 25 nucleotides. In certain such embodiments, a probe is of any length between 12 and 19 nucleotides. In certain such embodiments, a probe is of any length between 12 and 16 nucleotides.
  • In certain embodiments, one or more nucleic acid binding polypeptides are used to decrease the duration of time to achieve annealing. In certain such embodiments, the annealing is carried out in the presence of one or more nucleic acid binding polypeptides. In certain such embodiments, the annealing takes place over any amount of time from about 0.5 minute to about three hours. In certain such embodiments, the annealing takes place over any amount of time from about 1 minute to about 30 minutes. In certain such embodiments, the annealing takes place over any amount of time from about 1 minute to about 15 minutes.
  • In certain embodiments of hybridization-based detection assays, a probe may selectively hybridize to a target nucleic acid that is complementary without mismatches to the probe. In certain embodiments, a probe may also selectively hybridize to a target nucleic acid that is complementary to the probe but that contains one or more mismatches relative to the probe. In certain embodiments, one or more nucleic acid binding polypeptides are used to favor the hybridization of a probe to a target nucleic acid that is complementary without mismatches to the probe over the hybridization of a probe to a target nucleic acid that is complementary but that contains one or more mismatches relative to the probe. Thus, in certain embodiments, the specificity of hybridization is increased. In certain such embodiments, annealing is carried out under any of the conditions of time or temperature described above. In certain such embodiments, annealing is carried out at a temperature greater than the predicted Tm of the probe.
  • In certain embodiments, because nucleic acid binding polypeptides can substantially increase the speed and specificity of a hybridization-based detection assay, such polypeptides can be used in certain hybridization-based “point-of-use” devices. Point-of-use devices are typically portable devices that allow rapid diagnosis or detection of a physiological or pathological condition, in certain instances, in a non-clinical or small-scale laboratory setting. An exemplary point-of-use device is, for example, a typical pregnancy test. An exemplary point-of-use device that uses hybridization-based detection is, for example, the Affirm VPIII Microbial Identification System (Becton Dickinson and Company—BD Diagnostics, Sparks, Md.), whereby the presence of certain vaginal pathogens is detected in vaginal swab specimens using an oligonucleotide hybridization assay. See Briselden et al. (1994) J. Clin. Microbiol. 32:148-52; Witt et al. (2002) J. Clin. Microbiol. 40:3057-3059.
  • In certain embodiments, one or more nucleic acid binding polypeptides can be used in a hybridization-based point-of-use device that diagnoses a pathological condition, such as an infection, by detecting genetic material from a pathogen in a biological sample from a host. In certain embodiments, the volume of a biological sample to be used with a point-of-use device is reduced in the presence of one or more nucleic acid binding polypeptides. In certain embodiments, the hybridization-based point-of-use device utilizes microarray technology.
  • In certain embodiments, because nucleic acid binding polypeptides can substantially increase the specificity of a hybridization-based detection assay, one or more nucleic acid binding polypeptides can be used in assays that detect mutations or polymorphisms in a target polynucleotide. For example, one or more nucleic acid binding polypeptides can be used in assays that detect single nucleotide polymorphisms (SNPs). For a review of SNP detection methods, see., e.g., Shi (2001) Clinical Chem. 47:164-172. In certain embodiments, one or more nucleic acid binding polypeptides are used in assays that detect rare copies of a target polynucleotide in a complex mixture of nucleic acids. For example, in certain such embodiments, the target polynucleotide comprises genetic material from a pathogen contained within a biological sample from a host.
  • b) Increase Tm of Primers in Primer Extension Reactions
  • In certain embodiments, one or more nucleic acid binding polypeptides are used to increase the Tm of a primer in a primer extension reaction. In certain primer extension reactions, such as PCR, one or more primers are annealed to a template nucleic acid. In PCR, e.g., the annealing typically takes place over 30 seconds at about 55° C., a temperature that is less than the predicted Tm of a typical primer of about 20-30 nucleotides. Sambrook et al. (2001) Molecular Cloning: A Laboratory Manual (3rd ed., Cold Spring Harbor Laboratory Press, NY) at 8.22.
  • In certain embodiments, one or more nucleic acid binding polypeptides are used to increase the Tm of a primer in a primer extension reaction, thereby increasing the temperature at which the annealing may be carried out. In certain such embodiments, the annealing is carried out in the presence of one or more nucleic acid binding polypeptides. In certain such embodiments, the annealing is carried out at any temperature from about 55° C. up to about 75° C. In certain such embodiments, the annealing is carried out at any temperature between 60° C. and 70° C. In certain embodiments, increased annealing temperature reduces certain primer artifacts, such as primer dimmers and hairpin formation.
  • In certain embodiments, one or more nucleic acid binding polypeptides are used to increase the Tm of a primer in a primer extension reaction, thereby allowing the use of shorter primers. In certain such embodiments, the annealing is carried out in the presence of one or more nucleic acid binding polypeptides. In certain such embodiments, a primer is of any length between 12 and 19 nucleotides. In certain such embodiments, a primer is of any length between 12 and 16 nucleotides.
  • 3. Enhance Activity of Nucleic Acid Modification Enzymes
  • In certain embodiments, one or more nucleic acid binding polypeptides are used to enhance the activity of a nucleic acid modification enzyme. In certain such embodiments, one or more nucleic acid binding polypeptides are included in a composition comprising a nucleic acid modification enzyme and a nucleic acid, thus enhancing the activity of the nucleic acid modification enzyme. In various embodiments, the enhancement in the activity of a nucleic acid modification enzyme is demonstrated by comparing the activity of the nucleic acid modification enzyme in the presence of one or more nucleic acid binding polypeptides with its activity in the absence of one or more nucleic acid binding polypeptides. In certain embodiments, the following assays may be used to evaluate the activity of a nucleic acid modification enzyme:
  • In certain embodiments, the activity of a gyrase or topoisomerase is assessed by determining the change in the supercoiled state of a nucleic acid exposed to the gyrase or topoisomerase in the presence and in the absence of one or more nucleic acid binding polypeptides.
  • In certain embodiments, the activity of a nuclease is assessed by determining the amount of cleavage product produced by the nuclease in the presence and in the absence of one or more nucleic acid binding polypeptides. In certain such embodiments, the activity of a restriction endonuclease is assessed by exposing a nucleic acid to a restriction endonuclease in the presence and in the absence of one or more nucleic acid binding polypeptides. In certain such embodiments, the extent of digestion by the restriction endonuclease is determined by gel electrophoresis.
  • In certain embodiments, the activity of a methylase is determined by assessing the methylation state of a nucleic acid exposed to a methylase in the presence and in the absence of one or more nucleic acid binding polypeptides. In certain such embodiments, the methylation state of the nucleic acid is assessed, for example, by determining the extent to which the nucleic acid is cleaved by a methylation sensitive restriction endonuclease, such as MboI.
  • In certain embodiments, the activity of a ligase is assessed by determining the amount of ligation product produced by the ligase in the presence and in the absence of one or more nucleic acid binding polypeptides. In certain such embodiments, a circularized plasmid is linearized by a restriction endonuclease, isolated from the restriction endonuclease, and exposed to ligase in the presence and in the absence of one or more nucleic acid binding polypeptides. In certain such embodiments, the ligation reaction mixture is used to transform competent bacteria. In certain such embodiments, the number of transformants is proportional to the activity of the ligase.
  • In certain embodiments, the activity of a polymerase is assessed in the presence and in the absence of one or more nucleic acid binding polypeptides using the polymerase activity assay described above in Part IV.A.
  • 4. Increase Processivity of a DNA Polymerase
  • In certain embodiments, one or more nucleic acid binding polypeptides are used to improve the performance of DNA polymerase. In certain such embodiments, improved performance of DNA polymerase is increased processivity of the DNA polymerase in a primer extension reaction. In certain embodiments, the primer extension reaction is PCR. For example, in certain embodiments, the inclusion of one or more nucleic acid binding polypeptides in a PCR reaction allows for more efficient amplification of targets under suboptimal conditions, such as high salt concentrations. Examples of certain high salt concentrations include from 60 mM KCl to 130 mM KCl for Taq DNA polymerase, and from 40 mM KCl to 130 mM KCl for Pfu polymerase. In certain embodiments, the inclusion of one or more nucleic acid binding polypeptides in a PCR reaction decreases the time of the extension step of PCR to, for example, ≦5 minutes, ≦3 minutes, ≦2 minutes, ≦1 minute, or ≦30 seconds. In certain embodiments, the inclusion of one or more nucleic acid binding polypeptides in a PCR reaction allows for more efficient amplification of long targets, for example, targets from about 5 kb to about 20 kb.
  • E. Certain Methods Using Fusion Proteins
  • In certain embodiments, a fusion protein comprising a nucleic acid binding polypeptide and a nucleic acid modification enzyme is used in any method that uses a nucleic acid binding polypeptide (as described, for example, in Part IV.D. above), except that the fusion protein replaces the nucleic acid binding polypeptide in the method. In certain embodiments, a fusion protein comprising a nucleic acid binding polypeptide and a nucleic acid modification enzyme is used in any method that uses a nucleic acid binding polypeptide (as described, for example, in Part IV.D. above), except that the fusion protein is used in combination with the nucleic acid binding polypeptide in the method.
  • In certain embodiments, a fusion protein comprising a nucleic acid binding polypeptide and a nucleic acid modification enzyme is used in any reaction in which the nucleic acid modification enzyme alone can be used. In certain embodiments, a fusion protein comprising a nucleic acid binding polypeptide and a nucleic acid modification enzyme is used to improve the efficiency of any reaction in which the nucleic acid modification enzyme alone can be used. In certain such embodiments, a fusion protein comprising a nucleic acid binding polypeptide and a nucleic acid modification enzyme has increased activity relative to the nucleic acid modification enzyme alone. In certain such embodiments, the assays set forth in Part IV.D.3 above may be used to evaluate the activity of a nucleic acid modification enzyme or a fusion protein comprising a nucleic acid binding polypeptide and a nucleic acid modification enzyme. In certain embodiments, a fusion protein comprising a nucleic acid binding polypeptide and a DNA polymerase has increased processivity relative to the DNA polymerase alone.
  • In certain embodiments, a fusion protein comprising a nucleic acid binding polypeptide and a DNA polymerase is used in a primer extension reaction. In certain such embodiments, the fusion protein increases the efficiency of the primer extension reaction. In certain embodiments, a fusion protein comprising a nucleic acid binding polypeptide and a DNA polymerase is included in a primer extension reaction to increase the Tm of one or more primers in the reaction. In certain embodiments, the temperature at which annealing is carried out may be increased. In certain embodiments, shorter primers may be used.
  • In certain embodiments, a fusion protein comprising a nucleic acid binding polypeptide and a thermostable DNA polymerase is included in a PCR reaction. In certain such embodiments, the fusion protein increases the efficiency of PCR. In certain embodiments, a fusion protein comprising a nucleic acid binding polypeptide and a thermostable DNA polymerase is included in a PCR reaction that is conducted under suboptimal conditions, such as high salt concentrations. Exemplary high salt concentrations are described above in Part IV.D.4. In certain embodiments, a fusion protein comprising a nucleic acid binding polypeptide and a thermostable DNA polymerase is included in a PCR reaction to decrease the time of the extension step of PCR. Exemplary extension times are provided above in Part IV.D.4. In certain embodiments, a fusion protein comprising a nucleic acid binding polypeptide and a thermostable DNA polymerase is included in a PCR reaction to more efficiently amplify long targets. Exemplary target lengths are provided above in Part IV.D.4. In certain embodiments, a fusion protein comprising a nucleic acid binding polypeptide and a thermostable DNA polymerase is included in a PCR reaction to increase the amount of PCR amplification product.
  • In certain embodiments, a fusion protein comprising a nucleic acid binding polypeptide and a thermostable DNA polymerase is used in “hot start” PCR. In certain embodiments, “hot start” PCR is used to suppress non-specific binding of primer to template. See, e.g., Sambrook et al. (2001) Molecular Cloning: A Laboratory Manual (3rd ed., Cold Spring Harbor Laboratory Press, NY) at 8.110 (describing “hot start” PCR). In certain embodiments of “hot start” PCR, one or more components to be used in a PCR are prevented from functioning in the PCR until the reaction mixture reaches or exceeds a temperature at which non-specific priming does not occur. Id. For example, in certain embodiments of “hot start” PCR, an antibody to the thermostable DNA polymerase is used to reversibly block polymerase activity until a suitable temperature is reached. See, e.g., Kellogg et al. (1994) Biotechnigues 16:1134-1137 (describing the use of antibodies to Tag DNA polymerase). In certain embodiments, a fusion protein comprising a nucleic acid binding polypeptide and a thermostable DNA polymerase is used in “hot start” PCR. In certain such embodiments, an antibody to the nucleic acid binding polypeptide is used to reversibly block nucleic acid binding activity and/or polymerase activity until a suitable temperature is reached.
  • F. Certain Kits
  • In various embodiments, kits are provided. In certain embodiments, a kit comprises any one or more of the nucleic acid binding polypeptides described above. In certain embodiments, a kit further comprises a nucleic acid modification enzyme. In certain such embodiments the nucleic acid modification enzyme is a DNA polymerase. In certain such embodiments, the DNA polymerase is a thermostable DNA polymerase. In certain embodiments, a kit further comprises deoxynucleotides. In certain embodiments, a kit further comprises dideoxynucleotides.
  • In certain embodiments, a kit comprises any one or more of the fusion proteins described above. In certain such embodiments, the fusion protein comprises a nucleic acid binding polypeptide and a DNA polymerase. In certain such embodiments, the DNA polymerase is a thermostable DNA polymerase. In certain embodiments, a kit further comprises deoxynucleotides. In certain embodiments, a kit further comprises dideoxynucleotides.
  • V. EXAMPLES
  • A. Cloning and Expression of Polynucleotides Encoding Nucleic Acid Binding Polypeptides
  • A polynucleotide encoding SEQ ID NO:1 was constructed by ligating the following oligonucleotides (SEQ ID NOs:8-10) end-to-end, such that the 5′ end of SEQ ID NO:9 was ligated to the 3′ end of SEQ ID NO:8, and the 5′ end of SEQ ID NO:10 was ligated to the 3′ end of SEQ ID NO:9.
  • SEQ ID NO: 8
    5′ atgtccaaga agcagaaact Gaagttctac gacatTaagg
    cgaagcaggc gtttgag 3′
    SEQ ID NO: 9
    5′ acCgaccagt acgaggttat tgagaagcag acCgcccgcg
    gtccgatgat gttcgcc 3′
    SEQ ID NO: 10
    5′ gtggccaaat cgccgtacac cggcatTaaa gtGtacCgCc
    tgttaggcaa gaagaaataa 3′

    The capital letters in SEQ ID NOs:8-10 represent changes from the naturally occurring PAE3192 sequence (SEQ ID NO:2). Those changes were made to generate codons more favorable for the expression of SEQ ID NO:1 in E. coli. Those changes do not result in any alterations in the amino acid sequence of SEQ ID NO:1.
  • To ligate SEQ ID NOs:8-10 together, the following oligonucleotides (SEQ ID NOS:11-12) were first annealed to SEQ 10 NOs:8-10 as discussed below.
  • SEQ ID NO: 11
    5′ gtactggtcg gtctcaaacg cctg 3′
    SEQ ID NO: 12
    5′ cgatttggcc acggcgaaca tcat 3′

    SEQ ID NO:11 is complementary to the 3′ end of SEQ ID NO:8 and the 5′ end of SEQ ID NO:9. Thus, the annealing of SEQ ID NO:11 to SEQ ID NOs:8-9 created a region of double stranded DNA where SEQ ID NO:11 spans the junction of SEQ ID NOS:8-9. This region of double stranded DNA was a suitable substrate for DNA ligase. Likewise, SEQ ID NO:12 is complementary to the 3′ end of SEQ ID NO:9 and the 5′ end of SEQ ID NO:10. Thus, the annealing of SEQ ID NO:12 to SEQ ID NOS:9-10 created a region of double stranded DNA where SEQ ID NO:12 spans the junction of SEQ ID NOS:9-10. SEQ 10 NOs:8-10 were then ligated. The resulting polynucleotide (SEQ ID NO:13) was amplified by PCR.
  • A polynucleotide encoding SEQ ID NO:6 was constructed by ligating the following oligonucleotides (SEQ ID NOs:14-16) end-to-end:
  • SEQ ID NO: 14
    5′ atgccGaaga aggagaagat Taagttcttc gacctGgtcg
    ccaagaagta ctacgag 3′
    SEQ ID NO: 15
    5′ actgacaact acgaagtcga gatTaaggag actaagCgCg
    gcaagtttCg Cttcgcc 3′
    SEQ ID NO: 16
    5′ aaagccaaga gcccgtacac cggcaagatc ttctatCgCg
    tgctGggcaa agcctag 3′

    The capital letters represent changes from the naturally occurring APE3192 sequence (SEQ ID NO:7). Those changes were made to generate codons more favorable for the expression of SEQ ID NO:6 in E. coli. Those changes do not result in any alterations in the amino acid sequence of SEQ ID NO:6.
  • The following oligonucleotides (SEQ ID NOs:17-18) were annealed to SEQ ID NOs:14-16 to create regions of double stranded DNA spanning the junctions between SEQ ID NOs:14-15 and SEQ ID NOs:15-16.
  • SEQ ID NO: 17
    5′ gtagttgtca gtctcgtagt actt 3′
    SEQ ID NO: 18
    5′ gctcttggct ttggcgaagc gaaa 3′

    SEQ ID NOs:14-16 were then ligated. The resulting polynucleotide (SEQ ID NO:19) was amplified by PCR.
  • SEQ ID NO:13 was cloned into the pET16b vector (Novagen, Milwaukee, Wis.) using standard recombinant methods. That vector allows expression of the cloned sequences from the inducible T7 promoter. It also includes sequences encoding polyhistidine (10× His) (SEQ ID NO: 48) followed by a Factor Xa cleavage site upstream of the cloning site. Thus, the encoded proteins are tagged at their N-termini with a polyhistidine moiety. Recombinant vector comprising SEQ ID NO:13 was transformed into competent E. coli host cells using standard methods.
  • SEQ ID NO:19 was also cloned into the pET16b vector using standard recombinant methods. Recombinant vector comprising SEQ ID NO:19 was transformed into competent E. coli host cells using standard methods.
  • Host cells containing a recombinant vector comprising SEQ ID NO:13 are induced to express a tagged polypeptide comprising SEQ ID NO:1 by adding IPTG to the media in which the host cells are grown. The tagged polypeptide is isolated from the host cells by affinity chromatography using nickel-NTA resin. In certain embodiments, the polyhistidine tag is removed from the isolated polypeptide by treatment with Factor Xa.
  • Host cells containing a recombinant vector comprising SEQ ID NO:19 are induced to express a tagged polypeptide comprising SEQ ID NO:6 by adding IPTG to the media in which the host cells are grown. The tagged polypeptide is isolated from the host cells by affinity chromatography using nickel-NTA resin. In certain embodiments, the polyhistidine tag is removed from the isolated polypeptide by treatment with Factor Xa.
  • B. Use of Nucleic Acid Binding Polypeptides to Stabilize a DNA Duplex from Thermal Denaturation
  • The ability of a nucleic acid binding polypeptide to stabilize a DNA duplex from thermal denaturation is demonstrated by the following assay, which measures the increase in the Tm of a nucleic acid in the presence of a nucleic acid binding polypeptide. See, e.g., Baumann et al. (1994) Nature Struct. Biol. 1:808-819; and McAfee et al. (1995) Biochem. 34:10063-10077. Poly(dI-dC) at a concentration of about 70 μM (in nucleotides) is combined with a nucleic acid binding polypeptide at a concentration of about 350 μM in 5 mM Tris.Cl (pH 7.0). Poly(dI-dC) at a concentration of about 70 μM (in nucleotides) in 5 mM Tris.Cl (pH 7.0) without a nucleic acid binding polypeptide is used as a negative control. The absorbance of the poly(dI-dC) with and without a nucleic acid binding polypeptide is measured at 260 nm as a function of temperature using a spectrophotometer. The temperature is increased in steps, and absorbance is measured at each step. For each step, the temperature is raised by 1° C. over 30 seconds, followed by a holding time of 60 seconds prior to the measuring of absorbance. A melting curve is generated based on the increase in absorbance as a function of temperature. The Tm (temperature at which 50% of the poly(dI-dC) is denatured) occurs at the inflection point of the melting curve. The Tm of poly(dI-dC) in the negative control is subtracted from the Tm of poly(dI-dC) in the presence of a nucleic acid binding polypeptide to determine the increase in Tm due to the presence of the nucleic acid binding polypeptide.
  • C. Use of Nucleic Acid Binding Polypeptides to Increase Processivity of a DNA Polymerase
  • The ability of a nucleic acid binding polypeptide to increase the processivity of a DNA polymerase is assessed using a processivity assay based on that of Wang et al. (2004) Nuc. Acids Res. 32:1197-1207. A 5′ FAM-labeled primer of sequence 5′ gttttcccagtcacgacgttgtaaaacgacggcc 3′ (SEQ ID NO:29) is added to single stranded M13mp18 DNA in a reaction composition comprising 10 mM Tris-HCl pH 8.8, 50 mM KCl, 2.5 mM MgCl2, 250 μm dNTPs, and 0.1% Triton X-100. The concentrations of the primer and M13mp18 template are 50 nM and 80 nM, respectively. The primer is annealed to the single stranded M13mp18 DNA template by heating the mixture to 90° C. for 5 minutes, cooling to 72° C. at 0.1° C. per second, incubating at 72° C. for 10 minutes, and cooling to 4° C. at 0.1° C. per second. A thermostable DNA polymerase, such as Taq DNA polymerase, is added to the above reaction composition at a concentration of about 1:4000 (DNA polymerase:template).
  • Two parallel reactions are prepared. In one of the parallel reactions, a nucleic acid binding polypeptide is added to a final concentration of about 70 μg/ml in 20 μl of the above reaction composition. The second parallel reaction contains 20 μl of the above reaction composition with no added nucleic acid binding polypeptide.
  • DNA synthesis is initiated in the reaction compositions by incubating them at 72° C. Samples from each reaction are taken at various time points. The samples are diluted in gel loading dye, and the primer extension products in the samples are analyzed by denaturing polyacrylamide gel electrophoresis using an ABI 377 DNA Sequencer. The median product length is determined based on the integration of all detectable primer extension products. When the median product length does not change with an increase in reaction time or a decrease in polymerase concentration (to ensure that the template is in excess), that length is used as a measure of processivity.
  • D. Use of Nucleic Acid Binding Polypeptides to Increase the Efficiency (Speed and Specificity) of a Hybridization-Based Detection Assay
  • 1. Annealing Assay
  • The ability of a nucleic acid binding polypeptide to increase the specificity of a hybridization-based detection assay is measured using an annealing assay based on that of Guagliardi at al. (1997) J. Mol. Biol. 267:841-848. A first set of two reaction compositions is prepared as follows: In a first reaction composition, a single stranded M13mp18 circular DNA (0.05 μmol) is combined with an equal amount of 32P end-labeled oligonucleotide of sequence 5′-gtaaaacgacggccagt-3′ (SEQ ID NO:20) in a buffered reaction mixture (20 mM Tris-HCl pH 7.5, 2 mM DTT, 5 mM MgCl2, 100 μg/ml BSA). In a second reaction composition, single stranded M13mp18 circular DNA (0.05 pmol) is combined with an equal amount of 32P end-labeled oligonucleotide of sequence 5′-gtaaaacgtcggccagt-3′ (SEQ ID NO:21) in a buffered reaction mixture (20 mM Tris-HCl pH 7.5, 2 mM DTT, 5 mM MgCl2, 100 μg/ml BSA). The nucleotide indicated in bold is a mismatch with respect to the M13mp18 DNA sequence. A nucleic acid binding polypeptide is added separately to both reaction compositions at a final concentration of about 5 μg/ml.
  • A second set of two reaction compositions is prepared. The second set is the same as the first set of reaction compositions, except that a nucleic acid binding polypeptide is not added to either the first or second reaction composition of the second set of reaction compositions. The final volume of each reaction composition is 10 μl.
  • The reaction compositions are incubated at 60° C. for three minutes. The reactions are stopped by adding 1% SDS in standard loading dye to each reaction composition. The reactions are analyzed by 1.5% agarose gel electrophoresis followed by autoradiography to visualize annealed product, which can be distinguished from unannealed probe by its slower mobility. Annealed product is quantified for each reaction using standard densitometric methods. An increase in the amount of annealed product in the first reaction compared to the second reaction is determined for both sets of reactions. The ability of a nucleic acid binding polypeptide to increase the specificity of hybridization is demonstrated by a larger increase in the amount of annealed product for the first set of reactions compared to the second set of reactions.
  • 2. Microarray-Based Assay
  • The ability of a nucleic acid binding polypeptide to increase the speed and specificity of a hybridization-based detection assay is also demonstrated by a decrease in the hybridization time (approximately 16 hours) required to perform a typical microarray-based detection assay. A typical microarray-based detection assay may be performed, for example, using the Mouse Genome Survey Microarray system (Applied Biosystems, Foster City, Calif.; P/N 4345065). That system includes reagents, hybridization controls, and reference nucleic acids that can be used to detect selective hybridization of a reference nucleic acid to a probe (i.e., a portion of a mouse cDNA) immobilized on a microarray. In an exemplary assay, a nucleic acid binding polypeptide is added to the hybridization solution at a concentration of about 50 to 250 ug/m L. The hybridization time is from about 1 to 30 minutes at a temperature of about 45° C. to 75° C. The arrays are washed, and hybridization is detected using the Chemiluminescence Detection Kit (Applied Biosystems, Foster City, Calif., P/N 4342142) according to the manufacturer's instructions. The arrays are analyzed using the Applied Biosystems 1700 Chemiluminescent Microarray Analyzer (Applied Biosystems, Foster City, Calif., P/N 4338036).
  • E. Construction and Expression of Fusion Proteins Comprising a Nucleic Acid Binding Polypeptide and a Thermostable DNA Polymerase
  • 1. Fusion Proteins Comprising Pfu DNA Polymerase
  • A fusion protein comprising SEQ ID NO:1 joined to the C-terminus of full length Pfu DNA polymerase was constructed as follows. An NdeI-XhoI restriction fragment comprising the coding sequence of full length Pfu DNA polymerase in frame with SEQ ID NO:13 was cloned into the NdeI and XhoI sites of the pET16b vector (Novagen, Milwaukee, Wis.) using standard recombinant methods. The resulting recombinant vector (pDS2r) encodes a fusion protein comprising SEQ ID NO:1 joined to the C-terminus of Pfu DNA polymerase by a Gly-Thr-Gly-Gly-Gly-Gly (SEQ ID NO: 49) peptide linker. A 10× His affinity tag (SEQ ID NO: 48) is present at the N-terminus of the fusion protein. The polynucleotide sequence encoding the fusion protein is shown in SEQ ID NO:22. The amino acid sequence of the fusion protein is shown in SEQ ID NO:23.
  • The recombinant vector pDS2r was transformed into competent E. coli host cells. Host cells comprising pDS2r were induced to express the fusion protein of SEQ ID NO:23 by adding IPTG to the media in which the host cells were grown. The fusion protein was isolated from the host cells by affinity chromatography using nickel-NTA resin.
  • In certain embodiments, the polyhistidine tag is removed from the fusion protein of SEQ ID NO:23 by treatment with Factor Xa to yield the fusion protein shown in SEQ ID NO:24.
  • A fusion protein comprising SEQ ID NO:6 joined to the C-terminus of full length Pfu DNA polymerase was constructed as follows: An NdeI-XhoI restriction fragment comprising the coding sequence of full length Pfu DNA polymerase in frame with SEQ ID NO:19 was cloned into the NdeI and XhoI sites of the pET16b vector using standard recombinant methods. The resulting recombinant vector (pDS1r) encodes a fusion protein comprising SEQ ID NO:6 joined to the C-terminus of Pfu DNA polymerase by a Gly-Thr-Gly-Gly-Gly-Gly (SEQ ID NO: 49) peptide linker. A 10× His affinity tag (SEQ ID NO: 48) is present at the N-terminus of the fusion protein. The polynucleotide sequence encoding the fusion protein is shown in SEQ ID NO:25. The amino acid sequence of the fusion protein is shown in SEQ ID NO:26.
  • The recombinant vector pDS1 r was transformed into competent E. coli host cells. Host cells comprising pDS1 r were induced to express the fusion protein of SEQ ID NO:26 by adding IPTG to the media in which the host cells were grown. The fusion protein was isolated from the host cells by affinity chromatography using nickel-NTA resin.
  • In certain embodiments, the polyhistidine tag is removed from the fusion protein of SEQ ID NO:26 by treatment with Factor Xa to yield the fusion protein shown in SEQ ID NO:27.
  • 2. Fusion Proteins Comprising Taq DNA Polymerase
  • A fusion protein comprising SEQ ID NO:1 joined to the N-terminus of Taq DNA polymerase (SEQ ID NO:32 lacking the first two amino acid residues) was constructed as follows. SEQ ID NO:13 was cloned in frame at the 5′ end of a polynucleotide encoding Taq DNA polymerase in the pET16b vector. The resulting recombinant vector (pDS17-7) encodes a fusion protein comprising SEQ ID NO:1 joined to the N-terminus of Taq DNA polymerase by a Gly-Gly-Val-Thr-Ser (SEQ ID NO: 50) peptide linker. A 10× His affinity tag (SEQ ID NO: 48) is present at the N-terminus of the fusion protein. The polynucleotide sequence encoding the fusion protein is shown in SEQ ID NO:33. The amino acid sequence of the fusion protein is shown in SEQ ID NO:34. The recombinant vector pDS17-7 was transformed into competent host cells.
  • Expression of the fusion protein is induced in the host cells using IPTG. The fusion protein is isolated from the host cells by affinity chromatography using nickel-NTA resin. In certain embodiments, the polyhistidine tag is removed from the fusion protein of SEQ ID NO:34 by treatment with Factor Xa to yield the fusion protein shown in SEQ ID NO:35.
  • A fusion protein comprising SEQ ID NO:6 joined to the N-terminus of Taq DNA polymerase (SEQ ID NO:32 lacking the first two amino acid residues) was constructed as follows. SEQ ID NO:19 was cloned in frame at the 5′ end of a polynucleotide encoding Taq DNA polymerase in the pET16b vector. The resulting recombinant vector (pDS16-3) encodes a fusion protein comprising SEQ ID NO:6 joined to Taq DNA polymerase by a Gly-Gly-Val-Thr-Ser (SEQ ID NO: 50) peptide linker. A10× His affinity tag (SEQ ID NO: 48) is present at the N-terminus of the fusion protein. The polynucleotide sequence encoding the fusion protein is shown in SEQ ID NO:36. The amino acid sequence of the fusion protein is shown in SEQ ID NO:37. The recombinant vector pDS16-3 was transformed into competent host cells.
  • Expression of the fusion protein is induced in the host cells using IPTG. The fusion protein is isolated from the host cells by affinity chromatography using nickel-NTA resin. In certain embodiments, the polyhistidine tag is removed from the fusion protein of SEQ ID NO:37 by treatment with Factor Xa to yield the fusion protein shown in SEQ ID NO:38.
  • A fusion protein comprising SEQ ID NO:1 joined to the N-terminus of a Stoffel fragment of Taq DNA polymerase (amino acid residues 291-832 of SEQ ID NO:32) was constructed as follows. SEQ ID NO:13 was cloned in frame at the 5′ end of a polynucleotide encoding the Stoffel fragment in the pET16b vector. The resulting recombinant vector (pDS25-7) encodes a fusion protein comprising SEQ ID NO:1 joined to the N-terminus of the Stoffel fragment by a Gly-Gly-Val-Thr-Ser (SEQ ID NO: 50) peptide linker. A 10× His affinity tag (SEQ ID NO: 48) is present at the N-terminus of the fusion protein. The polynucleotide sequence encoding the fusion protein is shown in SEQ ID NO:39. The amino acid sequence of the fusion protein is shown in SEQ ID NO:40. The recombinant vector pDS25-7 was transformed into competent host cells.
  • Expression of the fusion protein is induced in the host cells using IPTG. The fusion protein is isolated from the host cells by affinity chromatography using nickel-NTA resin. In certain embodiments, the polyhistidine tag is removed from the fusion protein of SEQ ID NO:40 by treatment with Factor Xa to yield the fusion protein shown in SEQ ID NO:41.
  • A fusion protein comprising SEQ ID NO:6 joined to the N-terminus of a Stoffel fragment of Taq DNA polymerase (amino acid residues 291-832 of SEQ ID NO:32) was constructed as follows. SEQ ID NO:19 was cloned in frame at the 5′ end of a polynucleotide encoding the Stoffel fragment in the pET16b vector. The resulting recombinant vector (pDS24-4) encodes a fusion protein comprising SEQ ID NO:6 joined to the N-terminus of the Stoffel fragment by a Gly-Gly-Val-Thr-Ser (SEQ ID NO: 50) peptide linker. A 10× His affinity tag (SEQ ID NO: 48) is present at the N-terminus of the fusion protein. The polynucleotide sequence encoding the fusion protein is shown in SEQ ID NO:42. The amino acid sequence of the fusion protein is shown in SEQ ID NO:43. The recombinant vector pDS24-4 was transformed into competent host cells.
  • Expression of the fusion protein is induced in the host cells using IPTG. The fusion protein is isolated from the host cells by affinity chromatography using nickel-NTA resin. In certain embodiments, the polyhistidine tag is removed from the fusion protein of SEQ ID NO:43 by treatment with Factor Xa to yield the fusion protein shown in SEQ ID NO:44.
  • F. Use of Fusion Proteins to Increase Processivity of Taq DNA Polymerase
  • The increase in processivity of a fusion protein comprising Taq DNA polymerase relative to Taq DNA polymerase alone is assessed using a processivity assay based on that of Wang et al. (2004) Nuc. Acids Res. 32:1197-1207. A 5′ FAM-labeled primer of sequence 5′ gttttcccagtcacgacgttgtaaaacgacggcc 3′ (SEQ ID NO:29) is added to single stranded M13mp18 DNA in a mixture comprising 10 mM Tris-HCl pH 8.8, 50 mM KCl, 2.5 mM MgCl2, 250 μm dNTPs, and 0.1% Triton X-100. The concentrations of the primer and M13mp18 template are 50 nM and 80 nM, respectively. The primer is annealed to the single stranded M13mp18 DNA template by heating the mixture to 96° C. for 5 minutes, cooling to 72° C. at 0.1° C. per second, incubating at 72° C. for 10 minutes, and cooling to 4° C. at 0.1° C. per second.
  • A reaction composition is prepared in which a fusion protein comprising Taq DNA polymerase is added at a molar concentration of about 1:4000 (fusion protein:template) to 20 μl of the above mixture. A control reaction composition is prepared in which Taq DNA polymerase is added at a molar concentration of about 1:4000 (DNA polymerase:template) to 20 μl of the above mixture. DNA synthesis is initiated in the reaction compositions by incubating them at 72° C. Samples from each reaction are taken at various time points. The samples are diluted in gel loading dye, and the primer extension products are analyzed by denaturing polyacrylamide gel electrophoresis using an ABI 377 DNA Sequencer. The median product length is determined based on the integration of all detectable primer extension products. When the median product length does not change with an increase in reaction time or a decrease in polymerase concentration, that length is used as a measure of processivity.
  • G. Use of Fusion Proteins to Increase Processivity of Pfu DNA Polymerase
  • The increase in processivity of a fusion protein comprising Pfu DNA polymerase relative to Pfu DNA polymerase alone is assessed using a processivity assay based on that of Wang et al. (2004) Nuc. Acids Res. 32:1197-1207. A 5′ FAM-labeled primer of sequence 5′ gttttcccagtcacgacgttgtaaaacgacggcc 3′ (SEQ ID NO:29) is added to single stranded M13mp18 DNA in a mixture comprising 10 mM Tris-HCl pH 8.8, 50 mM KCl, 2.5 mM MgCl2, 250 μm dNTPs, and 0.1% Triton X-100. The concentrations of the primer and M13mp18 template are 50 nM and 80 nM, respectively. The primer is annealed to the single stranded M13mp18 DNA template by heating the mixture to 90° C. for 5 minutes, cooling to 72° C. at 0.1° C. per second, incubating at 72° C. for 10 minutes, and cooling to 4° C. at 0.1° C. per second.
  • A reaction composition is prepared in which a fusion protein comprising Pfu DNA polymerase is added at a molar concentration of about 1:4000 (fusion protein:template) to 20 μl of the above mixture. A control reaction composition is prepared in which Pfu DNA polymerase is added at a molar concentration of about 1:4000 (DNA polymerase:template) to 20 μl of the above mixture. DNA synthesis is initiated in the reaction compositions by incubating them at 72° C. Samples from each reaction are taken at various time points. The samples are diluted in gel loading dye, and the primer extension products are analyzed by denaturing polyacrylamide gel electrophoresis using an ABI 377 DNA sequencer. The median product length is determined based on the integration of all detectable primer extension products. When the median product length does not change with an increase in reaction time or a decrease in polymerase concentration, that length is used as a measure of processivity.
  • One skilled in the art will readily recognize that the above assay may be modified so as to assess the processivity of a fusion protein comprising a DNA polymerase other than Taq or Pfu.
  • H. Use of Fusion Proteins in PCR
  • The ability of a fusion protein comprising a nucleic acid binding polypeptide and a thermostable DNA polymerase (e.g., Taq or Pfu) to increase the efficiency of PCR is demonstrated using a typical PCR reaction. An exemplary PCR reaction is prepared which contains PCR buffer (1×), dNTPs (200 μM each), template DNA (250 ng), forward and reverse primers (0.25 μM each) and fusion protein (about 0.5 to 2.5 units) in a final volume of 50 μl. As a control reaction, thermostable DNA polymerase alone is used in place of the fusion protein. The primers used in the PCR reaction are tPAF7 (5′-ggaagtacagctcagagttctgcagcacccctgc-3′ (SEQ ID NO:45)) and tPAR10 (5′-gatgcgaaactgaggctggctgtactgtctc-3′ (SEQ ID NO:46)). The template DNA is human genomic DNA (Roche, Indianapolis, Ind., P/N 1-691-112). The primers tPAF7 and tPAR10 amplify a product of approximately 5 kb from human genomic DNA. If the fusion protein being used in the PCR reaction comprises Pfu DNA polymerase, then the standard PCR buffer for Pfu (Stratagene; La. Jolla, Calif.) is used, except that the KCl concentration is elevated. The final working concentration (1×) of the buffer thus contains 20 mM Tris, pH 8:8; 10 mM (NH4)2SO4, 0.1% Triton X-100, 2 mM MgSO4, 100 μg/mL BSA and 60 mM KCl. If the fusion protein being used in the PCR reaction comprises Taq DNA polymerase, the standard PCR buffer for Taq (Applied Biosystems, Foster City, Calif.) is used. Cycling is performed as follows:
  • initial denaturation (98° C., 30 sec);
    denaturation (98° C., 10 sec);
    annealing (65° C., 10 sec); and {close oversize bracket} 29 cycles
    extension (72° C., 2 min);
    and final extension (72° C., 10 min).
  • An aliquot of the reaction is analyzed by agarose gel electrophoresis along with an appropriate size standard, stained with ethidium bromide, and then visualized by fluorescence.
  • TABLE OF SEQUENCES
    SEQ
    ID
    NO: Brief Description Sequence
    1 Pae3192 MSKKQKLKFYDIKAKQAFETDQYEVIEKQTARGPMMFAVAKSPYTGIKVYRLLGKKK
    (protein)
    2 PAE3192(ORF) atgtccaaga agcagaaact aaagttctac gacataaagg cgaagcaggc
    gtttgagact gaccagtacg aggttattga gaagcagact gcccgcggtc
    cgatgatgtt cgccgtggcc aaatcgccgt acaccggcat aaaagtatac
    agactgttag gcaagaagaa ataa
    3 PAE3289 (ORF) atgtccaaga agcagaaact aaagttctac gacataaagg cgaagcaggc
    gtttgagact gaccagtacg aggttattga gaagcagact gcccgcggtc
    cgatgatgtt cgccgtggcc aaatcgccgt acaccggcat aaaagtatac
    agactattag gcaagaagaa ataa
    4 Pae0384 MAKQKLKFYDIKAKQSFETDKYEVIEKETARGPMLFAVATSPYTGIKVYRLLGKKK
    (protein)
    5 PAE0384 (ORF) atggccaaac aaaaactaaa gttctacgac ataaaagcga aacagtcctt
    cgaaacggac aaatacgagg tcattgagaa agagacggcc cgcgggccga
    tgttatttgc agtggcaacc tcgccgtaca ctggcataaa ggtgtacaga
    ctgttaggca agaagaaata a
    6 Ape3192 MPKKEKIKFFDLVAKKYYETDNYEVEIKETKRGKFRFAKAKSPYTGKIFYRVLGKA
    7 APE3192 (ORF) atgcccaaga aggagaagat aaagttcttc gacctagtcg ccaagaagta
    ctacgagact gacaactacg aagtcgagat aaaggagact aagaggggca
    agtttaggtt cgccaaagcc aagagcccgt acaccggcaa gatcttctat
    agagtgctag gcaaagccta g
    8 p3192-a atgtccaaga agcagaaact gaagttctac gacattaagg cgaagcaggc gtttgag
    9 p3192-b accgaccagt acgaggttat tgagaagcag accgcccgcg gtccgatgat gttcgcc
    10 p3192-c gtggccaaat cgccgtacac cggcattaaa gtgtaccgcc tgttaggcaa
    gaagaaataa
    11 p3192-y gtactggtcg gtctcaaacg cctg
    12 p3192-z cgatttggcc acggcgaaca tcat
    13 8, 9, and 10 atgtccaaga agcagaaact gaagttctac gacattaagg cgaagcaggc
    assembled gtttgagacc gaccagtacg aggttattga gaagcagacc gcccgcggtc
    cgatgatgtt cgccgtggcc aaatcgccgt acaccggcat taaagtgtac
    cgcctgttag gcaagaagaa ataa
    14 ap3192-a atgccgaaga aggagaagat taagttcttc gacctggtcg ccaagaagta ctacgag
    15 ap3192-b actgacaact acgaagtcga gattaaggag actaagcgcg gcaagtttcg cttcgcc
    16 ap3192-c aaagccaaga gcccgtacac cggcaagatc ttctatcgcg tgctgggcaa agcctag
    17 ap3192-y gtagttgtca gtctcgtagt actt
    18 ap3192-z gctcttggct ttggcgaagc gaaa
    19 14, 15, and 16 atgccgaaga aggagaagat taagttcttc gacctggtcg ccaagaagta
    assembled ctacgagact gacaactacg aagtcgagat taaggagact aagcgcggca
    agtttcgctt cgccaaagcc aagagcccgt acaccggcaa gatcttctat
    cgcgtgctgg gcaaagccta g
    20 oligo for gtaaaacgac ggccagt
    annealing assay
    21 mismatch oligo gtaaaacgtc ggccagt
    for annealing
    assay
    22 polynucleotide ccatgggccatcatcatcatcatcatcatcatcatcacagcagcggccatatcgaaggtc
    encoding gtcatatgattttagatgtggattacataactgaagaaggaaaacctgttattaggctat
    Pfu: Pae3192 tcaaaaaagagaacggaaaatttaagatagagcatgatagaacttttagaccatacattt
    fusion protein acgctcttctcagggatgattcaaagattgaagaagttaagaaaataacgggggaaaggc
    (including poly- atggaaagattgtgagaattgttgatgtagagaaggttgagaaaaagtttctcggcaagc
    His tag) ctattaccgtgtggaaactttatttggaacatccccaagatgttcccactattagagaaa
    aagttagagaacatccagcagttgtggacatcttcgaatacgatattccatttgcaaaga
    gatacctcatcgacaaaggcctaataccaatggagggggaagaagagctaaagattcttg
    ccttcgatatagaaaccctctatcacgaaggagaagagtttggaaaaggcccaattataa
    tgattagttatgcagatgaaaatgaagcaaaggtgattacttggaaaaacatagatcttc
    catacgttgaggttgtatcaagcgagagagagatgataaagagatttctcaggattatca
    gggagaaggatcctgacattatagttacttataatggagactcattcgacttcccatatt
    tagcgaaaagggcagaaaaacttgggattaaattaaccattggaagagatggaagcgagc
    ccaagatgcagagaataggcgatatgacggctgtagaagtcaagggaagaatacatttcg
    acttgtatcatgtaataacaaggacaataaatctcccaacatacacactagaggctgtat
    atgaagcaatttttggaaagccaaaggagaaggtatacgccgacgagatagcaaaagcct
    gggaaagtggagagaaccttgagagagttgccaaatactcgatggaagatgcaaaggcaa
    cttatgaactcgggaaagaattccttccaatggaaattcagctttcaagattagttggac
    aacctttatgggatgtttcaaggtcaagcacagggaaccttgtagagtggttcttactta
    ggaaagcctacgaaagaaacgaagtagctccaaacaagccaagtgaagaggagtatcaaa
    gaaggctcagggagagctacacaggtggattcgttaaagagccagaaaaggggttgtggg
    aaaacatagtatacctagattttagagccctatatccctcgattataattacccacaatg
    tttctcccgatactctaaatcttgagggatgcaagaactatgatatcgctcctcaagtag
    gccacaagttctgcaaggacatccctggttttataccaagtctcttgggacatttgttag
    aggaaagacaaaagattaagacaaaaatgaaggaaactcaagatcctatagaaaaaatac
    tccttgactatagacaaaaagcgataaaactcttagcaaattctttctacggatattatg
    gctatgcaaaagcaagatggtactgtaaggagtgtgctgagagcgttactgcctggggaa
    gaaagtacatcgagttagtatggaaggagctcgaagaaaagtttggatttaaagtcctct
    acattgacactgatggtctctatgcaactatcccaggaggagaaagtgaggaaataaaga
    aaaaggctctagaatttgtaaaatacataaattcaaagctccctggactgctagagcttg
    aatatgaagggttttataagaggggattcttcgttacgaagaagaggtatgcagtaatag
    atgaagaaggaaaagtcattactcgtggtttagagatagttaggagagattggagtgaaa
    ttgcaaaagaaactcaagctagagttttggagacaatactaaaacacggagatgttgaag
    aagctgtgagaatagtaaaagaagtaatacaaaagcttgccaattatgaaattccaccag
    agaagctcgcaatatatgagcagataacaagaccattacatgagtataaggcgataggtc
    ctcacgtagctgttgcaaagaaactagctgctaaaggagttaaaataaagccaggaatgg
    taattggatacatagtacttagaggcgatggtccaattagcaatagggcaattctagctg
    aggaatacgatcccaaaaagcacaagtatgacgcagaatattacattgagaaccaggttc
    ttccagcggtacttaggatattggagggatttggatacagaaaggaagacctcagatacc
    aaaagacaagacaagtcggcctaacttcctggcttaacattaaaaaatccggtaccggcg
    gtggcggtatgtccaagaagcagaaactgaagttctacgacattaaggcgaagcaggcgt
    ttgagaccgaccagtacgaggttattgagaagcagaccgcccgcggtccgatgatgttcg
    ccgtggccaaatcgccgtacaccggcattaaagtgtaccgcctgttaggcaagaagaaat
    aactcgag
    23 amino acid MGHHHHHHHHHHSSGHIEGRHMILDVDYITEEGKPVIRLFKKENGKFKIEHDRTFRPYIY
    sequence of ALLRDDSKIEEVKKITGERHGKIVRIVDVEKVEKKFLGKPITVWKLYLEHPQDVPTIREK
    Pfu: Ape3192 VREHPAVVDIFEYDIPFAKRYLIDKGLIPMEGEEELKILAFDIETLYHEGEEFGKGPIIM
    fusion protein ISYADENEAKVITWKNIDLPYVEVVSSEREMIKRFLRIIREKDPDIIVTYNGDSFDFPYL
    (including poly- AKRAEKLGIKLTIGRDGSEPKMQRIGDMTAVEVKGRIHFDLYHVITRTINLPTYTLEAVY
    His tag) EAIFGKPKEKVYADEIAKAWESGENLERVAKYSMEDAKATYELGKEFLPMEIQLSRLVGQ
    PLWDVSRSSTGNLVEWFLLRKAYERNEVAPNKPSEEEYQRRLRESYTGGFVKEPEKGLWE
    NIVYLDFRALYPSIIITHNVSPDTLNLEGCKNYDIAPQVGHKFCKDIPGFIPSLLGHLLE
    ERQKIKTKMKETQDPIEKILLDYRQKAIKLLANSFYGYYGYAKARWYCKECAESVTAWGR
    KYIELVWKELEEKFGFKVLYIDTDGLYATIPGGESEEIKKKALEFVKYINSKLPGLLELE
    YEGFYKRGFFVTKKRYAVIDEEGKVITRGLEIVRRDWSEIAKETQARVLETILKHGDVEE
    AVRIVKEVIQKLANYEIPPEKLAIYEQITRPLHEYKAIGPHVAVAKKLAAKGVKIKPGMV
    IGYIVLRGDGPISNRAILAEEYDPKKHKYDAEYYIENQVLPAVLRILEGFGYRKEDLRYQ
    KTRQVGLTSWLNIKKSGTGGGGMSKKQKLKFYDIKAKQAFETDQYEVIEKQTARGPMMFA
    VAKSPYTGIKVYRLLGKKK
    24 amino acid HMILDVDYITEEGKPVIRLFKKENGKFKIEHDRTFRPYIY
    sequence of ALLRDDSKIEEVKKITGERHGKIVRIVDVEKVEKKFLGKPITVWKLYLEHPQDVPTIREK
    Pfu: Pae3192 VREHPAVVDIFEYDIPFAKRYLIDKGLIPMEGEEELKILAFDIETLYHEGEEFGKGPIIM
    fusion protein ISYADENEAKVITWKNIDLPYVEVVSSEREMIKRFLRIIREKDPDIIVTYNGDSFDFPYL
    (after removal of AKRAEKLGIKLTIGRDGSEPKMQRIGDMTAVEVKGRIHFDLYHVITRTINLPTYTLEAVY
    poly-His tag) EAIFGKPKEKVYADEIAKAWESGENLERVAKYSMEDAKATYELGKEFLPMEIQLSRLVGQ
    PLWDVSRSSTGNLVEWFLLRKAYERNEVAPNKPSEEEYQRRLRESYTGGFVKEPEKGLWE
    NIVYLDFRALYPSIIITHNVSPDTLNLEGCKNYDIAPQVGHKFCKDIPGFIPSLLGHLLE
    ERQKIKTKMKETQDPIEKILLDYRQKAIKLLANSFYGYYGYAKARWYCKECAESVTAWGR
    KYIELVWKELEEKFGFKVLYIDTDGLYATIPGGESEEIKKKALEFVKYINSKLPGLLELE
    YEGFYKRGFFVTKKRYAVIDEEGKVITRGLEIVRRDWSEIAKETQARVLETILKHGDVEE
    AVRIVKEVIQKLANYEIPPEKLAIYEQITRPLHEYKAIGPHVAVAKKLAAKGVKIKPGMV
    IGYIVLRGDGPISNRAILAEEYDPKKHKYDAEYYIENQVLPAVLRILEGFGYRKEDLRYQ
    KTRQVGLTSWLNIKKSGTGGGGMSKKQKLKFYDIKAKQAFETDQYEVIEKQTARGPMMFA
    VAKSPYTGIKVYRLLGKKK
    25 polynucleotide ccatgggccatcatcatcatcatcatcatcatcatcacagcagcggccatatcgaaggtc
    encoding gtcatatgattttagatgtggattacataactgaagaaggaaaacctgttattaggctat
    Pfu: Ape3192 tcaaaaaagagaacggaaaatttaagatagagcatgatagaacttttagaccatacattt
    fusion protein acgctcttctcagggatgattcaaagattgaagaagttaagaaaataacgggggaaaggc
    (including poly- atggaaagattgtgagaattgttgatgtagagaaggttgagaaaaagtttctcggcaagc
    His tag) ctattaccgtgtggaaactttatttggaacatccccaagatgttcccactattagagaaa
    aagttagagaacatccagcagttgtggacatcttcgaatacgatattccatttgcaaaga
    gatacctcatcgacaaaggcctaataccaatggagggggaagaagagctaaagattcttg
    ccttcgatatagaaaccctctatcacgaaggagaagagtttggaaaaggcccaattataa
    tgattagttatgcagatgaaaatgaagcaaaggtgattacttggaaaaacatagatcttc
    catacgttgaggttgtatcaagcgagagagagatgataaagagatttctcaggattatca
    gggagaaggatcctgacattatagttacttataatggagactcattcgacttcccatatt
    tagcgaaaagggcagaaaaacttgggattaaattaaccattggaagagatggaagcgagc
    ccaagatgcagagaataggcgatatgacggctgtagaagtcaagggaagaatacatttcg
    acttgtatcatgtaataacaaggacaataaatctcccaacatacacactagaggctgtat
    atgaagcaatttttggaaagccaaaggagaaggtatacgccgacgagatagcaaaagcct
    gggaaagtggagagaaccttgagagagttgccaaatactcgatggaagatgcaaaggcaa
    cttatgaactcgggaaagaattccttccaatggaaattcagctttcaagattagttggac
    aacctttatgggatgtttcaaggtcaagcacagggaaccttgtagagtggttcttactta
    ggaaagcctacgaaagaaacgaagtagctccaaacaagccaagtgaagaggagtatcaaa
    gaaggctcagggagagctacacaggtggattcgttaaagagccagaaaaggggttgtggg
    aaaacatagtatacctagattttagagccctatatccctcgattataattacccacaatg
    tttctcccgatactctaaatcttgagggatgcaagaactatgatatcgctcctcaagtag
    gccacaagttctgcaaggacatccctggttttataccaagtctcttgggacatttgttag
    aggaaagacaaaagattaagacaaaaatgaaggaaactcaagatcctatagaaaaaatac
    tccttgactatagacaaaaagcgataaaactcttagcaaattctttctacggatattatg
    gctatgcaaaagcaagatggtactgtaaggagtgtgctgagagcgttactgcctggggaa
    gaaagtacatcgagttagtatggaaggagctcgaagaaaagtttggatttaaagtcctct
    acattgacactgatggtctctatgcaactatcccaggaggagaaagtgaggaaataaaga
    aaaaggctctagaatttgtaaaatacataaattcaaagctccctggactgctagagcttg
    aatatgaagggttttataagaggggattcttcgttacgaagaagaggtatgcagtaatag
    atgaagaaggaaaagtcattactcgtggtttagagatagttaggagagattggagtgaaa
    ttgcaaaagaaactcaagctagagttttggagacaatactaaaacacggagatgttgaag
    aagctgtgagaatagtaaaagaagtaatacaaaagcttgccaattatgaaattccaccag
    agaagctcgcaatatatgagcagataacaagaccattacatgagtataaggcgataggtc
    ctcacgtagctgttgcaaagaaactagctgctaaaggagttaaaataaagccaggaatgg
    taattggatacatagtacttagaggcgatggtccaattagcaatagggcaattctagctg
    aggaatacgatcccaaaaagcacaagtatgacgcagaatattacattgagaaccaggttc
    ttccagcggtacttaggatattggagggatttggatacagaaaggaagacctcagatacc
    aaaagacaagacaagtcggcctaacttcctggcttaacattaaaaaatccggtaccggcg
    gtggcggtccgaagaaggagaagattaggttcttcgacctggtcgccaagaagtactacg
    agactgacaactacgaagtcgagattaaggagactaagcgcggcaagtttcgcttcgcca
    aagccaagagcccgtacaccggcaagatcttctatcgcgtgctgggcaaagcctaactcgag
    26 amino acid MGHHHHHHHHHHSSGHIEGRHMILDVDYITEEGKPVIRLFKKENGKFKIEHDRTFRPYIY
    sequence of ALLRDDSKIEEVKKITGERHGKIVRIVDVEKVEKKFLGKPITVWKLYLEHPQDVPTIREK
    Pfu: Ape3192 VREHPAVVDIFEYDIPFAKRYLIDKGLIPMEGEEELKILAFDIETLYHEGEEFGKGPIIM
    fusion protein ISYADENEAKVITWKNIDLPYVEVVSSEREMIKRFLRIIREKDPDIIVTYNGDSFDFPYL
    (including poly- AKRAEKLGIKLTIGRDGSEPKMQRIGDMTAVEVKGRIHFDLYHVITRTINLPTYTLEAVY
    His tag) EAIFGKPKEKVYADEIAKAWESGENLERVAKYSMEDAKATYELGKEFLPMEIQLSRLVGQ
    PLWDVSRSSTGNLVEWFLLRKAYERNEVAPNKPSEEEYQRRLRESYTGGFVKEPEKGLWE
    NIVYLDFRALYPSIIITHNVSPDTLNLEGCKNYDIAPQVGHKFCKDIPGFIPSLLGHLLE
    ERQKIKTKMKETQDPIEKILLDYRQKAIKLLANSFYGYYGYAKARWYCKECAESVTAWGR
    KYIELVWKELEEKFGEKVLYIDTDGLYATIPGGESEEIKKKALEFVKYINSKLPGLLELE
    YEGFYKRGFFVTKKRYAVIDEEGKVITRGLEIVRRDWSEIAKETQARVLETILKHGDVEE
    AVRIVKEVIQKLANYEIPPEKLAIYEQITRPLHEYKAIGPHVAVAKKLAAKGVKIKPGMV
    IGYIVLRGDGPISNRAILAEEYDPKKHKYDAEYYIENQVLPAVLRILEGFGYRKEDLRYQ
    KTRQVGLTSWLNIKKSGTGGGGPKKEKIRFFDLVAKKYYETDNYEVEIKETKRGKFRFAK
    AKSPYTGKIFYRVLGKA
    27 amino acid HMILDVDYITEEGKPVIRLFKKENGKFKIEHDRTFRPYIY
    sequence of ALLRDDSKIEEVKKITGERHGKIVRIVDVEKVEKKFLGKPITVWKLYLEHPQDVPTIREK
    Pfu: Ape3192 VREHPAVVDIFEYDIPFAKRYLIDKGLIPMEGEEELKILAFDIETLYHEGEEFGKGPIIM
    fusion protein ISYADENEAKVITWKNIDLPYVEVVSSEREMIKRFLRIIREKDPDIIVTYNGDSFDFPYL
    (after removal of AKRAEKLGIKLTIGRDGSEPKMQRIGDMTAVEVKGRIHFDLYHVITRTINLPTYTLEAVY
    tag) EAIFGKPKEKVYADEIAKAWESGENLERVAKYSMEDAKATYELGKEFLPMEIQLSRLVGQ
    PLWDVSRSSTGNLVEWFLLRKAYERNEVAPNKPSEEEYQRRLRESYTGGFVKEPEKGLWE
    NIVYLDFRALYPSIIITHNVSPDTLNLEGCKNYDIAPQVGHKFCKDIPGFIPSLLGHLLE
    ERQKIKTKMKETQDPIEKILLDYRQKAIKLLANSFYGYYGYAKARWYCKECAESVTAWGR
    KYIELVWKELEEKFGFKVLYIDTDGLYATIPGGESEEIKKKALEFVKYINSKLPGLLELE
    YEGFYKRGFFVTKKRYAVIDEEGKVITRGLEIVRRDWSEIAKETQARVLETILKHGDVEE
    AVRIVKEVIQKLANYEIPPEKLAIYEQITRPLHEYKAIGPHVAVAKKLAAKGVKIKPGMV
    IGYIVLRGDGPISNRAILAEEYDPKKHKYDAEYYIENQVLPAVLRILEGFGYRKEDLRYQ
    KTRQVGLTSWLNIKKSGTGGGGPKKEKIRFFDLVAKKYYETDNYEVEIKETKRGKFRFAK
    AKSPYTGKIFYRVLGKA
    28 Pae/Ape KXKXKFXDXXAKXXXETDXYEVXXKXTXRGXXXFAXAKSPYTGXXXYRXLGK
    consensus
    sequence
    29 oligo for gttttcccagtcacgacgttgtaaaacgacggcc
    processivity
    assay
    30 Sso7d MATVKFKYKGEEKQVDISKIKKVWRVGKMISFTYDEGGGKTGRGAVSEKDAPKELLQMLEKQKK
    31 Pfu DNA MILDVDYITEEGKPVIRLFKKENGKFKIEHDRTFRPYIY
    polymerase ALLRDDSKIEEVKKITGERHGKIVRIVDVEKVEKKFLGKPITVWKLYLEHPQDVPTIREK
    VREHPAVVDIFEYDIPFAKRYLIDKGLIPMEGEEELKILAFDIETLYHEGEEFGKGPIIM
    ISYADENEAKVITWKNIDLPYVEVVSSEREMIKRFLRIIREKDPDIIVTYNGDSFDFPYL
    AKRAEKLGIKLTIGRDGSEPKMQRIGDMTAVEVKGRIHFDLYHVITRTINLPTYTLEAVY
    EAIFGKPKEKVYADEIAKAWESGENLERVAKYSMEDAKATYELGKEFLPMEIQLSRLVGQ
    PLWDVSRSSTGNLVEWFLLRKAYERNEVAPNKPSEEEYQRRLRESYTGGFVKEPEKGLWE
    NIVYLDFRALYPSIIITHNVSPDTLNLEGCKNYDIAPQVGHKFCKDIPGFIPSLLGHLLE
    ERQKIKTKMKETQDPIEKILLDYRQKAIKLLANSFYGYYGYAKARWYCKECAESVTAWGR
    KYIELVWKELEEKFGFKVLYIDTDGLYATIPGGESEEIKKKALEFVKYINSKLPGLLELE
    YEGFYKRGFFVTKKRYAVIDEEGKVITRGLEIVRRDWSEIAKETQARVLETILKHGDVEE
    AVRIVKEVIQKLANYEIPPEKLAIYEQITRPLHEYKAIGPHVAVAKKLAAKGVKIKPGMV
    IGYIVLRGDGPISNRAILAEEYDPKKHKYDAEYYIENQVLPAVLRILEGFGYRKEDLRYQ
    KTRQVGLTSWLNIKKS
    32 Taq DNA MRGMLPLFEPKGRVLLVDGHHLAYRTFHALKGLTTSRGEPVQAVYGFAKSLLKALKEDG
    polymerase DAVIVVFDAKAPSFRHEAYGGYKAGRAPTPEDFPRQLALIKELVDLLGLARLEVPGYEA
    DDVLASLAKKAEKEGYEVRILTADKDLYQLLSDRIHVLHPEGYLITPAWLWEKYGLRPD
    QWADYRALTGDESDNLPGVKGIGEKTARKLLEEWGSLEALLKNLDRLKPAIREKILAHM
    DDLKLSWDLAKVRTDLPLEVDFAKRREPDRERLRAFLERLEFGSLLHEFGLLESPKALE
    EAPWPPPEGAFVGFVLSRKEPMWADLLALAAARGGRVHRAPEPYKALRDLKEARGLLAK
    DLSVLALREGLGLPPGDDPMLLAYLLDPSNTTPEGVARRYGGEWTEEAGERAALSERLF
    ANLWGRLEGEERLLWLYREVERPLSAVLAHMEATGVRLDVAYLRALSLEVAEEIARLEA
    EVFRLAGHPFNLNSRDQLERVLFDELGLPAIGKTEKTGKRSTSAAVLEALREAHPIVEK
    ILQYRELTKLKSTYIDPLPDLIHPRTGRLHTRFNQTATATGRLSSSDPNLQNIPVRTPL
    GQRIRRAFIAEEGWLLVALDYSQIELRVLAHLSGDENLIRVFQEGRDIHTETASWMFGV
    PREAVDPLMRRAAKTINFGVLYGMSAHRLSQELAIPYEEAQAFIERYFQSFPKVRAWIE
    KTLEEGRRRGYVETLFGRRRYVPDLEARVKSVREAAERMAFNMPVQGTAADLMKLAMVK
    LFPRLEEMGARMLLQVHDELVLEAPKERAEAVARLAKEVMEGVYPLAVPLEVEVGIGED
    WLSAKE
    33 polynucleotide ATGGGCCATCATCATCATCATCATCATCATCATCACAGCAGCGGCCATATCGAAGGTCG
    encoding TCATATGTCCAAGAAGCAGAAACTGAAGTTCTACGACATTAAGGCGAAGCAGGCGTTTG
    Pae3192: Taq AGACCGACCAGTACGAGGTTATTGAGAAGCAGACCGCCCGCGGTCCGATGATGTTCGCC
    fusion protein GTGGCCAAATCGCCGTACACCGGCATTAAAGTGTACCGCCTGTTAGGCAAGAAGAAAGG
    (including poly- CGGCGGTGTCACTAGTGGGATGCTGCCCCTCTTTGAGCCCAAGGGCCGGGTCCTCCTGG
    His tag) TGGACGGCCACCACCTGGCCTACCGGACCTTCCACGCCCTGAAGGGCCTCACCACCAGC
    CGGGGGGAGCCGGTGCAGGCGGTCTACGGCTTCGCCAAGAGCCTCCTCAAGGCCCTCAA
    GGAGGACGGGGACGCGGTGATCGTGGTCTTTGACGCCAAGGCCCCCTCCTTCCGCCACG
    AGGCCTACGGGGGGTACAAGGCGGGCCGGGCCCCCACGCCGGAGGACTTTCCCCGGCAA
    CTCGCCCTCATCAAGGAGCTGGTGGACCTCCTGGGGCTGGCGCGCCTCGAGGTCCCGGG
    CTACGAGGCGGACGACGTCCTGGCCAGCCTGGCCAAGAAGGCGGAAAAGGAGGGCTACG
    AGGTCCGCATCCTCACCGCCGACAAAGACCTTTACCAGCTCCTTTCCGACCGCATCCAC
    GTCCTCCACCCCGAGGGGTACCTCATCACCCCGGCCTGGCTTTGGGAAAAGTACGGCCT
    GAGGCCCGACCAGTGGGCCGACTACCGGGCCCTGACCGGGGACGAGTCCGACAACCTTC
    CCGGGGTCAAGGGCATCGGGGAGAAGACGGCGAGGAAGCTTCTGGAGGAGTGGGGGAGC
    CTGGAAGCCCTCCTCAAGAACCTGGACCGGCTGAAGCCCGCCATCCGGGAGAAGATCCT
    GGCCCACATGGACGATCTGAAGCTCTCCTGGGACCTGGCCAAGGTGCGCACCGACCTGC
    CCCTGGAGGTGGACTTCGCCAAAAGGCGGGAGCCCGACCGGGAGAGGCTTAGGGCCTTT
    CTGGAGAGGCTTGAGTTTGGCAGCCTCCTCCACGAGTTCGGCCTTCTGGAAAGCCCCAA
    GGCCCTGGAGGAGGCCCCCTGGCCCCCGCCGGAAGGGGCCTTCGTGGGCTTTGTGCTTT
    CCCGCAAGGAGCCCATGTGGGCCGATCTTCTGGCCCTGGCCGCCGCCAGGGGGGGCCGG
    GTCCACCGGGCCCCCGAGCCTTATAAAGCCCTCAGGGACCTGAAGGAGGCGCGGGGGCT
    TCTCGCCAAAGACCTGAGCGTTCTGGCCCTGAGGGAAGGCCTTGGCCTCCCGCCCGGCG
    ACGACCCCATGCTCCTCGCCTACCTCCTGGACCCTTCCAACACCACCCCCGAGGGGGTG
    GCCCGGCGCTACGGCGGGGAGTGGACGGAGGAGGCGGGGGAGCGGGCCGCCCTTTCCGA
    GAGGCTCTTCGCCAACCTGTGGGGGAGGCTTGAGGGGGAGGAGAGGCTCCTTTGGCTTT
    ACCGGGAGGTGGAGAGGCCCCTTTCCGCTGTCCTGGCCCACATGGAGGCCACGGGGGTG
    CGCCTGGACGTGGCCTATCTCAGGGCCTTGTCCCTGGAGGTGGCCGAGGAGATCGCCCG
    CCTCGAGGCCGAGGTCTTCCGCCTGGCCGGCCACCCCTTCAACCTCAACTCCCGGGACC
    AGCTGGAAAGGGTCCTCTTTGACGAGCTAGGGCTTCCCGCCATCGGCAAGACGGAGAAG
    ACCGGCAAGCGCTCCACCAGCGCCGCCGTCCTGGAGGCCCTCCGCGAGGCCCACCCCAT
    CGTGGAGAAGATCCTGCAGTACCGGGAGCTCACCAAGCTGAAGAGCACCTACATTGACC
    CCTTGCCGGACCTCATCCACCCCAGGACGGGCCGCCTCCACACCCGCTTCAACAAGACG
    GCCACGGCCACGGGCAGGCTAAGTAGCTCCGATCCCAACCTCCAGAACATCCCCGTCCG
    CACCCCGCTTGGGCAGAGGATCCGCCGGGCCTTCATCGCCGAGGAGGGGTGGCTATTGG
    TGGCCCTGGACTATAGCCAGATAGAGCTCAGGGTGCTGGCCCACCTCTCCGGCGACGAG
    AACCTGATCCGGGTCTTCCAGGAGGGGCGGGACATCCACACGGAGACCGCCAGCTGGAT
    GTTCGGCGTCCCCCGGGAGGCCGTGGACCCCCTGATGCGCCGGGCGGCCAAGACCATCA
    ACTTCGGGGTCCTCTACGGCATGTCGGCCCACCGCCTCTCCCAGGAGCTAGCCATCCCT
    TACGAGGAGGCCCAGGCCTTCATTGAGCGCTACTTTCAGAGCTTCCCCAAGGTGCGGGC
    CTGGATTGAGAAGACCCTGGAGGAGGGCAGGAGGCGGGGGTACGTGGAGACCCTCTTCG
    GCCGCCGCCGCTACGTGCCAGACCTAGAGGCCCGGGTGAAGAGCGTGCGGGAGGCGGCC
    GAGCGCATGGCCTTCAACATGCCCGTCCAGGGCACCGCCGCCGACCTCATGAAGCTGAC
    TATGGTGAAGCTCTTCCCCAGGCTGGAGGAAATGGGGGCCAGGATGCTCCTTCAGGTCC
    ACGACGAGCTGGTCCTCGAGGCCCCAAAAGAGAGGGCGGAGGCCGTGGCCCGGCTGGCC
    AAGGAGGTCATGGAGGGGGTGTATCCCCTGGCCGTGCCCCTGGAGGTGGAGGTGGGGAT
    AGGGGAGGACTGGCTCTCCGCCAAGGAGTGA
    34 amino acid MGHHHHHHHHHHSSGHIEGRHMSKKQKLKFYDIKAKQAFETDQYEVIEKQTARGPMMFA
    sequence of VAKSPYTGIKVYRLLGKKKGGGVTSGMLPLFEPKGRVLLVDGHHLAYRTFHALKGLTTS
    Pae3192: Taq RGEPVQAVYGFAKSLLKALKEDGDAVIVVFDAKAPSFRHEAYGGYKAGRAPTPEDFPRQ
    fusion protein LALIKELVDLLGLARLEVPGYEADDVLASLAKKAEKEGYEVRILTADKDLYQLLSDRIH
    (including poly- VLHPEGYLITPAWLWEKYGLRPDQWADYRALTGDESDNLPGVKGIGEKTARKLLEEWGS
    His tag) LEALLKNLDRLKPAIREKILAHMDDLKLSWDLAKVRTDLPLEVDFAKRREPDRERLRAF
    LERLEFGSLLHEFGLLESPKALEEAPWPPPEGAFVGFVLSRKEPMWADLLALAAARGGR
    VHRAPEPYKALRDLKEARGLLAKDLSVLALREGLGLPPGDDPMLLAYLLDPSNTTPEGV
    ARRYGGEWTEEAGERAALSERLFANLWGRLEGEERLLWLYREVERPLSAVLAHMEATGV
    RLDVAYLRALSLEVAEEIARLEAEVFRLAGHPFNLNSRDQLERVLFDELGLPAIGKTEK
    TGKRSTSAAVLEALREAHPIVEKILQYRELTKLKSTYIDPLPDLIHPRTGRLHTRFNQT
    ATATGRLSSSDPNLQNIPVRTPLGQRIRRAFIAEEGWLLVALDYSQIELRVLAHLSGDE
    NLIRVFQEGRDIHTETASWMFGVPREAVDPLMRRAAKTINFGVLYGMSAHRLSQELAIP
    YEEAQAFIERYFQSFPKVRAWIEKTLEEGRRRGYVETLFGRRRYVPDLEARVKSVREAA
    ERMAFNMPVQGTAADLMKLAMVKLFPRLEEMGARMLLQVHDELVLEAPKERAEAVARLA
    KEVMEGVYPLAVPLEVEVGIGEDWLSAKE
    35 amino acid HMSKKQKLKFYDIKAKQAFETDQYEVIEKQTARGPMMFA
    sequence of VAKSPYTGIKVYRLLGKKKGGGVTSGMLPLFEPKGRVLLVDGHHLAYRTFHALKGLTTS
    Pae3192: Taq RGEPVQAVYGFAKSLLKALKEDGDAVIVVFDAKAPSFRHEAYGGYKAGRAPTPEDFPRQ
    fusion protein LALIKELVDLLGLARLEVPGYEADDVLASLAKKAEKEGYEVRILTADKDLYQLLSDRIH
    (after removal of VLHPEGYLITPAWLWEKYGLRPDQWADYRALTGDESDNLPGVKGIGEKTARKLLEEWGS
    poly-His tag) LEALLKNLDRLKPAIREKILAHMDDLKLSWDLAKVRTDLPLEVDFAKRREPDRERLRAF
    LERLEFGSLLHEFGLLESPKALEEAPWPPPEGAFVGFVLSRKEPMWADLLALAAARGGR
    VHRAPEPYKALRDLKEARGLLAKDLSVLALREGLGLPPGDDPMLLAYLLDPSNTTPEGV
    ARRYGGEWTEEAGERAALSERLFANLWGRLEGEERLLWLYREVERPLSAVLAHMEATGV
    RLDVAYLRALSLEVAEEIARLEAEVFRLAGHPFNLNSRDQLERVLFDELGLPAIGKTEK
    TGKRSTSAAVLEALREAHPIVEKILQYRELTKLKSTYIDPLPDLIHPRTGRLHTRFNQT
    ATATGRLSSSDPNLQNIPVRTPLGQRIRRAFIAEEGWLLVALDYSQIELRVLAHLSGDE
    NLIRVFQEGRDIHTETASWMFGVPREAVDPLMRRAAKTINFGVLYGMSAHRLSQELAIP
    YEEAQAFIERYFQSFPKVRAWIEKTLEEGRRRGYVETLFGRRRYVPDLEARVKSVREAA
    ERMAFNMPVQGTAADLMKLAMVKLFPRLEEMGARMLLQVHDELVLEAPKERAEAVARLA
    KEVMEGVYPLAVPLEVEVGIGEDWLSAKE
    36 polynucleotide ATGGGCCATCATCATCATCATCATCATCATCATCACAGCAGCGGCCATATCGAAG
    encoding GTCGTCATATGCCGAAGAAGGAGAAGATTAAGTTCTTCGACCTGGTCGCCAAGAAGTAC
    Ape3192: Taq TACGAGACTGACAACTACGAAGTCGAGATTAAGGAGACTAAGCGCGGCAAGTTTCGCTT
    fusion protein CGCCAAAGCCAAGAGCCCGTACACCGGCAAGATCTTCTATCGCGTGCTGGGCAAAGCCG
    (inclusing poly- GCGGCGGTGTCACTAGTGGGATGCTGCCCCTCTTTGAGCCCAAGGGCCGGGTCCTCCTG
    His tag) GTGGACGGCCACCACCTGGCCTACCGCACCTTCCACGCCCTGAAGGGCCTCACCACCAG
    CCGGGGGGAGCCGGTGCAGGCGGTCTACGGCTTCGCCAAGAGCCTCCTCAAGGCCCTCA
    AGGAGGACGGGGACGCGGTGATCGTGGTCTTTGACGCCAAGGCCCCCTCCTTCCGCCAC
    GAGGCCTACGGGGGGTACAAGGCGGGCCGGGCCCCCACGCCGGAGGACTTTCCCCGGCA
    ACTCGCCCTCATCAAGGAGCTGGTGGACCTCCTGGGGCTGGCGCGCCTCGAGGTCCCGG
    GCTACGAGGCGGACGACGTCCTGGCCAGCCTGGCCAAGAAGGCGGAAAAGGAGGGCTAC
    GAGGTCCGCATCCTCACCGCCGACAAAGACCTTTACCAGCTCCTTTCCGACCGCATCCA
    CGTCCTCCACCCCGAGGGGTACCTCATCACCCCGGCCTGGCTTTGGGAAAAGTACGGCC
    TGAGGCCCGACCAGTGGGCCGACTACCGGGCCCTGACCGGGGACGAGTCCGACAACCTT
    CCCGGGGTCAAGGGCATCGGGGAGAAGACGGCGAGGAAGCTTCTGGAGGAGTGGGGGAG
    CCTGGAAGCCCTCCTCAAGAACCTGGACCGGCTGAAGCCCGCCATCCGGGAGAAGATCC
    TGGCCCACATGGACGATCTGAAGCTCTCCTGGGACCTGGCCAAGGTGCGCACCGACCTG
    CCCCTGGAGGTGGACTTCGCCAAAAGGCGGGAGCCCGACCGGGAGAGGCTTAGGGCCTT
    TCTGGAGAGGCTTGAGTTTGGCAGCCTCCTCCACGAGTTCGGCCTTCTGGAAAGCCCCA
    AGGCCCTGGAGGAGGCCCCCTGGCCCCCGCCGGAAGGGGCCTTCGTGGGCTTTGTGCTT
    TCCCGCAAGGAGCCCATGTGGGCCGATCTTCTGGCCCTGGCCGCCGCCAGGGGGGGCCG
    GGTCCACCGGGCCCCCGAGCCTTATAAAGCCCTCAGGGACCTGAAGGAGGCGCGGGGGC
    TTCTCGCCAAAGACCTGAGCGTTCTGGCCCTGAGGGAAGGCCTTGGCCTCCCGCCCGGC
    GACGACCCCATGCTCCTCGCCTACCTCCTGGACCCTTCCAACACCACCCCCGAGGGGGT
    GGCCCGGCGCTACGGCGGGGAGTGGACGGAGGAGGCGGGGGAGCGGGCCGCCCTTTCCG
    AGAGGCTCTTCGCCAACCTGTGGGGGAGGCTTGAGGGGGAGGAGAGGCTCCTTTGGCTT
    TACCGGGAGGTGGAGAGGCCCCTTTCCGCTGTCCTGGCCCACATGGAGGCCACGGGGGT
    GCGCCTGGACGTGGCCTATCTCAGGGCCTTGTCCCTGGAGGTGGCCGAGGAGATCGCCC
    GCCTCGAGGCCGAGGTCTTCCGCCTGGCCGGCCACCCCTTCAACCTCAACTCCCGGGAC
    CAGCTGGAAAGGGTCCTCTTTGACGAGCTAGGGCTTCCCGCCATCGGCAAGACGGAGAA
    GACCGGCAAGCGCTCCACCAGCGCCGCCGTCCTGGAGGCCCTCCGCGAGGCCCACCCCA
    TCGTGGAGAAGATCCTGCAGTACCGGGAGCTCACCAAGCTGAAGAGCACCTACATTGAC
    CCCTTGCCGGACCTCATCCACCCCAGGACGGGCCGCCTCCACACCCGCTTCAACCAGAC
    GGCCACGGCCACGGGCAGGCTAAGTAGCTCCGATCCCAACCTCCAGAACATCCCCGTCC
    GCACCCCGCTTGGGCAGAGGATCCGCCGGGCCTTCATCGCCGAGGAGGGGTGGCTATTG
    GTGGCCCTGGACTATAGCCAGATAGAGCTCAGGGTGCTGGCCCACCTCTCCGGCGACGA
    GAACCTGATCCGGGTCTTCCAGGAGGGGCGGGACATCCACACGGAGACCGCCAGCTGGA
    TGTTCGGCGTCCCCCGGGAGGCCGTGGACCCCCTGATGCGCCGGGCGGCCAAGACCATC
    AACTTCGGGGTCCTCTACGGCATGTCGGCCCACCGCCTCTCCCAGGAGCTAGCCATCCC
    TTACGAGGAGGCCCAGGCCTTCATTGAGCGCTACTTTCAGAGCTTCCCCAAGGTGCGGG
    CCTGGATTGAGAAGACCCTGGAGGAGGGCAGGAGGCGGGGGTACGTGGAGACCCTCTTC
    GGCCGCCGCCGCTACGTGCCAGACCTAGAGGCCCGGGTGAAGAGCGTGCGGGAGGCGGC
    CGAGCGCATGGCCTTCAACATGCCCGTCCAGGGCACCGCCGCCGACCTCATGAAGCTGG
    CTATGGTGAAGCTCTTCCCCAGGCTGGAGGAAATGGGGGCCAGGATGCTCCTTCAGGTC
    CACGACGAGCTGGTCCTCGAGGCCCCAAAAGAGAGGGCGGAGGCCGTGGCCCGGCTGGC
    CAAGGAGGTCATGGAGGGGGTGTATCCCCTGGCCGTGCCCCTGGAGGTGGAGGTGGGGA
    TAGGGGAGGACTGGCTCTCCGCCAAGGAGTGA
    37 amino acid MGHHHHHHHHHHSSGHIEGRHMPKKEKIKFFDLVAKKYYETDNYEVEIKETKRGKFRFA
    sequence of KAKSPYTGKIFYRVLGKAGGGVTSGMLPLFEPKGRVLLVDGHHLAYRTFHALKGLTTSR
    Ape3192: Taq GEPVQAVYGFAKSLLKALKEDGDAVIVVFDAKAPSFRHEAYGGYKAGRAPTPEDFPRQL
    fusion protein ALIKELVDLLGLARLEVPGYEADDVLASLAKKAEKEGYEVRILTADKDLYQLLSDRIHV
    (including poly- LHPEGYLITPAWLWEKYGLRPDQWADYRALTGDESDNLPGVKGIGEKTARKLLEEWGSL
    His tag) EALLKNLDRLKPAIREKILAHMDDLKLSWDLAKVRTDLPLEVDFAKRREPDRERLRAFL
    ERLEFGSLLHEFGLLESPKALEEAPWPPPEGAFVGFVLSRKEPMWADLLALAAARGGRV
    HRAPEPYKALRDLKEARGLLAKDLSVLALREGLGLPPGDDPMLLAYLLDPSNTTPEGVA
    RRYGGEWTEEAGERAALSERLFANLWGRLEGEERLLWLYREVERPLSAVLAHMEATGVR
    LDVAYLRALSLEVAEETARLEAEVFRLAGHPFNLNSRDQLERVLFDELGLPAIGKTEKT
    GKRSTSAAVLEALREAHPIVEKILQYRELTKLKSTYIDPLPDLIHPRTGRLHTRFNQTA
    TATGRLSSSDPNLQNIPVRTPLGQRIRRAFIAEEGWLLVALDYSQIELRVLAHLSGDEN
    LIRVFQEGRDIHTETASWMFGVPREAVDPLMRRAAKTINFGVLYGMSAHRLSQELAIPY
    EEAQAFIERYFQSFPKVRAWIEKTLEEGRRRGYVETLFGRRRYVPDLEARVKSVREAAE
    RMAFNMPVQGTAADLMKLAMVKLFPRLEEMGARMLLQVHDELVLEAPKERAEAVARLAK
    EVMEGVYPLAVPLEVEVGIGEDWLSAKE
    38 amino acid HMPKKEKIKFFDLVAKKYYETDNYEVEIKETKRGKFRFA
    sequence of KAKSPYTGKIFYRVLGKAGGGVTSGMLPLFEPKGRVLLVDGHHLAYRTFHALKGLTTSR
    Ape3192: Taq GEPVQAVYGFAKSLLKALKEDGDAVIVVFDAKAPSFRHEAYGGYKAGRAPTPEDFPRQL
    fusion protein ALIKELVDLLGLARLEVPGYEADDVLASLAKKAEKEGYEVRILTADKDLYQLLSDRIHV
    (after removal of LHPEGYLITPAWLWEKYGLRPDQWADYRALTGDESDNLPGVKGIGEKTARKLLEEWGSL
    poly-His tag) EALLKNLDRLKPAIREKILAHMDDLKLSWDLAKVRTDLPLEVDFAKRREPDRERLRAFL
    ERLEFGSLLHEFGLLESPKALEEAPWPPPEGAFVGFVLSRKEPMWADLLALAAARGGRV
    HRAPEPYKALRDLKEARGLLAKDLSVLALREGLGLPPGDDPMLLAYLLDPSNTTPEGVA
    RRYGGEWTEEAGERAALSERLFANLWGRLEGEERLLWLYREVERPLSAVLAHMEATGVR
    LDVAYLRALSLEVAEETARLEAEVFRLAGHPFNLNSRDQLERVLFDELGLPAIGKTEKT
    GKRSTSAAVLEALREAHPIVEKILQYRELTKLKSTYIDPLPDLIHPRTGRLHTRFNQTA
    TATGRLSSSDPNLQNIPVRTPLGQRIRRAFIAEEGWLLVALDYSQIELRVLAHLSGDEN
    LIRVFQEGRDIHTETASWMFGVPREAVDPLMRRAAKTINFGVLYGMSAHRLSQELAIPY
    EEAQAFIERYFQSFPKVRAWIEKTLEEGRRRGYVETLFGRRRYVPDLEARVKSVREAAE
    RMAFNMPVQGTAADLMKLAMVKLFPRLEEMGARMLLQVHDELVLEAPKERAEAVARLAK
    EVMEGVYPLAVPLEVEVGIGEDWLSAKE
    39 polynucleotide ATGGGCCATCATCATCATCATCATCATCATCATCACAGCAGCGGCCATATCGAAGGTCG
    encoding TCATATGTCCAAGAAGCAGAAACTGAAGTTCTACGACATTAAGGCGAAGCAGGCGTTTG
    Pae3192: Stoffel- AGACCGACCAGTACGAGGTTATTGAGAAGCAGACCGCCCGCGGTCCGATGATGTTCGCC
    Tag fusion GTGGCCAAATCGCCGTACACCGGCATTAAAGTGTACCGCCTGTTAGGCAAGAAGAAAGG
    protein CGGCGGTGTCACTAGTCCCAAGGCCCTGGAGGAGGCCCCCTGGCCCCCGCCGGAAGGGG
    (including poly- CCTTCGTGGGCTTTGTGCTTTCCCGCAAGGAGCCCATGTGGGCCGATCTTCTGGCCCTG
    His tag) GCCGCCGCCAGGGGGGGCCGGGTCCACCGGGCCCCCGAGCCTTATAAAGCCCTCAGGGA
    CCTGAAGGAGGCGCGGGGGCTTCTCGCCAAAGACCTGAGCGTTCTGGCCCTGAGGGAAG
    GCCTTGGCCTCCCGCCCGGCGACGACCCCATGCTCCTCGCCTACCTCCTGGACCCTTCC
    AACACCACCCCCGAGGGGGTGGCCCGGCGCTACGGCGGGGAGTGGACGGAGGAGGCGGG
    GGAGCGGGCCGCCCTTTCCGAGAGGCTCTTCGCCAACCTGTGGGGGAGGCTTGAGGGGG
    AGGAGAGGCTCCTTTGGCTTTACCGGGAGGTGGAGAGGCCCCTTTCCGCTGTCCTGGCC
    CACATGGAGGCCACGGGGGTGCGCCTGGACGTGGCCTATCTCAGGGCCTTGTCCCTGGA
    GGTGGCCGAGGAGATCGCCCGCCTCGAGGCCGAGGTCTTCCGCCTGGCCGGCCACCCCT
    TCAACCTCAACTCCCGGGACCAGCTGGAAAGGGTCCTCTTTGACGAGCTAGGGCTTCCC
    GCCATCGGCAAGACGGAGAAGACCGGCAAGCGCTCCACCAGCGCCGCCGTCCTGGAGGC
    CCTCCGCGAGGCCCACCCCATCGTGGAGAAGATCCTGCAGTACCGGGAGCTCACCAAGC
    TGAAGAGCACCTACATTGACCCCTTGCCGGACCTCATCCACCCCAGGACGGGCCGCCTC
    CACACCCGCTTCAACCAGACGGCCACGGCCACGGGCAGGCTAAGTAGCTCCGATCCCAA
    CCTCCAGAACATCCCCGTCCGCACCCCGCTTGGGCAGAGGATCCGCCGGGCCTTCATCG
    CCGAGGAGGGGTGGCTATTGGTGGCCCTGGACTATAGCCAGATAGAGCTCAGGGTGCTG
    GCCCACCTCTCCGGCGACGAGAACCTGATCCGGGTCTTCCAGGAGGGGCGGGACATCCA
    CACGGAGACCGCCAGCTGGATGTTCGGCGTCCCCCGGGAGGCCGTGGACCCCCTGATGC
    GCCGGGCGGCCAAGACCATCAACTTCGGGGTCCTCTACGGCATGTCGGCCCACCGCCTC
    TCCCAGGAGCTAGCCATCCCTTACGAGGAGGCCCAGGCCTTCATTGAGCGCTACTTTCA
    GAGCTTCCCCAAGGTGCGGGCCTGGATTGAGAAGACCCTGGAGGAGGGCAGGAGGCGGG
    GGTACGTGGAGACCCTCTTCGGCCGCCGCCGCTACGTGCCAGACCTAGAGGCCCGGGTG
    AAGAGCGTGCGGGAGGCGGCCGAGCGCATGGCCTTCAACATGCCCGTCCAGGGCACCGC
    CGCCGACCTCATGAAGCTGGCTATGGTGAAGCTCTTCCCCAGGCTGGAGGAAATGGGGG
    CCAGGATGCTCCTTCAGGTCCACGACGAGCTGGTCCTCGAGGCCCCAAAAGAGAGGGCG
    GAGGCCGTGGCCCGGCTGGCCAAGGAGGTCATGGAGGGGGTGTATCCCCTGGCCGTGCC
    CCTGGAGGTGGAGGTGGGGATAGGGGAGGACTGGCTCTCCGCCAAGGAGTGA
    40 amino acid MGHHHHHHHHHHSSGHIEGRHMSKKQKLKFYDIKAKQAFETDQYEVIEKQTARGPMMFA
    sequence of VAKSPYTGIKVYRLLGKKKGGGVTSPKALEEAPWPPPEGAFVGFVLSRKEPMWADLLAL
    Pae3192: Stoffel- AAARGGRVHRAPEPYKALRDLKEARGLLAKDLSVLALREGLGLPPGDDPMLLAYLLDPS
    Tag fusion NTTPEGVARRYGGEWTEEAGERAALSERLFANLWGRLEGEERLLWLYREVERPLSAVLA
    protein HMEATGVRLDVAYLRALSLEVAEEIARLEAEVFRLAGHPFNLNSRDQLERVLFDELGLP
    (including poly- AIGKTEKTGKRSTSAAVLEALREAHPIVEKILQYRELTKLKSTYIDPLPDLIHPRTGRL
    His tag) HTRFNQTATATGRLSSSDPNLQNIPVRTPLGQRIRRAFIAEEGWLLVALDYSQIELRVL
    AHLSGDENLIRVFQEGRDIHTETASWMFGVPREAVDPLMRRAAKTINFGVLYGMSAHRL
    SQELAIPYEEAQAFIERYFQSFPKVRAWIEKTLEEGRRRGYVETLFGRRRYVPDLEARV
    KSVREAAERMAFNMPVQGTAADLMKLAMVKLFPRLEEMGARMLLQVHDELVLEAPKERA
    EAVARLAKEVMEGVYPLAVPLEVEVGIGEDWLSAKE
    41 amino acid HMSKKQKLKFYDIKAKQAFETDQYEVIEKQTARGPMMFA
    sequence of VAKSPYTGIKVYRLLGKKKGGGVTSPKALEEAPWPPPEGAFVGFVLSRKEPMWADLLAL
    Pae3192: Stoffel- AAARGGRVHRAPEPYKALRDLKEARGLLAKDLSVLALREGLGLPPGDDPMLLAYLLDPS
    Tag fusion NTTPEGVARRYGGEWTEEAGERAALSERLFANLWGRLEGEERLLWLYREVERPLSAVLA
    protein (after HMEATGVRLDVAYLRALSLEVAEEIARLEAEVFRLAGHPFNLNSRDQLERVLFDELGLP
    removal of poly- AIGKTEKTGKRSTSAAVLEALREAHPIVEKILQYRELTKLKSTYIDPLPDLIHPRTGRL
    His tag) HTRFNQTATATGRLSSSDPNLQNIPVRTPLGQRIRRAFIAEEGWLLVALDYSQIELRVL
    AHLSGDENLIRVFQEGRDIHTETASWMFGVPREAVDPLMRRAAKTINFGVLYGMSAHRL
    SQELAIPYEEAQAFIERYFQSFPKVRAWIEKTLEEGRRRGYVETLFGRRRYVPDLEARV
    KSVREAAERMAFNMPVQGTAADLMKLAMVKLFPRLEEMGARMLLQVHDELVLEAPKERA
    EAVARLAKEVMEGVYPLAVPLEVEVG IGEDWLSAKE
    42 polynucleotide ATGGGCCATCATCATCATCATCATCATCATCATCACAGCAGCGGCCATATCGAAGGTCG
    encoding TCATATGCCGAAGAAGGAGAAGATTAAGTTCTTCGACCTGGTCGCCAAGAAGTACTACG
    Ape3192: Stoffel- AGACTGACAACTACGAAGTCGAGATTAAGGAGACTAAGCGCGGCAAGTTTCGCTTCGCC
    Tag fusion AAAGCCAAGAGCCCGTACACCGGCAAGATCTTCTATCGCGTGCTGGGCAAAGCCGGCGG
    protein CGGTGTCACTAGTCCCAAGGCCCTGGAGGAGGCCCCCTGGCCCCCGCCGGAAGGGGCCT
    (including poly- TCGTGGGCTTTGTGCTTTCCCGCAAGGAGCCCATGTGGGCCGATCTTCTGGCCCTGGCC
    His tag) GCCGCCAGGGGGGGCCGGGTCCACCGGGCCCCCGAGCCTTATAAAGCCCTCAGGGACCT
    GAAGGAGGCGCGGGGGCTTCTCGCCAAAGACCTGAGCGTTCTGGCCCTGAGGGAAGGCC
    TTGGCCTCCCGCCCGGCGACGACCCCATGCTCCTCGCCTACCTCCTGGACCCTTCCCAC
    ACCACCCCCGAGGGGGTGGCCCGGCGCTACGGCGGGGAGTGGACGGAGGAGGCGGGGGA
    GCGGGCCGCCCTTTCCGAGAGGCTCTTCGCCAACCTGTGGGGGAGGCTTGAGGGGGAGG
    AGAGGCTCCTTTGGCTTTACCGGGAGGTGGAGAGGCCCCTTTCCGCTGTCCTGGCCCAC
    ATGGAGGCCACGGGGGTGCGCCTGGACGTGGCCTATCTCAGGGCCTTGTCCCTGGAGGT
    GGCCGAGGAGATCGCCCGCCTCGAGGCCGAGGTCTTCCGCCTGGCCGGCCACCCCTTCA
    ACCTCAACTCCCGGGACCAGCTGGAAAGGGTCCTCTTTGACGAGCTAGGGCTTCCCGCC
    ATCGGCAAGACGGAGAAGACCGGCAAGCGCTCCACCAGCGCCGCCGTCCTGGAGGCCCT
    CCGCGAGGCCCACCCCATCGTGGAGAAGATCCTGCAGTACCGGGAGCTCACCAAGCTGA
    AGAGCACCTACATTGACCCCTTGCCGGACCTCATCCACCCCAGGACGGGCCGCCTCCAC
    ACCCGCTTCAACCAGACGGCCACGGCCACGGGCAGGCTAAGTAGCTCCGATCCCAACCT
    CCAGAACATCCCCGTCCGCACCCCGCTTGGGCAGAGGATCCGCCGGGCCTTCATCGCCG
    AGGAGGGGTGGCTATTGGTGGCCCTGGACTATAGCCAGATAGAGCTCAGGGTGCTGGCC
    CACCTCTCCGGCGACGAGAACCTGATCCGGGTCTTCCAGGAGGGGCGGGACATCCACAC
    GGAGACCGCCAGCTGGATGTTCGGCGTCCCCCGGGAGGCCGTGGACCCCCTGATGCGCC
    GGGCGGCCAAGACCATCAACTTCGGGGTCCTCTACGGCATGTCGGCCCACCGCCTCTCC
    CAGGAGCTAGCCATCCCTTACGAGGAGGCCCAGGCCTTCATTGAGCGCTACTTTCAGAG
    CTTCCCCAAGGTGCGGGCCTGGATTGAGAAGACCCTGGAGGAGGGCAGGAGGCGGGGGT
    ACGTGGAGACCCTCTTCGGCCGCCGCCGCTACGTGCCAGACCTAGAGGCCCGGGTGAAG
    AGCGTGCGGGAGGCGGCCGAGCGCATGGCCTTCAACATGCCCGTCCAGGGCACCGCCGC
    CGACCTCATGAAGCTGGCTATGGTGAAGCTCTTCCCCAGGCTGGAGGAAATGGGGGCCA
    GGATGCTCCTTCAGGTCCACGACGAGCTGGTCCTCGAGGCCCCAAAAGAGAGGGCGGAG
    GCCGTGGCCCGGCTGGCCAAGGAGGTCATGGAGGGGGTGTATCCCCTGGCCGTGCCCCT
    GGAGGTGGAGGTGGGGATAGGGGAGGACTGGCTCTCCGCCAAGGAGTGA
    43 amino acid MGHHHHHHHHHHSSGHIEGRHMPKKEKIKFFDLVAKKYYETDNYEVEIKETKRGKFRFA
    sequence of KAKSPYTGKIFYRVLGKAGGGVTSPKALEEAPWPPPEGAFVGFVLSRKEPMWADLLALA
    Ape3192: Stoffel- AARGGRVHRAPEPYKALRDLKEARGLLAKDLSVLALREGLGLPPGDDPMLLAYLLDPSN
    Tag fusion TTPEGVARRYGGEWTEEAGERAALSERLFANLWGRLEGEERLLWLYREVERPLSAVLAH
    protein MEATGVRLDVAYLRALSLEVAEEIARLEAEVFRLAGHPFNLNSRDQLERVLFDELGLPA
    (including poly- IGKTEKTGKRSTSAAVLEALREAHPIVEKILQYRELTKLKSTYIDPLPDLIHPRTGRLH
    His tag) TRFNQTATATGRLSSSDPNLQNIPVRTPLGQRIRRAFIAEEGWLLVALDYSQIELRVLA
    HLSGDENLIRVFQEGRDIHTETASWMFGVPREAVDPLMRRAAKTINFGVLYGMSAHRLS
    QELAIPYEEAQAFIERYFQSFPKVRAWIEKTLEEGRRRGYVETLFGRRRYVPDLEARVK
    SVREAAERMAFNMPVQGTAADLMKLAMVKLFPRLEEMGARMLLQVHDELVLEAPKERAE
    AVARLAKEVMEGVYPLAVPLEVEVGIGEDWLSAKE
    44 amino acid HMPKKEKIKFFDLVAKKYYETDNYEVEIKETKRGKFRFA
    sequence of KAKSPYTGKIFYRVLGKAGGGVTSPKALEEAPWPPPEGAFVGFVLSRKEPMWADLLALA
    Ape3192: Stoffel- AARGGRVHRAPEPYKALRDLKEARGLLAKDLSVLALREGLGLPPGDDPMLLAYLLDPSN
    Tag fusion TTPEGVARRYGGEWTEEAGERAALSERLFANLWGRLEGEERLLWLYREVERPLSAVLAH
    protein (after MEATGVRLDVAYLRALSLEVAEEIARLEAEVFRLAGHPFNLNSRDQLERVLFDELGLPA
    removal of poly- IGKTEKTGKRSTSAAVLEALREAHPIVEKILQYRELTKLKSTYIDPLPDLIHPRTGRLH
    His tag) TRFNQTATATGRLSSSDPNLQNIPVRTPLGQRIRRAFIAEEGWLLVALDYSQIELRVLA
    HLSGDENLIRVFQEGRDIHTETASWMFGVPREAVDPLMRRAAKTINFGVLYGMSAHRLS
    QELAIPYEEAQAFIERYFQSFPKVRAWIEKTLEEGRRRGYVETLFGRRRYVPDLEARVK
    SVREAAERMAFNMPVQGTAADLMKLAMVKLFPRLEEMGARMLLQVHDELVLEAPKERAE
    AVARLAKEVMEGVYPLAVPLEVEVGIGEDWLSAKE
    45 exemplary PCR ggaagtacagctcagagttctgcagcacccctgc
    “forward” primer
    46 exemplary PCR gatgcgaaactgaggctggctgtactgtctc
    “reverse” primer

Claims (13)

1.-7. (canceled)
8. A method of producing a polypeptide comprising an amino acid sequence having at least 90% identity to SEQ ID NOs:1, 4 6 or 28 wherein the polypeptide has nucleic acid binding activity, the method comprising culturing a host cell comprising a vector comprising a polynucleotide sequence encoding the polypeptide under conditions suitable for expression of the polypeptide and isolating the polypeptide so expressed.
9.-24. (canceled)
25. A fusion protein comprising:
a polypeptide comprising an amino acid sequence of a Crenarchaeal nucleic acid binding polypeptide or a fragment thereof having nucleic acid binding activity; and
a nucleic acid polymerase.
26. (canceled)
27. The fusion protein of claim 25, wherein the nucleic acid polymerase is a thermostable DNA polymerase.
28. The fusion protein of claim 25, wherein the polypeptide comprises an amino acid sequence having at least 90% identiy to SEQ ID NOs:1, 4, 6, or 28 or a fragment thereof wherein the polynucleotide or fragment thereof has nucleic acid binding activity.
29.-52. (canceled)
53. A method of increasing the processivity of a DNA polymerase comprising combining a reaction mixture comprising the DNA polymerase with a polypeptide comprising an amino acid sequence of a Crenarchaeal nucleic acid binding polypeptide or a fragment thereof having nucleic acid binding activity.
54.-55. (canceled)
56. The method of claim 53, wherein the polypeptide comprises an amino acid sequence having at least 95% identity to SEQ ID NOs:4, 6, or 28.
57. The method of claim 56, wherein the polypeptide comprises the amino acid sequence of SEQ ID NOs:4, 6, or 28 or a fragment of SEQ ID NO:4 6, or 28 that has nucleic acid binding activity.
58.-69. (canceled)
US14/934,029 2005-01-06 2015-11-05 Polypeptides having nucleic acid binding activity Abandoned US20160122393A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US14/934,029 US20160122393A1 (en) 2005-01-06 2015-11-05 Polypeptides having nucleic acid binding activity

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US64198705P 2005-01-06 2005-01-06
US11/327,195 US8349600B2 (en) 2005-01-06 2006-01-06 Polypeptides having nucleic acid binding activity
US13/619,099 US9212352B2 (en) 2005-01-06 2012-09-14 Polypeptides having nucleic acid binding activity
US14/934,029 US20160122393A1 (en) 2005-01-06 2015-11-05 Polypeptides having nucleic acid binding activity

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US13/619,099 Continuation US9212352B2 (en) 2005-01-06 2012-09-14 Polypeptides having nucleic acid binding activity

Publications (1)

Publication Number Publication Date
US20160122393A1 true US20160122393A1 (en) 2016-05-05

Family

ID=36648169

Family Applications (3)

Application Number Title Priority Date Filing Date
US11/327,195 Expired - Fee Related US8349600B2 (en) 2005-01-06 2006-01-06 Polypeptides having nucleic acid binding activity
US13/619,099 Active US9212352B2 (en) 2005-01-06 2012-09-14 Polypeptides having nucleic acid binding activity
US14/934,029 Abandoned US20160122393A1 (en) 2005-01-06 2015-11-05 Polypeptides having nucleic acid binding activity

Family Applications Before (2)

Application Number Title Priority Date Filing Date
US11/327,195 Expired - Fee Related US8349600B2 (en) 2005-01-06 2006-01-06 Polypeptides having nucleic acid binding activity
US13/619,099 Active US9212352B2 (en) 2005-01-06 2012-09-14 Polypeptides having nucleic acid binding activity

Country Status (6)

Country Link
US (3) US8349600B2 (en)
EP (1) EP1833969B1 (en)
AT (1) ATE491032T1 (en)
AU (1) AU2006203970A1 (en)
DE (1) DE602006018701D1 (en)
WO (1) WO2006074288A2 (en)

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1833969B1 (en) 2005-01-06 2010-12-08 Life Technologies Corporation Polypeptides having nucleic acid binding activity
WO2010091203A2 (en) * 2009-02-04 2010-08-12 Lucigen Corporation Rna-and dna-copying enzymes
CN102392000A (en) * 2011-11-07 2012-03-28 李志茹 High-temperature-resistant Pyrolobus polymerase and efficient expression plasmid and application thereof
US10626383B2 (en) 2016-01-15 2020-04-21 Thermo Fisher Scientific Baltics Uab Thermophilic DNA polymerase mutants
US10273530B2 (en) 2016-01-15 2019-04-30 Thermo Fisher Scientific Baltics Uab Antibodies that bind thermophilic DNA polymerases
US20200002689A1 (en) 2017-02-13 2020-01-02 Qiagen Sciences, Llc Polymerase enzyme from 9°n
WO2019002178A1 (en) 2017-06-26 2019-01-03 Thermo Fisher Scientific Baltics Uab Thermophilic dna polymerase mutants
CN114574464B (en) * 2022-05-05 2022-10-14 南京巨匠生物科技有限公司 High-fidelity DNA polymerase mutant and application thereof

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6627424B1 (en) * 2000-05-26 2003-09-30 Mj Bioworks, Inc. Nucleic acid modifying enzymes

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030228616A1 (en) 1999-10-29 2003-12-11 Stratagene DNA polymerase mutants with reverse transcriptase activity
JP4689163B2 (en) * 2001-11-28 2011-05-25 バイオ−ラッド ラボラトリーズ インコーポレーティッド How to use improved polymerase
CN1720333A (en) 2002-11-01 2006-01-11 美格贝斯科研制品公司 Gas jet process and apparatus for high-speed amplification of DNA
EP1833969B1 (en) 2005-01-06 2010-12-08 Life Technologies Corporation Polypeptides having nucleic acid binding activity

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6627424B1 (en) * 2000-05-26 2003-09-30 Mj Bioworks, Inc. Nucleic acid modifying enzymes

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Ngo et al. in The Protein Folding Problem and Tertiary Structure Prediction, 1994, Merz et al. (ed.), Birkhauser, Boston, MA, pp. 433 and 492-495. *

Also Published As

Publication number Publication date
EP1833969B1 (en) 2010-12-08
ATE491032T1 (en) 2010-12-15
WO2006074288A3 (en) 2007-03-08
US20130164817A1 (en) 2013-06-27
US20060223157A1 (en) 2006-10-05
WO2006074288A2 (en) 2006-07-13
US8349600B2 (en) 2013-01-08
AU2006203970A1 (en) 2006-07-13
DE602006018701D1 (en) 2011-01-20
EP1833969A2 (en) 2007-09-19
US9212352B2 (en) 2015-12-15

Similar Documents

Publication Publication Date Title
US9267130B2 (en) Polypeptides having nucleic acid binding activity and compositions and methods for nucleic acid amplification
US9212352B2 (en) Polypeptides having nucleic acid binding activity
US9040276B2 (en) DNA binding protein-polymerase chimeras
Boudsocq et al. Sulfolobus solfataricus P2 DNA polymerase IV (Dpo4): an archaeal DinB-like DNA polymerase with lesion-bypass properties akin to eukaryotic polη
EP2352818B1 (en) Chimeric dna polymerases
JP4220557B2 (en) Overexpression and purification of truncated thermostable DNA polymerase by protein fusion
Lao-Sirieix et al. The heterodimeric primase of the hyperthermophilic archaeon Sulfolobus solfataricus possesses DNA and RNA primase, polymerase and 3′-terminal nucleotidyl transferase activities
AU8416198A (en) Altered thermostable DNA polymerases for sequencing
PT624641E (en) THERMOSTABLE NUCLEIC ACID POLYMERASES
EP2986719B1 (en) Fusion polymerases
JP2021528968A (en) Nucleic acid molecule encoding fusion single-stranded DNA polymerase Bst, fusion DNA polymerase NeqSSB-Bst, its preparation method and its use
CA2802000C (en) Dna polymerases with increased 3'-mismatch discrimination
JP2003506048A (en) Archaeal replication cofactors and methods of use
JP2002253265A (en) Varied heat resistant dna polymerase
KR101230362B1 (en) Hot-start pcr based on the protein trans-splicing of nanoarchaeum equitans dna polymerase
JP2002360261A (en) Dna polymerase-related factor
JP3487394B2 (en) Modified thermostable DNA polymerase and use thereof
ES2383990T3 (en) FEN endonucleases
RU2413767C1 (en) Thermostable dna-ligase from archaea of genus acidilobus
WO2016104272A1 (en) Pcr method
JP2003284576A (en) Dna polymerase composition for nucleic acid amplification

Legal Events

Date Code Title Description
AS Assignment

Owner name: APPLIED BIOSYSTEMS INC., CALIFORNIA

Free format text: MERGER;ASSIGNOR:APPLERA CORPORATION;REEL/FRAME:037995/0516

Effective date: 20080630

Owner name: APPLERA CORPORATION, CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:MARTIN, PATRICK;SIMPSON, DAVID;REEL/FRAME:037995/0378

Effective date: 20060928

Owner name: APPLIED BIOSYSTEMS INC., CALIFORNIA

Free format text: MERGER;ASSIGNOR:ATOM ACQUISITION CORPORATION;REEL/FRAME:037998/0500

Effective date: 20081121

Owner name: APPLIED BIOSYSTEMS, LLC, CALIFORNIA

Free format text: MERGER;ASSIGNOR:ATOM ACQUISITION, LLC & APPLIED BIOSYSTEMS INC.;REEL/FRAME:037998/0680

Effective date: 20081121

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION