US20020106729A1 - Expression vectors - Google Patents

Expression vectors Download PDF

Info

Publication number
US20020106729A1
US20020106729A1 US09/897,006 US89700601A US2002106729A1 US 20020106729 A1 US20020106729 A1 US 20020106729A1 US 89700601 A US89700601 A US 89700601A US 2002106729 A1 US2002106729 A1 US 2002106729A1
Authority
US
United States
Prior art keywords
vector
protein
sequence
cells
expression
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US09/897,006
Inventor
Gregory Bleck
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
GALA DESIGN LLC
Original Assignee
GALA DESIGN LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by GALA DESIGN LLC filed Critical GALA DESIGN LLC
Priority to AU2001270252A priority Critical patent/AU2001270252B2/en
Priority to US09/897,006 priority patent/US20020106729A1/en
Assigned to GALA DESIGN, LLC reassignment GALA DESIGN, LLC ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: BLECK, GREGORY T.
Publication of US20020106729A1 publication Critical patent/US20020106729A1/en
Priority to US10/947,881 priority patent/US7378273B2/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/85Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
    • C12N15/86Viral vectors
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K16/00Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
    • C07K16/12Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from bacteria
    • C07K16/1203Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from bacteria from Gram-negative bacteria
    • C07K16/1228Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from bacteria from Gram-negative bacteria from Enterobacteriaceae (F), e.g. Citrobacter, Serratia, Proteus, Providencia, Morganella, Yersinia
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K16/00Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
    • C07K16/12Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from bacteria
    • C07K16/1267Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from bacteria from Gram-positive bacteria
    • C07K16/1282Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from bacteria from Gram-positive bacteria from Clostridium (G)
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K16/00Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
    • C07K16/18Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans
    • C07K16/28Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans against receptors, cell surface antigens or cell surface determinants
    • C07K16/30Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans against receptors, cell surface antigens or cell surface determinants from tumour cells
    • C07K16/3007Carcino-embryonic Antigens
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K16/00Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
    • C07K16/18Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans
    • C07K16/28Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans against receptors, cell surface antigens or cell surface determinants
    • C07K16/30Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans against receptors, cell surface antigens or cell surface determinants from tumour cells
    • C07K16/3061Blood cells
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K48/00Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2740/00Reverse transcribing RNA viruses
    • C12N2740/00011Details
    • C12N2740/10011Retroviridae
    • C12N2740/13011Gammaretrovirus, e.g. murine leukeamia virus
    • C12N2740/13041Use of virus, viral particle or viral elements as a vector
    • C12N2740/13043Use of virus, viral particle or viral elements as a vector viral genome or elements thereof as genetic vector
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2830/00Vector systems having a special element relevant for transcription
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2830/00Vector systems having a special element relevant for transcription
    • C12N2830/15Vector systems having a special element relevant for transcription chimeric enhancer/promoter combination
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2830/00Vector systems having a special element relevant for transcription
    • C12N2830/48Vector systems having a special element relevant for transcription regulating transport or export of RNA, e.g. RRE, PRE, WPRE, CTE
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2830/00Vector systems having a special element relevant for transcription
    • C12N2830/80Vector systems having a special element relevant for transcription from vertebrates
    • C12N2830/85Vector systems having a special element relevant for transcription from vertebrates mammalian
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2840/00Vectors comprising a special translation-regulating system
    • C12N2840/20Vectors comprising a special translation-regulating system translation of more than one cistron
    • C12N2840/203Vectors comprising a special translation-regulating system translation of more than one cistron having an IRES

Definitions

  • the present invention relates to novel regulatory elements and vectors for the expression of one or more proteins in a host cell.
  • proteins produced recombinantly in mammalian cells include erythropoietin, factor VIII, factor IX, and insulin.
  • recombinant antibodies are often used as therapeutic agents.
  • Clinical applications of recombinantly produced proteins, in particular antibodies, often require large amounts of highly purified proteins. Proteins are generally produced in either mammalian cell culture or in transgenic animals.
  • Retroviral vectors are widely used as vehicles for delivery of genes into mammalian cells (See e.g., Vile and Russell, British Medical Bulletin, 51:12 [1995]).
  • current methods for creating mammalian cell lines for expression of recombinant proteins suffer from several drawbacks. (See, e.g., Mielke et al., Biochem. 35:2239-52 [1996]).
  • Episomal systems allow for high expression levels of the recombinant protein, but are frequently only stable for a short time period (See, e.g., Klehr and Bode, Mol. Genet. (Life Sci. Adv.) 7:47-52 [1988]).
  • Mammalian cell lines containing integrated exogenous genes are somewhat more stable, but there is increasing evidence that stability depends on the presence of only a few copies or even a single copy of the exogenous gene. Vectors are often unstable, resulting in a decrease in the level of protein expression over time.
  • the present invention relates to novel regulatory elements and vectors for the expression of one or more proteins in a host cell.
  • the present invention provides a hybrid ⁇ -lactalbumin promoter comprising at least one portion derived from a first mammalian ⁇ -lactalbumin promoter and at least one portion derived from a second mammalian ⁇ -lactalbumin promoter.
  • the present invention is not limited to portions derived from any particular ⁇ -lactalbumin promoter. Indeed, portions from a variety of ⁇ -lactalbumin promoters are contemplated, including, but not limited to bovine, human, ovine, caprine, and murine ⁇ -lactalbumin promoters.
  • the present invention provides a nucleic acid comprising a nucleic acid sequence selected from the group consisting of SEQ ID NO:1 and sequences hybridizable to SEQ ID NO:1 under low stringency conditions, wherein the nucleic acid contains sequences derived from at least two mammalian sources and causes mammary specific gene expression.
  • the present invention provides a nucleic acid sequence encoding a hybrid bovine/human alpha lactalbumin ( ⁇ LA) promoter/enhancer (i.e., SEQ ID NO:1) and sequences that are hybridizable to a hybrid bovine/human ⁇ -LA promoter under low to high stringency conditions.
  • ⁇ LA hybrid bovine/human alpha lactalbumin
  • these sequences drive the expression of an exogenous gene in the mammary gland of a transgenic animal.
  • the hybridizable sequence comprises human and bovine elements.
  • the present invention provides a vector containing the nucleic acid sequence of hybrid bovine/human ⁇ -LA promoter.
  • the vector is a retroviral vector.
  • the present invention provides a host cell containing a vector containing a hybrid bovine/human ⁇ -LA promoter.
  • the present invention also provides a nucleic acid encoding a mutant RNA export element (PPE element; SEQ ID NO:2) and sequences that are hybridizable to a mutant PPE element.
  • PPE element RNA export element
  • the sequences hybridizable to a mutant PPE element contain ATG sequences that have been mutated at at least one of the positions corresponding to nucleic acid residues 4, 112, 131, and 238 of the wild-type PPE element. In preferred embodiments, these sequences enhance the export from the nucleus of the RNA to which they are operably linked.
  • the present invention provides a vector containing the nucleic acid sequence of the mutant PPE element.
  • the vector is a retroviral vector.
  • the present invention provides a host cell containing a vector that contains a mutant PPE element.
  • the present invention also provides a nucleic acid encoding an IRES coding sequence and a signal peptide coding sequence, wherein said IRES and signal peptide coding sequences are adjacent to one another.
  • the IRES/signal peptide sequence comprises SEQ ID NO:3 or SEQ ID NO:12 and sequences that are hybridizable to these sequences under low stringency conditions.
  • these sequences interact with a ribosome and provide for the secretion of proteins to which they are operably linked.
  • the present invention is not limited to any particular signal sequence peptide. Indeed, it is contemplated that a variety of signal peptides find use in the present invention.
  • the signal peptide sequence is selected from alpha-casein, human growth hormone, or ⁇ -lactalbumin signal peptide sequences.
  • the present invention provides a vector containing the nucleic acid sequence of the IRES/signal peptide sequence.
  • the vector is a retroviral vector.
  • the present invention provides a host cell containing a vector that contains a IRES/signal peptide sequence.
  • the present invention also provides methods for producing a protein of interest.
  • the methods comprise providing a host cell and a vector containing at least one exogenous gene operably linked to a bovine/human hybrid ⁇ -lactalbumin promoter and introducing the vector to the host cell under conditions such that expression of the protein encoded by the exogenous gene is expressed.
  • the vector further contains a mutant RNA export element.
  • the vector contains at least two exogenous genes.
  • the two or more exogenous genes are arranged in a polycistronic sequence separated by an internal ribosome entry site/bovine ⁇ -lactalbumin signal peptide.
  • the present invention also provides methods for expressing at least two proteins in a polycistronic sequence.
  • the proteins are unrelated, while in other embodiments, the proteins are subunits of a multisubunit protein.
  • the present invention provides methods for producing an immunoglobulin including providing a host cell and a vector comprising a first exogenous gene and a second exogenous gene, wherein the first exogenous gene encodes a first immunoglobulin chain and wherein the second exogenous gene encodes a second immunoglobulin chain, and wherein the first and the second genes are separated by an internal ribosome entry site, and introducing the vector to the host cell under conditions such the first immunoglobulin chain and the second immunoglobulin chain encoded by the first and second exogenous genes are expressed.
  • the first immunoglobulin chain is an immunoglobulin light chain (e.g., ⁇ or ⁇ ) and the second immunoglobulin chain is an immunoglobulin heavy chain (e.g., ⁇ , ⁇ , ⁇ , or ⁇ ).
  • the first immunoglobulin chain is an immunoglobulin heavy chain (e.g., ⁇ , ⁇ , ⁇ , ⁇ , or ⁇ ) and the second immunoglobulin chain is an immunoglobulin light chain (e.g., ⁇ or ⁇ ).
  • the vector is a retroviral vector. In other embodiments, the vector further contains a bovine ⁇ -lactalbumin signal peptide.
  • the vector further contains a bovine/human hybrid ⁇ -lactalbumin promoter.
  • the first immunoglobulin chain and the second immunoglobulin chain are expressed at a ratio of about 0.9:1.1 to 1:1.
  • the present invention also provides immunoglobulins produced by the methods described herein. The present invention is not limited to the use of any particular vector. Indeed, it is contemplated that a variety of vectors find use in the present invention, including, but not limited to plasmid and retroviral vectors. In some preferred embodiments, the retroviral vector is pseudotyped.
  • the present invention provides methods of indirectly detecting the expression of a protein of interest comprising providing a host cell transduced or transfected with a vector encoding a polycistronic sequence, wherein the polycistronic sequence comprises a signal protein and a protein of interest operably linked by an IRES, and culturing the host cells under conditions such that the signal protein and protein of interest are produced, wherein the presence of the signal protein indicates the presence of the protein of interest.
  • the methods of the present invention are not limited to the expression of any particular protein of interest. Indeed, the expression of a variety of proteins of interest is contemplated, including, but not limited to, G-protein coupled receptors. The present invention is not limited to the use of any particular signal protein.
  • signal proteins including, but not limited to, immunoglobulin heavy and light chains, beta-galactosidase, beta-lactamase, green fluorescent protein, and luciferase.
  • expression of the signal protein and protein of interest is driven by the same promoter and the signal protein and protein of interest are transcribed as a single transcriptional unit.
  • FIG. 1 is a Western blot of a 15% SDS-PAGE gel run under denaturing conditions and probed with anti-human IgG (Fc) and anti-human IgG (kappa).
  • FIG. 2 is a graph of MN14 expression over time.
  • FIG. 3 is a Western blot of a 15% PAGE run under non-denaturing conditions and probed with anti-human IgG (Fc) and anti-human IgG (Kappa).
  • FIG. 4 provides the sequence for the hybrid human-bovine alpha-lactalbumin promoter (SEQ ID NO:1).
  • FIG. 5 provides the sequence for the mutated PPE sequence (SEQ ID NO:2).
  • FIG. 6 provides the sequence for the IRES-Signal peptide sequence (SEQ ID NO:3).
  • FIGS. 7 a and 7 b provide the sequence for CMV MN14 vector (SEQ ID NO:4).
  • FIGS. 8 a and 8 b provide the sequence for the CMV LL2 vector (SEQ ID NO:5).
  • FIGS. 9 a - c provide the sequence for the MMTV MN14 vector (SEQ ID NO:6).
  • FIGS. 10 a - d provide the sequence for the alpha-lactalbumin MM14 Vector (SEQ ID NO:7).
  • FIGS. 11 a - c provide the sequence for the alpha-lactalbumin Bot vector (SEQ ID NO:8).
  • FIGS. 12 a - b provide the sequence for the LSRNL vector (SEQ ID NO:9).
  • FIGS. 13 a - b provide the sequence for the alpha-lactalbumin cc49IL2 vector (SEQ ID NO:10).
  • FIGS. 14 a - c provides the sequence for the alpha-lactalbumin YP vector (SEQ ID NO:11).
  • FIG. 15 provides the sequence for the IRES-Casein signal peptide sequence (SEQ ID NO:12).
  • FIGS. 16 a - c provide the sequence for the LNBOTDC vector (SEQ ID NO:13).
  • FIGS. 17 a - d provide the sequence of a retroviral vector that expresses a G-Protein coupled receptor and antibody light chain.
  • the term “host cell” refers to any eukaryotic cell (e.g., mammalian cells, avian cells, amphibian cells, plant cells, fish cells, and insect cells), whether located in vitro or in vivo.
  • eukaryotic cell e.g., mammalian cells, avian cells, amphibian cells, plant cells, fish cells, and insect cells
  • cell culture refers to any in vitro culture of cells. Included within this term are continuous cell lines (e.g., with an immortal phenotype), primary cell cultures, finite cell lines (e.g., non-transformed cells), and any other cell population maintained in vitro, including oocytes and embryos.
  • vector refers to any genetic element, such as a plasmid, phage, transposon, cosmid, chromosome, virus, virion, etc., which is capable of replication when associated with the proper control elements and which can transfer gene sequences between cells.
  • vector includes cloning and expression vehicles, as well as viral vectors.
  • integrating vector refers to a vector whose integration or insertion into a nucleic acid (e.g., a chromosome) is accomplished via an integrase.
  • integrating vectors include, but are not limited to, retroviral vectors, transposons, and adeno associated virus vectors.
  • the term “integrated” refers to a vector that is stably inserted into the genome (i.e., into a chromosome) of a host cell.
  • the term “multiplicity of infection” or “MOI” refers to the ratio of integrating vectors:host cells used during transfection or transduction of host cells. For example, if 1,000,000 vectors are used to transduce 100,000 host cells, the multiplicity of infection is 10. The use of this term is not limited to events involving transduction, but instead encompasses introduction of a vector into a host by methods such as lipofection, microinjection, calcium phosphate precipitation, and electroporation.
  • the term “genome” refers to the genetic material (e.g., chromosomes) of an organism.
  • nucleotide sequence of interest refers to any nucleotide sequence (e.g., RNA or DNA), the manipulation of which may be deemed desirable for any reason (e.g., treat disease, confer improved qualities, expression of a protein of interest in a host cell, etc.), by one of ordinary skill in the art.
  • nucleotide sequences include, but are not limited to, coding sequences of structural genes (e.g., reporter genes, selection marker genes, oncogenes, drug resistance genes, growth factors, etc.), and non-coding regulatory sequences which do not encode an mRNA or protein product (e.g., promoter sequence, polyadenylation sequence, termination sequence, enhancer sequence, etc.).
  • protein of interest refers to a protein encoded by a nucleic acid of interest.
  • signal protein refers to a protein that is co-expressed with a protein of interest and which, when detected by a suitable assay, provides indirect evidence of expression of the protein of interest.
  • signal protein useful in the present invention include, but are not limited to, immunoglobulin heavy and light chains, beta-galactosidase, beta-lactamase, green fluorescent protein, and luciferase.
  • exogenous gene refers to a gene that is not naturally present in a host organism or cell, or is artificially introduced into a host organism or cell.
  • the term “gene” refers to a nucleic acid (e.g., DNA or RNA) sequence that comprises coding sequences necessary for the production of a polypeptide or precursor (e.g., proinsulin).
  • the polypeptide can be encoded by a full length coding sequence or by any portion of the coding sequence so long as the desired activity or functional properties (e.g., enzymatic activity, ligand binding, signal transduction, etc.) of the full-length or fragment are retained.
  • the term also encompasses the coding region of a structural gene and includes sequences located adjacent to the coding region on both the 5 40 and 3′ ends for a distance of about 1 kb or more on either end such that the gene corresponds to the length of the full-length mRNA.
  • the sequences that are located 5′ of the coding region and which are present on the mRNA are referred to as 5′ untranslated sequences.
  • the sequences that are located 3′ or downstream of the coding region and which are present on the mRNA are referred to as 3′ untranslated sequences.
  • gene encompasses both cDNA and genomic forms of a gene.
  • a genomic form or clone of a gene contains the coding region interrupted with non-coding sequences termed “introns” or “intervening regions” or “intervening sequences.”
  • Introns are segments of a gene which are transcribed into nuclear RNA (hnRNA); introns may contain regulatory elements such as enhancers. Introns are removed or “spliced out” from the nuclear or primary transcript; introns therefore are absent in the messenger RNA (mRNA) transcript.
  • mRNA messenger RNA
  • RNA expression refers to the process of converting genetic information encoded in a gene into RNA (e.g., mRNA, rRNA, tRNA, or snRNA) through “transcription” of the gene (i.e., via the enzymatic action of an RNA polymerase), and for protein encoding genes, into protein through “translation” of mRNA.
  • Gene expression can be regulated at many stages in the process. “Up-regulation” or “activation” refers to regulation that increases the production of gene expression products (i.e., RNA or protein), while “down-regulation” or “repression” refers to regulation that decrease production. Molecules (e.g., transcription factors) that are involved in up-regulation or down-regulation are often called “activators” and “repressors,” respectively.
  • amino acid sequence is recited herein to refer to an amino acid sequence of a naturally occurring protein molecule
  • amino acid sequence and like terms, such as “polypeptide” or “protein” are not meant to limit the amino acid sequence to the complete, native amino acid sequence associated with the recited protein molecule.
  • nucleic acid molecule encoding refers to the order or sequence of deoxyribonucleotides or ribonucleotides along a strand of deoxyribonucleic acid or ribonucleic acid. The order of these deoxyribonucleotides or ribonucleotides determines the order of amino acids along the polypeptide (protein) chain. The DNA or RNA sequence thus codes for the amino acid sequence.
  • variant when used in reference to a protein, refers to proteins encoded by partially homologous nucleic acids so that the amino acid sequence of the proteins varies.
  • variant encompasses proteins encoded by homologous genes having both conservative and nonconservative amino acid substitutions that do not result in a change in protein function, as well as proteins encoded by homologous genes having amino acid substitutions that cause decreased (e.g., null mutations) protein function or increased protein function.
  • the terms “complementary” or “complementarity” are used in reference to polynucleotides (i.e., a sequence of nucleotides) related by the base-pairing rules. For example, for the sequence “A-G-T,” is complementary to the sequence “T-C-A.”Complementarity may be “partial,” in which only some of the nucleic acids' bases are matched according to the base pairing rules. Or, there may be “complete” or “total” complementarity between the nucleic acids. The degree of complementarity between nucleic acid strands has significant effects on the efficiency and strength of hybridization between nucleic acid strands. This is of particular importance in amplification reactions, as well as detection methods that depend upon binding between nucleic acids.
  • the terms “homology” and “percent identity” when used in relation to nucleic acids refers to a degree of complementarity. There may be partial homology (i.e., partial identity) or complete homology (i.e., complete identity).
  • a partially complementary sequence is one that at least partially inhibits a completely complementary sequence from hybridizing to a target nucleic acid sequence and is referred to using the functional term “substantially homologous.” The inhibition of hybridization of the completely complementary sequence to the target sequence may be examined using a hybridization assay (Southern or Northern blot, solution hybridization and the like) under conditions of low stringency.
  • a substantially homologous sequence or probe i.e., an oligonucleotide which is capable of hybridizing to another oligonucleotide of interest
  • conditions of low stringency are such that non-specific binding is permitted; low stringency conditions require that the binding of two sequences to one another be a specific (i.e., selective) interaction.
  • the absence of non-specific binding may be tested by the use of a second target which lacks even a partial degree of complementarity (e.g., less than about 30% identity); in the absence of non-specific binding the probe will not hybridize to the second non-complementary target.
  • a partial degree of complementarity e.g., less than about 30% identity
  • the art knows conditions that promote hybridization under conditions of high stringency e.g., increasing the temperature of the hybridization and/or wash steps, the use of formamide in the hybridization solution, etc.).
  • substantially homologous refers to any probe that can hybridize to either or both strands of the double-stranded nucleic acid sequence under conditions of low stringency as described above.
  • substantially homologous refers to any probe that can hybridize (i.e., it is the complement of) the single-stranded nucleic acid sequence under conditions of low stringency as described above.
  • hybridization is used in reference to the pairing of complementary nucleic acids. Hybridization and the strength of hybridization (i.e., the strength of the association between the nucleic acids) is impacted by such factors as the degree of complementary between the nucleic acids, stringency of the conditions involved, the T m of the formed hybrid, and the G:C ratio within the nucleic acids. A single molecule that contains pairing of complementary nucleic acids within its structure is said to be “self-hybridized.”
  • T m is used in reference to the “melting temperature” of a nucleic acid.
  • the melting temperature is the temperature at which a population of double-stranded nucleic acid molecules becomes half dissociated into single strands.
  • stringency is used in reference to the conditions of temperature, ionic strength, and the presence of other compounds such as organic solvents, under which nucleic acid hybridizations are conducted. With “high stringency” conditions, nucleic acid base pairing will occur only between nucleic acid fragments that have a high frequency of complementary base sequences. Thus, conditions of “weak” or “low” stringency are often required with nucleic acids that are derived from organisms that are genetically diverse, as the frequency of complementary sequences is usually less.
  • “High stringency conditions” when used in reference to nucleic acid hybridization comprise conditions equivalent to binding or hybridization at 42° C. in a solution consisting of 5 ⁇ SSPE (43.8 g/l NaCl, 6.9 g/l NaH 2 PO 4 .H 2 O and 1.85 g/l EDTA, pH adjusted to 7.4 with NaOH), 0.5% SDS, 5 ⁇ Denhardt's reagent and 100 ⁇ g/ml denatured salmon sperm DNA followed by washing in a solution comprising 0.1 ⁇ SSPE, 1.0% SDS at 42° C. when a probe of about 500 nucleotides in length is employed.
  • “Medium stringency conditions” when used in reference to nucleic acid hybridization comprise conditions equivalent to binding or hybridization at 42° C. in a solution consisting of 5 ⁇ SSPE (43.8 g/l NaCl, 6.9 g/l NaH 2 PO 4 .H 2 O and 1.85 g/l EDTA, pH adjusted to 7.4 with NaOH), 0.5% SDS, 5 ⁇ Denhardt's reagent and 100 ⁇ g/ml denatured salmon sperm DNA followed by washing in a solution comprising 1.0 ⁇ SSPE, 1.0% SDS at 42° C. when a probe of about 500 nucleotides in length is employed.
  • “Low stringency conditions” comprise conditions equivalent to binding or hybridization at 42° C. in a solution consisting of 5 ⁇ SSPE (43.8 g/l NaCl, 6.9 g/l NaH 2 PO 4 .H 2 O and 1.85 g/l EDTA, pH adjusted to 7.4 with NaOH), 0.1% SDS, 5 ⁇ Denhardt's reagent [50 ⁇ Denhardt's contains per 500 ml: 5 g Ficoll (Type 400, Pharamcia), 5 g BSA (Fraction V; Sigma)] and 100 ⁇ g/ml denatured salmon sperm DNA followed by washing in a solution comprising 5 ⁇ SSPE, 0.1% SDS at 42° C. when a probe of about 500 nucleotides in length is employed.
  • 5 ⁇ SSPE 43.8 g/l NaCl, 6.9 g/l NaH 2 PO 4 .H 2 O and 1.85 g/l EDTA, pH adjusted to 7.4 with NaOH
  • a gene may produce multiple RNA species that are generated by differential splicing of the primary RNA transcript.
  • cDNAs that are splice variants of the same gene will contain regions of sequence identity or complete homology (representing the presence of the same exon or portion of the same exon on both cDNAs) and regions of complete non-identity (for example, representing the presence of exon “A” on cDNA 1 wherein cDNA 2 contains exon “B” instead). Because the two cDNAs contain regions of sequence identity they will both hybridize to a probe derived from the entire gene or portions of the gene containing sequences found on both cDNAs; the two splice variants are therefore substantially homologous to such a probe and to each other.
  • operable combination refers to the linkage of nucleic acid sequences in such a manner that a nucleic acid molecule capable of directing the transcription of a given gene and/or the synthesis of a desired protein molecule is produced.
  • operable order refers to the linkage of amino acid sequences in such a manner so that a functional protein is produced.
  • selectable marker refers to a gene that encodes an enzymatic activity that confers the ability to grow in medium lacking what would otherwise be an essential nutrient (e.g. the HIS3 gene in yeast cells); in addition, a selectable marker may confer resistance to an antibiotic or drug upon the cell in which the selectable marker is expressed. Selectable markers may be “dominant”; a dominant selectable marker encodes an enzymatic activity that can be detected in any eukaryotic cell line.
  • dominant selectable markers examples include the bacterial aminoglycoside 3′ phosphotransferase gene (also referred to as the neo gene) that confers resistance to the drug G418 in mammalian cells, the bacterial hygromycin G phosphotransferase (hyg) gene that confers resistance to the antibiotic hygromycin and the bacterial xanthine-guanine phosphoribosyl transferase gene (also referred to as the gpt gene) that confers the ability to grow in the presence of mycophenolic acid.
  • Other selectable markers are not dominant in that their use must be in conjunction with a cell line that lacks the relevant enzyme activity.
  • non-dominant selectable markers include the thymidine kinase (tk) gene that is used in conjunction with tk ⁇ cell lines, the CAD gene which is used in conjunction with CAD-deficient cells and the mammalian hypoxanthine-guanine phosphoribosyl transferase (hprt) gene which is used in conjunction with hprt ⁇ cell lines.
  • tk thymidine kinase
  • CAD CAD-deficient cells
  • hprt mammalian hypoxanthine-guanine phosphoribosyl transferase
  • regulatory element refers to a genetic element which controls some aspect of the expression of nucleic acid sequences.
  • a promoter is a regulatory element that facilitates the initiation of transcription of an operably linked coding region.
  • Other regulatory elements are splicing signals, polyadenylation signals, termination signals, RNA export elements, internal ribosome entry sites, etc. (defined infra).
  • Transcriptional control signals in eukaryotes comprise “promoter” and “enhancer” elements. Promoters and enhancers consist of short arrays of DNA sequences that interact specifically with cellular proteins involved in transcription (Maniatis et al., Science 236:1237 [1987]). Promoter and enhancer elements have been isolated from a variety of eukaryotic sources including genes in yeast, insect and mammalian cells, and viruses (analogous control elements, i.e., promoters, are also found in prokaryotes). The selection of a particular promoter and enhancer depends on what cell type is to be used to express the protein of interest.
  • eukaryotic promoters and enhancers have a broad host range while others are functional in a limited subset of cell types (for review see, Voss et al., Trends Biochem. Sci., 11:287 [1986]; and Maniatis et al., supra).
  • the SV40 early gene enhancer is very active in a wide variety of cell types from many mammalian species and has been widely used for the expression of proteins in mammalian cells (Dijkema et al., EMBO J. 4:761 [1985]).
  • promoter/enhancer elements active in a broad range of mammalian cell types are those from the human elongation factor 1 ⁇ gene (Uetsuki et al., J. Biol. Chem., 264:5791 [1989]; Kim et al., Gene 91:217 [1990]; and Mizushima and Nagata, Nuc. Acids. Res., 18:5322 [1990]) and the long terminal repeats of the Rous sarcoma virus (Gorman et al, Proc. Natl. Acad. Sci. USA 79:6777 [1982]) and the human cytomegalovirus (Boshart et al., Cell 41:521 [1985]).
  • promoter/enhancer denotes a segment of DNA which contains sequences capable of providing both promoter and enhancer functions (i.e., the functions provided by a promoter element and an enhancer element, see above for a discussion of these functions).
  • promoter/promoter may be “endogenous” or “exogenous” or “heterologous.”
  • An “endogenous” enhancer/promoter is one which is naturally linked with a given gene in the genome.
  • an “exogenous” or “heterologous” enhancer/promoter is one which is placed in juxtaposition to a gene by means of genetic manipulation (i.e., molecular biological techniques such as cloning and recombination) such that transcription of that gene is directed by the linked enhancer/promoter.
  • Regulatory elements may be tissue specific or cell specific.
  • tissue specific refers to a regulatory element that is capable of directing selective expression of a nucleotide sequence of interest to a specific type of tissue (e.g., liver) in the relative absence of expression of the same nucleotide sequence of interest in a different type of tissue (e.g., lung).
  • Tissue specificity of a regulatory element may be evaluated by, for example, operably linking a reporter gene to a promoter sequence (which is not tissue-specific) and to the regulatory element to generate a reporter construct, introducing the reporter construct into the genome of an animal such that the reporter construct is integrated into every tissue of the resulting transgenic animal, and detecting the expression of the reporter gene (e.g., detecting mRNA, protein, or the activity of a protein encoded by the reporter gene) in different tissues of the transgenic animal.
  • the detection of a greater level of expression of the reporter gene in one or more tissues relative to the level of expression of the reporter gene in other tissues shows that the regulatory element is “specific” for the tissues in which greater levels of expression are detected.
  • tissue-specific e.g., liver-specific
  • tissue-specific does not require that one tissue have extremely high levels of expression and another tissue have no expression. It is sufficient that expression is greater in one tissue than another.
  • tissue-specific expression is meant to indicate expression in a single tissue type (e.g., liver) with no detectable expression in other tissues.
  • cell type specific refers to a regulatory element which is capable of directing selective expression of a nucleotide sequence of interest in a specific type of cell in the relative absence of expression of the same nucleotide sequence of interest in a different type of cell within the same tissue.
  • cell type specific when applied to a regulatory element also means a regulatory element capable of promoting selective expression of a nucleotide sequence of interest in a region within a single tissue.
  • Cell type specificity of a regulatory element may be assessed using methods well known in the art (e.g., immunohistochemical staining and/or Northern blot analysis). Briefly, for immunohistochemical staining, tissue sections are embedded in paraffin, and paraffin sections are reacted with a primary antibody specific for the polypeptide product encoded by the nucleotide sequence of interest whose expression is regulated by the regulatory element. A labeled (e.g., peroxidase conjugated) secondary antibody specific for the primary antibody is allowed to bind to the sectioned tissue and specific binding detected (e.g., with avidin/biotin) by microscopy.
  • a labeled (e.g., peroxidase conjugated) secondary antibody specific for the primary antibody is allowed to bind to the sectioned tissue and specific binding detected (e.g., with avidin/biotin) by microscopy.
  • RNA is isolated from cells and electrophoresed on agarose gels to fractionate the RNA according to size followed by transfer of the RNA from the gel to a solid support (e.g., nitrocellulose or a nylon membrane).
  • a solid support e.g., nitrocellulose or a nylon membrane.
  • the immobilized RNA is then probed with a labeled oligo-deoxyribonucleotide probe or DNA probe to detect RNA species complementary to the probe used.
  • Northern blots are a standard tool of molecular biologists.
  • promoter refers to a DNA sequence which when ligated to a nucleotide sequence of interest is capable of controlling the transcription of the nucleotide sequence of interest into mRNA.
  • a promoter is typically, though not necessarily, located 5′ (i.e., upstream) of a nucleotide sequence of interest whose transcription into mRNA it controls, and provides a site for specific binding by RNA polymerase and other transcription factors for initiation of transcription.
  • Promoters may be constitutive or regulatable.
  • the term “constitutive” when made in reference to a promoter means that the promoter is capable of directing transcription of an operably linked nucleic acid sequence in the absence of a stimulus (e.g., heat shock, chemicals, etc.).
  • a “regulatable” promoter is one which is capable of directing a level of transcription of an operably linked nucleic acid sequence in the presence of a stimulus (e.g., heat shock, chemicals, etc.) which is different from the level of transcription of the operably linked nucleic acid sequence in the absence of the stimulus.
  • Splicing signals mediate the removal of introns from the primary RNA transcript and consist of a splice donor and acceptor site (Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd ed., Cold Spring Harbor Laboratory Press, New York [1989], pp. 16.7-16.8).
  • a commonly used splice donor and acceptor site is the splice junction from the 16S RNA of SV40.
  • Efficient expression of recombinant DNA sequences in eukaryotic cells requires expression of signals directing the efficient termination and polyadenylation of the resulting transcript. Transcription termination signals are generally found downstream of the polyadenylation signal and are a few hundred nucleotides in length.
  • the term “poly A site” or “poly A sequence” as used herein denotes a DNA sequence that directs both the termination and polyadenylation of the nascent RNA transcript. Efficient polyadenylation of the recombinant transcript is desirable as transcripts lacking a poly A tail are unstable and are rapidly degraded.
  • the poly A signal utilized in an expression vector may be “heterologous” or “endogenous.”
  • An endogenous poly A signal is one that is found naturally at the 3′ end of the coding region of a given gene in the genome.
  • a heterologous poly A signal is one that is isolated from one gene and placed 3′ of another gene.
  • a commonly used heterologous poly A signal is the SV40 poly A signal.
  • the SV40 poly A signal is contained on a 237 bp BamHI/BclI restriction fragment and directs both termination and polyadenylation (Sambrook, supra, at 16.6-16.7).
  • Eukaryotic expression vectors may also contain “viral replicons” or “viral origins of replication.”
  • Viral replicons are viral DNA sequences that allow for the extrachromosomal replication of a vector in a host cell expressing the appropriate replication factors.
  • Vectors that contain either the SV40 or polyoma virus origin of replication replicate to high “copy number” (up to 10 4 copies/cell) in cells that express the appropriate viral T antigen.
  • Vectors that contain the replicons from bovine papillomavirus or Epstein-Barr virus replicate extrachromosomally at “low copy number” ( ⁇ 100 copies/cell).
  • long terminal repeat of “LTR” refers to transcriptional control elements located in or isolated from the U3 region 5′ and 3′ of a retroviral genome. As is known in the art, long terminal repeats may be used as control elements in retroviral vectors, or isolated from the retroviral genome and used to control expression from other types of vectors.
  • secretion signal refers to any DNA sequence which when operably linked to a recombinant DNA sequence encodes a signal peptide which is capable of causing the secretion of the recombinant polypeptide.
  • the signal peptides comprise a series of about 15 to 30 hydrophobic amino acid residues (See, e.g., Zwizinski et al., J. Biol. Chem. 255(16): 7973-77 [1980], Gray et al., Gene 39(2): 247-54 [1985], and Martial et al., Science 205: 602-607 [1979]).
  • secretion signal sequences are preferably derived from genes encoding polypeptides secreted from the cell type targeted for tissue-specific expression (e.g., secreted milk proteins for expression in and secretion from mammary secretory cells).
  • Secretory DNA sequences are not limited to such sequences.
  • Secretory DNA sequences from proteins secreted from many cell types and organisms may also be used (e.g., the secretion signals for t-PA, serum albumin, lactoferrin, and growth hormone, and secretion signals from microbial genes encoding secreted polypeptides such as from yeast, filamentous fungi, and bacteria).
  • RNA export element or “Pre-mRNA Processing Enhancer (PPE)” refer to 3′ and 5′ cis-acting post-transcriptional regulatory elements that enhance export of RNA from the nucleus.
  • PPE elements include, but are not limited to Mertz sequences (described in U.S. Pat. Nos. 5,914,267 and 5,686,120, all of which are incorporated herein by reference) and woodchuck mRNA processing enhancer (WPRE; WO99/14310 and U.S. Pat. No. 6,136,597, each of which is incorporated herein by reference).
  • polycistronic refers to an mRNA encoding more than polypeptide chain (See, e.g., WO 93/03143, WO 88/05486, and European Pat. No. 117058, all of which are incorporated herein by reference).
  • arranged in polycistronic sequence refers to the arrangement of genes encoding two different polypeptide chains in a single mRNA.
  • internal ribosome entry site refers to a sequence located between polycistronic genes that permits the production of the expression product originating from the second gene by internal initiation of the translation of the dicistronic mRNA.
  • Examples of internal ribosome entry sites include, but are not limited to, those derived from foot and mouth disease virus (FDV), encephalomyocarditis virus, poliovirus and RDV (Scheper et al, Biochem. 76: 801-809 [1994]; Meyer et al., J. Virol. 69: 2819-2824 [1995]; Jang et al., 1988, J. Virol.
  • FDV foot and mouth disease virus
  • poliovirus poliovirus
  • Vectors incorporating IRES's may be assembled as is known in the art.
  • a retroviral vector containing a polycistronic sequence may contain the following elements in operable association: nucleotide polylinker, gene of interest, an internal ribosome entry site and a mammalian selectable marker or another gene of interest.
  • the polycistronic cassette is situated within the retroviral vector between the 5′ LTR and the 3′ LTR at a position such that transcription from the 5′ LTR promoter transcribes the polycistronic message cassette.
  • the transcription of the polycistronic message cassette may also be driven by an internal promoter (e.g., cytomegalovirus promoter) or an inducible promoter, which may be preferable depending on the use.
  • the polycistronic message cassette can further comprise a cDNA or genomic DNA (gDNA) sequence operatively associated within the polylinker.
  • Any mammalian selectable marker can be utilized as the polycistronic message cassette mammalian selectable marker.
  • Such mammalian selectable markers are well known to those of skill in the art and can include, but are not limited to, kanamycin/G418, hygromycin B or mycophenolic acid resistance markers.
  • the term “retrovirus” refers to a retroviral particle which is capable of entering a cell (i.e., the particle contains a membrane-associated protein such as an envelope protein or a viral G glycoprotein which can bind to the host cell surface and facilitate entry of the viral particle into the cytoplasm of the host cell) and integrating the retroviral genome (as a double-stranded provirus) into the genome of the host cell.
  • a membrane-associated protein such as an envelope protein or a viral G glycoprotein which can bind to the host cell surface and facilitate entry of the viral particle into the cytoplasm of the host cell
  • retroviral genome as a double-stranded provirus
  • the term “retrovirus” encompasses Oncovirinae (e.g., Moloney murine leukemia virus (MoMOLV), Moloney murine sarcoma virus (MoMSV), and Mouse mammary tumor virus (MMTV), Spumavirinae, amd Lentivirinae (e.g., Human immunodeficiency virus, Simian immunodeficiency virus, Equine infection anemia virus, and Caprine arthritis-encephalitis virus; See, e.g., U.S. Pat. Nos. 5,994,136 and 6,013,516, both of which are incorporated herein by reference).
  • Oncovirinae e.g., Moloney murine leukemia virus (MoMOLV), Moloney murine sarcoma virus (MoMSV), and Mouse mammary tumor virus (MMTV)
  • Spumavirinae e.g., Amd Lentivirinae (e.g., Human immunodeficiency
  • retroviral vector refers to a retrovirus that has been modified to express a gene of interest. Retroviral vectors can be used to transfer genes efficiently into host cells by exploiting the viral infectious process. Foreign or heterologous genes cloned (i.e., inserted using molecular biological techniques) into the retroviral genome can be delivered efficiently to host cells which are susceptible to infection by the retrovirus. Through well known genetic manipulations, the replicative capacity of the retroviral genome can be destroyed. The resulting replication-defective vectors can be used to introduce new genetic material to a cell but they are unable to replicate. A helper virus or packaging cell line can be used to permit vector particle assembly and egress from the cell.
  • retroviral vectors comprise a replication-deficient retroviral genome containing a nucleic acid sequence encoding at least one gene of interest (i.e., a polycistronic nucleic acid sequence can encode more than one gene of interest), a 5′ retroviral long terminal repeat (5′ LTR); and a 3′ retroviral long terminal repeat (3′ LTR).
  • a nucleic acid sequence encoding at least one gene of interest (i.e., a polycistronic nucleic acid sequence can encode more than one gene of interest)
  • 5′ LTR 5′ retroviral long terminal repeat
  • 3′ retroviral long terminal repeat 3′ retroviral long terminal repeat
  • the term “pseudotyped retroviral vector” refers to a retroviral vector containing a heterologous membrane protein.
  • membrane-associated protein refers to a protein (e.g., a viral envelope glycoprotein or the G proteins of viruses in the Rhabdoviridae family such as VSV, Piry, Chandipura and Mokola) which are associated with the membrane surrounding a viral particle; these membrane-associated proteins mediate the entry of the viral particle into the host cell.
  • the membrane associated protein may bind to specific cell surface protein receptors, as is the case for retroviral envelope proteins or the membrane-associated protein may interact with a phospholipid component of the plasma membrane of the host cell, as is the case for the G proteins derived from members of the Rhabdoviridae family.
  • heterologous membrane-associated protein refers to a membrane-associated protein which is derived from a virus which is not a member of the same viral class or family as that from which the nucleocapsid protein of the vector particle is derived.
  • Virtual class or family refers to the taxonomic rank of class or family, as assigned by the International Committee on Taxonomy of Viruses.
  • the term “Rhabdoviridae” refers to a family of enveloped RNA viruses that infect animals, including humans, and plants.
  • the Rhabdoviridae family encompasses the genus Vesiculovirus which includes vesicular stomatitis virus (VSV), Cocal virus, Piry virus, Chandipura virus, and Spring viremia of carp virus (sequences encoding the Spring viremia of carp virus are available under GenBank accession number U18101).
  • the G proteins of viruses in the Vesiculovirus genera are virally-encoded integral membrane proteins that form externally projecting homotrimeric spike glycoproteins complexes that are required for receptor binding and membrane fusion.
  • the G proteins of viruses in the Vesiculovirus genera have a covalently bound palmititic acid (C 16 ) moiety.
  • the amino acid sequences of the G proteins from the Vesiculoviruses are fairly well conserved.
  • the Piry virus G protein share about 38% identity and about 55% similarity with the VSV G proteins (several strains of VSV are known, e.g., Indiana, New Jersey, Orsay, San Juan, etc., and their G proteins are highly homologous).
  • the Chandipura virus G protein and the VSV G proteins share about 37% identity and 52% similarity.
  • the G proteins from non-VSV Vesiculoviruses may be used in place of the VSV G protein for the pseudotyping of viral particles.
  • the G proteins of the Lyssa viruses also share a fair degree of conservation with the VSV G proteins and function in a similar manner (e.g., mediate fusion of membranes) and therefore may be used in place of the VSV G protein for the pseudotyping of viral particles.
  • the Lyssa viruses include the Mokola virus and the Rabies viruses (several strains of Rabies virus are known and their G proteins have been cloned and sequenced).
  • the Mokola virus G protein shares stretches of homology (particularly over the extracellular and transmembrane domains) with the VSV G proteins which show about 31% identity and 48% similarity with the VSV G proteins.
  • Preferred G proteins share at least 25% identity, preferably at least 30% identity and most preferably at least 35% identity with the VSV G proteins.
  • the VSV G protein from which New Jersey strain (the sequence of this G protein is provided in GenBank accession numbers M27165 and M21557) is employed as the reference VSV G protein.
  • lentivirus vector refers to retroviral vectors derived from the Lentiviridae family (e.g., human immunodeficiency virus, simian immunodeficiency virus, equine infectious anemia virus, and caprine arthritis-encephalitis virus) that are capable of integrating into non-dividing cells (See, e.g., U.S. Pat. Nos. 5,994,136 and 6,013,516, both of which are incorporated herein by reference).
  • Lentiviridae family e.g., human immunodeficiency virus, simian immunodeficiency virus, equine infectious anemia virus, and caprine arthritis-encephalitis virus
  • lentivirus vector refers to lentivirus vector containing a heterologous membrane protein (e.g., a viral envelope glycoprotein or the G proteins of viruses in the Rhabdoviridae family such as VSV, Piry, Chandipura and Mokola).
  • a heterologous membrane protein e.g., a viral envelope glycoprotein or the G proteins of viruses in the Rhabdoviridae family such as VSV, Piry, Chandipura and Mokola.
  • transposon refers to transposable elements (e.g., Tn5, Tn7, and Tn10) that can move or transpose from one position to another in a genome. In general, the transposition is controlled by a transposase.
  • transposon vector refers to a vector encoding a nucleic acid of interest flanked by the terminal ends of transposon. Examples of transposon vectors include, but are not limited to, those described in U.S. Pat. Nos. 6,027,722; 5,958,775; 5,968,785; 5,965,443; and 5,719,055, all of which are incorporated herein by reference.
  • AAV vector refers to a vector derived from an adeno-associated virus serotype, including without limitation, AAV-1, AAV-2, AAV-3, AAV-4, AAV-5, AAVX7, etc.
  • AAV vectors can have one or more of the AAV wild-type genes deleted in whole or part, preferably the rep and/or cap genes, but retain functional flanking ITR sequences.
  • AAV vectors can be constructed using recombinant techniques that are known in the art to include one or more heterologous nucleotide sequences flanked on both ends (5′ and 3′) with functional AAV ITRs.
  • an AAV vector can include at least one AAV ITR and a suitable promoter sequence positioned upstream of the heterologous nucleotide sequence and at least one AAV ITR positioned downstream of the heterologous sequence.
  • a “recombinant AAV vector plasmid” refers to one type of recombinant AAV vector wherein the vector comprises a plasmid.
  • 5′ and 3′ ITRs flank the selected heterologous nucleotide sequence.
  • AAV vectors can also include transcription sequences such as polyadenylation sites, as well as selectable markers or reporter genes, enhancer sequences, and other control elements which allow for the induction of transcription. Such control elements are described above.
  • AAV virion refers to a complete virus particle.
  • An AAV virion may be a wild type AAV virus particle (comprising a linear, single-stranded AAV nucleic acid genome associated with an AAV capsid, i.e., a protein coat), or a recombinant AAV virus particle (described below).
  • single-stranded AAV nucleic acid molecules can be packaged into an AAV virion; both the sense and the antisense strands are equally infectious.
  • recombinant AAV virion or “rAAV” is defined as an infectious, replication-defective virus composed of an AAV protein shell encapsulating (i.e., surrounding with a protein coat) a heterologous nucleotide sequence, which in turn is flanked 5′ and 3′ by AV. ITRs.
  • a number of techniques for constructing recombinant AV. virions are known in the art (See, e.g., U.S. Pat. No. 5,173,414; WO 92/01070; WO 93/03769; Lebkowski et al., Molec. Cell. Biol.
  • Suitable nucleotide sequences for use in AAV vectors include any functionally relevant nucleotide sequence.
  • the AAV vectors of the present invention can comprise any desired gene that encodes a protein that is defective or missing from a target cell genome or that encodes a non-native protein having a desired biological or therapeutic effect (e.g., an antiviral function), or the sequence can correspond to a molecule having an antisense or ribozyme function.
  • Suitable genes include those used for the treatment of inflammatory diseases, autoimmune, chronic and infectious diseases, including such disorders as AIDS, cancer, neurological diseases, cardiovascular disease, hypercholestemia; various blood disorders including various anemias, thalasemias and hemophilia; genetic defects such as cystic fibrosis, Gaucher's Disease, adenosine deaminase (ADA) deficiency, emphysema, etc.
  • a number of antisense oligonucleotides e.g., short oligonucleotides complementary to sequences around the translational initiation site (AUG codon) of an mRNA
  • AAV ITRs adeno-associated virus inverted terminal repeats
  • flanking AAV ITRs are positioned 5′ and 3′ of one or more selected heterologous nucleotide sequences and, together with the rep coding region or the Rep expression product, provide for the integration of the selected sequences into the genome of a target cell.
  • AAV ITR regions The nucleotide sequences of AAV ITR regions are known (See, e.g., Kotin, Human Gene Therapy 5:793-801 [1994]; Bems, K. I. “Parvoviridae and their Replication” in Fundamental Virology, 2nd Edition, (B. N. Fields and D. M. Knipe, eds.) for the AAV-2 sequence.
  • an “AAV ITR” need not have the wild-type nucleotide sequence depicted, but may be altered, e.g., by the insertion, deletion or substitution of nucleotides.
  • the AAV ITR may be derived from any of several AAV serotypes, including without limitation, AAV-1, AAV-2, AAV-3, AAV-4, AAV-5, AAVX7, etc.
  • the 5′ and 3′ ITRs which flank a selected heterologous nucleotide sequence need not necessarily be identical or derived from the same AAV serotype or isolate, so long as they function as intended, i.e., to allow for the integration of the associated heterologous sequence into the target cell genome when the rep gene is present (either on the same or on a different vector), or when the Rep expression product is present in the target cell.
  • in vitro refers to an artificial environment and to processes or reactions that occur within an artificial environment.
  • In vitro environments can consist of, but are not limited to, test tubes and cell cultures.
  • in vivo refers to the natural environment (e.g., an animal or a cell) and to processes or reaction that occur within a natural environment.
  • clonally derived refers to a cell line that it derived from a single cell.
  • non-clonally derived refers to a cell line that is derived from more than one cell.
  • the term “passage” refers to the process of diluting a culture of cells that has grown to a particular density or confluency (e.g., 70% or 80% confluent), and then allowing the diluted cells to regrow to the particular density or confluency desired (e.g., by replating the cells or establishing a new roller bottle culture with the cells.
  • a particular density or confluency e.g. 70% or 80% confluent
  • stable when used in reference to genome, refers to the stable maintenance of the information content of the genome from one generation to the next, or, in the particular case of a cell line, from one passage to the next. Accordingly, a genome is considered to be stable if no gross changes occur in the genome (e.g., a gene is deleted or a chromosomal translocation occurs). The term “stable” does not exclude subtle changes that may occur to the genome such as point mutations.
  • response when used in reference to an assay, refers to the generation of a detectable signal (e.g., accumulation of reporter protein, increase in ion concentration, accumulation of a detectable chemical product).
  • a detectable signal e.g., accumulation of reporter protein, increase in ion concentration, accumulation of a detectable chemical product.
  • membrane receptor protein refers to membrane spanning proteins that bind a ligand (e.g., a hormone or neurotransmitter).
  • protein phosphorylation is a common regulatory mechanism used by cells to selectively modify proteins carrying regulatory signals from outside the cell to the nucleus.
  • the proteins that execute these biochemical modifications are a group of enzymes known as protein kinases. They may further be defined by the substrate residue that they target for phosphorylation.
  • protein kinases are the tyrosine kinases (TKs) which selectively phosphorylate a target protein on its tyrosine residues.
  • tyrosine kinases are membrane-bound receptors (RTKs), and, upon activation by a ligand, can autophosphorylate as well as modify substrates.
  • RTKs membrane-bound receptors
  • the initiation of sequential phosphorylation by ligand stimulation is a paradigm that underlies the action of such effectors as, for example, epidermal growth factor (EGF), insulin, platelet-derived growth factor (PDGF), and fibroblast growth factor (FGF).
  • EGF epidermal growth factor
  • PDGF platelet-derived growth factor
  • FGF fibroblast growth factor
  • the receptors for these ligands are tyrosine kinases and provide the interface between the binding of a ligand (hormone, growth factor) to a target cell and the transmission of a signal into the cell by the activation of one or more biochemical pathways.
  • Tyrosine kinases can also be cytoplasmic, non-receptor-type enzymes and act as a downstream component of a signal transduction pathway.
  • signal transduction protein refers to a proteins that are activated or otherwise effected by ligand binding to a membrane receptor protein or some other stimulus.
  • Examples of signal transduction protein include adenyl cyclase, phospholipase C, and G-proteins.
  • Many membrane receptor proteins are coupled to G-proteins (i.e., G-protein coupled receptors (GPCRs); for a review, see Neer, 1995, Cell 80:249-257 [1995]).
  • GPCRs G-protein coupled receptors
  • GPCRs G-protein coupled receptors
  • Putative GPCRs can be identified on the basis of sequence homology to known GPCRs.
  • GPCRs mediate signal transduction across a cell membrane upon the binding of a ligand to an extracellular portion of a GPCR.
  • the intracellular portion of a GPCR interacts with a G-protein to modulate signal transduction from outside to inside a cell.
  • a GPCR is therefore said to be “coupled” to a G-protein.
  • G-proteins are composed of three polypeptide subunits: an ⁇ subunit, which binds and hydrolyses GTP, and a dimeric ⁇ subunit. In the basal, inactive state, the G-protein exists as a heterotrimer of the ⁇ and ⁇ subunits.
  • GDP guanosine diphosphate
  • GTP guanine triphosphate
  • active G ⁇ subunit disassociates from both the receptor and the dimeric ⁇ subunit.
  • the disassociated, active G ⁇ subunit transduces signals to effectors that are “downstream” in the G-protein signalling pathway within the cell.
  • the G-protein's endogenous GTPase activity returns active G subunit to its inactive state, in which it is associated with GDP and the dimeric ⁇ subunit.
  • G ⁇ s Numerous members of the heterotrimeric G-protein family have been cloned, including more than 20 genes encoding various G ⁇ subunits.
  • the various G subunits have been categorized into four families, on the basis of amino acid sequences and functional homology. These four families are termed G ⁇ s , G ⁇ i , G ⁇ q , and G ⁇ 12 . Functionally, these four families differ with respect to the intracellular signaling pathways that they activate and the GPCR to which they couple.
  • GPCRs normally couple with G ⁇ s and, through G ⁇ s , these GPCRs stimulate adenylyl cyclase activity.
  • Other GPCRs normally couple with GG ⁇ q , and through GG ⁇ q , these GPCRs can activate phospholipase C (PLC), such as the ⁇ isoform of phospholipase C (i.e., PLC ⁇ , Stermweis and Smrcka, Trends in Biochem. Sci. 17:502-506 [1992]).
  • PLC phospholipase C
  • Immunoglobulin refers to proteins which bind a specific antigen. Immunoglobulins include, but are not limited to, polyclonal, monoclonal, chimeric, and humanized antibodies, Fab fragments, F(ab′)2 fragments, and includes immunoglobulins of the following classes: IgG, IgA, IgM, IgD, IbE, and secreted immunoglobulins (sIg). Immunoglobulins generally comprise two identical heavy chains ( ⁇ , ⁇ , ⁇ , ⁇ , or ⁇ ) and two light chains ( ⁇ or ⁇ ).
  • antigen binding protein refers to proteins which bind to a specific antigen.
  • Antigen binding proteins include, but are not limited to, immunoglobulins, including polyclonal, monoclonal, chimeric, and humanized antibodies; Fab fragments, F(ab′)2 fragments, and Fab expression libraries; and single chain antibodies.
  • immunoglobulins including polyclonal, monoclonal, chimeric, and humanized antibodies; Fab fragments, F(ab′)2 fragments, and Fab expression libraries; and single chain antibodies.
  • Fab fragments fragments, F(ab′)2 fragments, and Fab expression libraries
  • single chain antibodies single chain antibodies.
  • Various procedures known in the art are used for the production of polyclonal antibodies.
  • various host animals can be immunized by injection with the peptide corresponding to the desired epitope including but not limited to rabbits, mice, rats, sheep, goats, etc.
  • the peptide is conjugated to an immunogenic carrier (e.g., diphtheria toxoid, bovine serum albumin (BSA), or keyhole limpet hemocyanin (KLH)).
  • an immunogenic carrier e.g., diphtheria toxoid, bovine serum albumin (BSA), or keyhole limpet hemocyanin (KLH)
  • BSA bovine serum albumin
  • KLH keyhole limpet hemocyanin
  • Various adjuvants are used to increase the immunological response, depending on the host species, including but not limited to Freund's (complete and incomplete), mineral gels such as aluminum hydroxide, surface active substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole limpet hemocyanins, dinitrophenol, and potentially useful human adjuvants such as BCG (Bacille Calmette-Guerin) and Corynebacterium parvum.
  • BCG
  • any technique that provides for the production of antibody molecules by continuous cell lines in culture may be used (See, e.g., Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.). These include, but are not limited to, the hybridoma technique originally developed by Köhler and Milstein (Köhler and Milstein, Nature 256:495-497 [1975]), as well as the trioma technique, the human B-cell hybridoma technique (See e.g., Kozbor et al. Immunol.
  • Antibody fragments that contain the idiotype (antigen binding region) of the antibody molecule can be generated by known techniques.
  • fragments include but are not limited to: the F(ab′)2 fragment that can be produced by pepsin digestion of an antibody molecule; the Fab′ fragments that can be generated by reducing the disulfide bridges of an F(ab′)2 fragment, and the Fab fragments that can be generated by treating an antibody molecule with papain and a reducing agent.
  • Genes encoding antigen binding proteins can be isolated by methods known in the art. In the production of antibodies, screening for the desired antibody can be accomplished by techniques known in the art (e.g., radioimmunoassay, ELISA (enzyme-linked immunosorbant assay), “sandwich” immunoassays, immunoradiometric assays, gel diffusion precipitin reactions, immunodiffusion assays, in situ immunoassays (using colloidal gold, enzyme or radioisotope labels, for example), Western Blots, precipitation reactions, agglutination assays (e.g., gel agglutination assays, hemagglutination assays, etc.), complement fixation assays, immunofluorescence assays, protein A assays, and immunoelectrophoresis assays, etc.) etc.
  • radioimmunoassay e.g., ELISA (enzyme-linked immunosorbant assay), “sandwich” immunoas
  • reporter gene refers to a gene encoding a protein that may be assayed.
  • reporter genes include, but are not limited to, luciferase (See, e.g., deWet et al., Mol. Cell. Biol. 7:725 [1987] and U.S.
  • purified refers to molecules, either nucleic or amino acid sequences, that are removed from their natural environment, isolated or separated.
  • An “isolated nucleic acid sequence” is therefore a purified nucleic acid sequence.
  • substantially purified molecules are at least 60% free, preferably at least 75% free, and more preferably at least 90% free from other components with which they are naturally associated.
  • the present invention provides novel regulatory sequences for use in expression vectors.
  • the present invention provides retroviral expression vectors containing novel regulatory elements.
  • the present invention provides methods for expressing proteins of interest in host cells.
  • the present invention provides methods for expressing two chains of a multisubunit protein (e.g., a heavy chain and a light chain of an immunoglobulin or the subunits of follicle stimulating hormone) in a nearly equal ratio. These methods take advantage of the novel regulatory sequences and vectors of the present invention to solve problems in the prior art.
  • the retroviral vectors of the present invention include the following elements in operable association: a) a 5′ LTR; b) a packaging signal; c) a 3′ LTR, and d) a nucleic acid encoding a protein of interest located between the 5′ and 3′ LTRs.
  • novel compositions including, but not limited to those described below are included in expression vectors in order to aid in the expression, secretion and purification of proteins of interest.
  • novel elements are described in more detail below: bovine/human hybrid alpha-lactalbumin ( ⁇ -LA) promoter (A); mutant RNA export element (B); and internal ribosome entry site (C).
  • the present invention provides a hybrid ⁇ -lactalbumin ( ⁇ -LA) promoter.
  • ⁇ -LA ⁇ -lactalbumin
  • the hybrid promoter may be constructed from portions of any two or more mammalian ⁇ -lactalbumin promoters (e.g., human, bovine, goat, sheep, rabbit, or mouse ⁇ -lactalbumin promoters among others; see, e.g., GenBank Accession numbers AF124257; AF123893; AX067504; Soulier et al., Transgenic Res. 8(1):23-31 (1999); McKee et al., Nat. Biotech. 16(7):647-51 (1998); Lubon et al., Biochem. J.
  • the portion of at least one of the promoters contributing to the hybrid is at least 50 nucleotides in length, while in preferred embodiments, the portion of at least one of the promoters contributing to the hybrid is at least 100 nucleotides in length, while in particularly preferred embodiments, the portion of at least one of the promoters contributing to the hybrid is at least 500 nucleotides length, with the portion of the at least one other promoter contributing to the hybrid being of similar or longer length.
  • the hybrid promoters can be assayed for functionality by operably linking the promoter to a reporter gene such as beta-galactosidase, green fluorescent protein, or luciferase, creating a transgenic animal such as transgenic mouse or bovine that comprises the resulting construct, and assaying various tissues of the resulting transgenic animal to determine the specificity of expression from the hybrid promoter.
  • a reporter gene such as beta-galactosidase, green fluorescent protein, or luciferase
  • expression from the hybrid promoter is substantially specific to the mammary gland, and in particular to mammary epithelial cells, with no or only trace levels of expression of in other tissues.
  • the hybrid promoter is a bovine/human hybrid ⁇ -lactalbumin ( ⁇ -LA) promoter (SEQ ID NO: 1).
  • the human portion of the promoter was derived from human genomic DNA and contains bases from +15 relative to the transcription start point to ⁇ 600 relative to the transcription start point.
  • the bovine portion is attached to the end of the human portion and corresponds to bases ⁇ 550 to ⁇ 2000 relative to the transcription start point.
  • the hybrid promoter preferably used in the present invention utilizes a region of the human promoter that contained an internal poly-adenylation signal.
  • the internal poly-adenylation signal was removed by mutation.
  • the mutation was at base 2012 and involved a change from A to T.
  • the present invention is not limited to any particular mechanism of action. Indeed, an understanding of the mechanism is not required to practice the present invention. Nevertheless, it is contemplated the removal of poly-adenylation signals improves retroviral RNA production by eliminating premature mRNA termination problems.
  • additional enhancer regions exist in the human, but not the bovine sequence.
  • the hybrid promoter was constructed to take advantage of these additional sequences. Likewise, the hybrid promoter contains bovine elements that may or may not be found in the human promoter.
  • the present invention comprises a mutant RNA export element (pre-mRNA processing element (PPE), Mertz sequence, or WPRE; See, e.g., U.S. Pat. Nos. 5,914,267 and 5,686,120 and PCT Publication WO99/14310, all of which are incorporated herein by reference).
  • PPE pre-mRNA processing element
  • WPRE Wideband RNA processing element
  • the present invention is not limited to any particular mechanism of action. Indeed, an understanding of the mechanism is not required to practice the present invention. Nevertheless, it is contemplated that the use of RNA export elements allows or facilitates high levels of expression of the protein of interest without incorporating splice signals or introns in the nucleic acid sequence encoding the protein of interest.
  • a mutated PPE element is utilized.
  • the PPE sequence is mutated to remove internal ATG sequences.
  • the present invention is not limited to any particular mechanism of action. Indeed, an understanding of the mechanism is not required to practice the present invention. Nevertheless, it is contemplated that the removal of internal start sequences prevents potential unwanted translation initiation.
  • bases 4, 112, 131, and 238 of SEQ ID NO: 2 were changed from a G to a T. In all cases, these changes resulted in and ATG start codon being mutated to an ATT codon.
  • the mutated PPE sequence is placed in the 5′ untranslated region (UTR) of the mRNA encoding the gene of interest. In other embodiments, the mutated PPE sequence is placed in the 3′ UTR of the mRNA encoding the gene of interest. In some preferred embodiments, two mutated PPE sequences separated by a linker are placed in a head to tail array (See e.g., SEQ ID NO:2). It has been shown that two copies of the sequence cause a more dramatic effect on mRNA export. In other embodiments, 2-20 copies of the mutated PPE sequence are placed in the mRNA encoding the gene of interest.
  • the present invention comprises an internal ribosome entry site (IRES)/signal peptide sequence (e.g., SEQ ID NOs:3 and 12).
  • IRES internal ribosome entry site
  • signal sequences include those from ⁇ -lactalbumin, casein, tissue plasminogen activator, serum albumin, lactoferrin, and lactoferrin (See, e.g., Zwizinski et al., J. Biol. Chem.
  • Such secretion signal sequences are preferably derived from genes encoding polypeptides secreted from the cell type targeted for tissue-specific expression (e.g., secreted milk proteins for expression in and secretion from mammary secretory cells).
  • Suitable IRES sequences include, but are not limited, to those derived from foot and mouth disease virus (FDV), encephalomyocarditis virus, poliovirus and RDV (Scheper et al., Biochem.
  • Functional IRES/signal peptide sequences may be identified by operably linking two genes with the sequence and an appropriate promoter, transfecting a host cell with the construct, and assaying the host cell for production the proteins encoding by the two genes. Suitable genes, vector constructs, and host cells for such screening are provided in the examples.
  • the coding sequences for the IRES and signal peptide are adjacent to one another, with no intervening coding sequences (i.e., that may be separated by noncoding sequences in some instances).
  • the present invention is not limited to any particular mechanism of action. Indeed, an understanding of the mechanism is not required to practice the present invention.
  • the IRES allows translation of the gene to start at the IRES sequence, thereby resulting in the expression of two genes of interest in the same construct.
  • the bovine ⁇ -lactalbumin signal peptide or casein signal peptide causes extracellular secretion of expressed protein products.
  • the initial ATG of the signal peptide is attached to the IRES in order to allow the most efficient translation initiation from the IRES.
  • the second codon of the signal peptide is mutated from an ATG to a GCC, changing the second amino acid of the ⁇ -lactalbumin signal peptide from a methionine to an alanine.
  • the present invention is not limited to any particular mechanism of action. Indeed, an understanding of the mechanism is not required to practice the present invention. Nevertheless, it is contemplated that this mutation facilitates more efficient translation initiation by the IRES.
  • the (IRES)/signal peptide is inserted into a vector between two genes of interest.
  • the (IRES)/signal peptide creates a second translation initiation site, allowing for the expression of two polypeptides from the same expression vector.
  • a single transcript is produced that encodes two different polypeptides (e.g., the heavy and light chains of an immunoglobulin).
  • the signal peptide is derived from ⁇ -lactalbumin.
  • the present invention comprises an internal ribosome entry site (IRES)/modified bovine ⁇ -S1 Casein signal peptide fusion protein (SEQ ID NO:12).
  • IRES internal ribosome entry site
  • SEQ ID NO:12 modified bovine ⁇ -S1 Casein signal peptide fusion protein
  • the present invention is not limited to any particular mechanism of action. Indeed, an understanding of the mechanism is not required to practice the present invention.
  • the IRES allows translation of the gene to start at the IRES sequence, allowing the expression of two genes of interest in the same construct.
  • the bovine ⁇ S1 casein signal peptide causes secretion of expressed protein products.
  • the second codon of the bovine ⁇ -S1 casein signal peptide is mutated from a AAA to a GCC.
  • the mutation results in the second codon of the signal peptide being changed from an alanine to a lysine.
  • the third codon of the signal peptide is mutated from a CTT to a TTG, a change which does not result and an amino acid substitution.
  • the present invention is not limited to any particular mechanism of action. Indeed, an understanding of the mechanism is not required to practice the present invention. Nevertheless, it is contemplated that this mutation allows more efficient translation initiation by the IRES.
  • the present invention comprises retroviral expression vectors.
  • Retroviruses family Retroviridae
  • retroviruses are generally divided into three groups: the spumaviruses (e.g., human foamy virus); the lentiviruses (e.g., human immunodeficiency virus and sheep visna virus), and the oncoviruses (e.g., MLV and Rous sarcoma virus).
  • Retroviruses are enveloped (i.e., surrounded by a host cell-derived lipid bilayer membrane) single-stranded RNA viruses which infect animal cells.
  • a retrovirus infects a cell, its RNA genome is converted into a double-stranded linear DNA form (i.e., it is reverse transcribed).
  • the DNA form of the virus is then integrated into the host cell genome as a provirus.
  • the provirus serves as a template for the production of additional viral genomes and viral mRNAs. Mature viral particles containing two copies of genomic RNA bud from the surface of the infected cell.
  • the viral particle comprises the genomic RNA, reverse transcriptase and other pol gene products inside the viral capsid (containing the viral gag gene products) which is surrounded by a lipid bilayer membrane derived from the host cell containing the viral envelope glycoproteins (also referred to as membrane-associated proteins).
  • the gene of interest is inserted into a retroviral vector which contains the sequences necessary for the efficient expression of the gene of interest (including promoter and/or enhancer elements which may be provided by the viral long terminal repeats (LTRs) or by an internal promoter/enhancer and relevant splicing signals), sequences required for the efficient packaging of the viral RNA into infectious virions (e.g., the packaging signal (Psi), the tRNA primer binding site ( ⁇ PBS), the 3′ regulatory sequences required for reverse transcription (+PBS)) and the viral LTRs.
  • LTRs viral long terminal repeats
  • Psi packaging signal
  • ⁇ PBS tRNA primer binding site
  • (+PBS reverse transcription
  • the LTRs contain sequences required for the association of viral genomic RNA, reverse transcriptase and integrase functions, and sequences involved in directing the expression of the genomic RNA to be packaged in viral particles.
  • many recombinant retroviral vectors lack functional copies of the genes which are essential for viral replication (these essential genes are either deleted or disabled); therefore, the resulting virus is said to be “replication defective”.
  • the vector DNA is introduced into a packaging cell line.
  • Packaging cell lines provide viral proteins required in trans for the packaging of the viral genomic RNA into viral particles having the desired host range (i.e., the viral-encoded gag, pol and env proteins). The host range is controlled, in part, by the type of envelope gene product expressed on the surface of the viral particle.
  • Packaging cell lines may express ecotrophic, amphotropic or xenotropic envelope gene products.
  • the packaging cell line may lack sequences encoding a viral envelope (env) protein. In this case the packaging cell line will package the viral genome into particles lacking a membrane-associated protein (e.g., an env protein).
  • the packaging cell line containing the retroviral sequences is commonly transfected with sequences encoding a membrane-associated protein (e.g., the G protein of vesicular stomatitis virus (VSV)).
  • VSV vesicular stomatitis virus
  • the transfected packaging cell will then produce viral particles which contain the membrane-associated protein expressed by the transfected packaging cell line; these viral particles which contain viral genomic RNA derived from one virus encapsidated by the envelope proteins of another virus are said to be “pseudotyped virus particles”.
  • the retroviral vectors of the present invention can be further modified to include additional regulatory sequences.
  • the retroviral vectors of the present invention include the following elements in operable association: a) a 5′ LTR; b) a packaging signal; c) a 3′ LTR; and d) a nucleic acid encoding a protein of interest located between the 5′ and 3′ LTRs.
  • the nucleic acid of interest may be arranged in opposite orientation to the 5′ LTR when transcription from an internal promoter is desired.
  • Suitable internal promoters include, but are not limited to, the alpha-lactalbumin promoter, the CMV promoter, and the thymidine kinase promoter.
  • the vectors are modified by including a signal peptide sequence in operable association with the protein of interest.
  • the sequences of several suitable signal peptides are known in the art, including, but not limited to, those derived from tissue plasminogen activator, human growth hormone, lactoferrin, alpha S1-casein, and alpha-lactalbumin.
  • the vectors are modified by incorporating one or more of the elements described above, including, but not limited to, an RNA export element, a PPE element, and an IRES/bovine ⁇ -lactalbumin signal sequence.
  • the retroviral vectors of the present invention may further comprise a selectable marker which facilitates selection of transformed cells.
  • selectable markers known in the art find use in the present invention, including, but not limited to the bacterial aminoglycoside 3′ phosphotransferase gene (also referred to as the “neo gene”) that confers resistance to the drug G418 in mammalian cells, the bacterial hygromycin G phosphotransferase (hyg) gene that confers resistance to the antibiotic hygromycin, and the bacterial xanthine-guanine phosphoribosyl transferase gene (also referred to as the “gpt gene”) that confers the ability to grow in the presence of mycophenolic acid.
  • the selectable marker gene is provided as part of a polycistronic sequence also encoding the protein of interest.
  • the retroviral vectors may comprise recombination elements recognized by a recombination system (e.g., the cre/loxP or flp recombinase systems: See, e.g., Hoess et al., Nucleic Acids Res., 14:2287 [1986], O'Gorman et al., Science 251:1351 [1991], van Deursen et al., Proc. Natl. Acad. Sci. USA 92:7376 [1995], and U.S. Pat. No. 6,025,192, incorporated herein by reference).
  • a recombination system e.g., the cre/loxP or flp recombinase systems: See, e.g., Hoess et al., Nucleic Acids Res., 14:2287 [1986], O'Gorman et al., Science 251:1351 [1991], van Deursen et
  • the host cell can be transiently transfected (e.g., by electroporation, lipofection, or microinjection) with either a recombinase enzyme (e.g., Cre recombinase) or a nucleic acid sequence encoding the recombinase enzyme and one or more nucleic acid sequences encoding a protein of interest flanked by sequences recognized by the recombination enzyme so that the nucleic acid sequence of interest is inserted into the integrated vector.
  • a recombinase enzyme e.g., Cre recombinase
  • a nucleic acid sequence encoding the recombinase enzyme e.g., Cre recombinase
  • nucleic acid sequence encoding a protein of interest flanked by sequences recognized by the recombination enzyme
  • Viral vectors including recombinant retroviral vectors, provide a more efficient means of transferring genes into cells, as compared to other techniques such as calcium phosphate-DNA co-precipitation or DEAE-dextran-mediated transfection, electroporation or microinjection of nucleic acids. Nonetheless, the present invention is not limited to any particular mechanism. Indeed, an understanding of the mechanism is not required to practice the present invention. Nevertheless, it is believed that the efficiency of viral transfer is due in part to the fact that the transfer of nucleic acid is a receptor-mediated process (i.e., the virus binds to a specific receptor protein on the surface of the target cell). In addition, once inside a cell, the virally transferred nucleic acid integrates in controlled manner. This is in contrast to nucleic acids transferred by other means (e.g., calcium phosphate-DNA co-precipitation), which are typically subject to rearrangement and degradation.
  • other means e.g., calcium phosphate-DNA co-precipitation
  • Example 1 describes several illustrative examples of retroviral vectors of the current invention. However, it is not intended that the present invention be limited to the vectors described in Example 1. Indeed, any suitable retroviral vectors containing the novel elements of the present invention are contemplated. Furthermore, the elements described above find use in other vectors such as AAV vectors, transposon vectors, plasmids, bacterial artificial chromosomes, and yeast artificial chromosomes.
  • the vectors and regulatory elements described above find use in the expression of one or more proteins.
  • the present invention is not limited to the production of any particular protein. Indeed, the production of a wide variety of proteins is contemplated, including, but not limited to, erythropoietin, alpha-interferon, alpha-1 proteinase inhibitor, angiogenin, antithrombin III, beta-acid decarboxylase, human growth hormone, bovine growth hormone, porcine growth hormone, human serum albumin, beta-interferon, calf intestine alkaline phosphatase, cystic fibrosis transmembrane regulator, Factor VIII, Factor IX, Factor X, insulin, lactoferrin, tissue plasminogen activator, myelin basic protein, insulin, proinsulin, prolactin, hepatitis B antigen, immunoglobulins, monoclonal antibody CTLA4 Ig, Tag 72 monoclonal antibody, Tag 72 single chain
  • the vectors of the present invention are used to express more than one exogenous protein.
  • host cells may be transfected with vectors encoding different proteins of interest (e.g., cotransfection with one vector encoding a first protein of interest and a second vector encoding a second protein of interest).
  • more than one protein is expressed by arranging the nucleic acids encoding the different proteins of interest in a polycistronic sequence (e.g., bicistronic or tricistronic sequences). This arrangement is especially useful when expression of the different proteins of interest in a 1:1 molar ratio is desired (e.g., expression of the light and heavy chains of an immunoglobulin molecule).
  • proteins are expressed in cell culture.
  • retroviral vectors are used to express protein in mammalian tissue culture host cells, including, but not limited to, rat fibroblast cells, bovine kidney cells, and human kidney cells, while in some preferred embodiments, protein is expressed in bovine mammary cells.
  • the host cells are cultured according to methods known in the art; suitable culture conditions for mammalian cells are well known in the art (See e.g., J. Immunol. Methods 56:221 [1983], Animal Cell Culture: A Practical Approach 2 nd Ed., Rickwood, D. and Hames, B. D., eds. Oxford University Press, New York [1992]).
  • the present invention contemplates the transfection of a variety of host cells with integrating vectors.
  • a number of mammalian host cell lines are known in the art. In general, these host cells are capable of growth and survival when placed in either monolayer culture or in suspension culture in a medium containing the appropriate nutrients and growth factors, as is described in more detail below. Typically, the cells are capable of expressing and secreting large quantities of a particular protein of interest into the culture medium.
  • suitable mammalian host cells include, but are not limited to Chinese hamster ovary cells (CHO-K1, ATCC CC1-61); bovine mammary epithelial cells (ATCC CRL 10274; bovine mammary epithelial cells); monkey kidney CV1 line transformed by SV40 (COS-7, ATCC CRL 1651); human embryonic kidney line (293 or 293 cells subcloned for growth in suspension culture; see, e.g., Graham et al., J. Gen Virol., 36:59 [1977]); baby hamster kidney cells (BHK, ATCC CCL 10); mouse sertoli cells (TM4, Mather, Biol. Reprod.
  • CHO-K1, ATCC CC1-61 Chinese hamster ovary cells
  • ATCC CRL 10274 bovine mammary epithelial cells
  • monkey kidney CV1 line transformed by SV40 COS-7, ATCC CRL 1651
  • human embryonic kidney line (293 or 293 cells subcloned for growth in
  • monkey kidney cells (CV1 ATCC CCL 70); African green monkey kidney cells (VERO-76, ATCC CRL-1587); human cervical carcinoma cells (HELA, ATCC CCL 2); canine kidney cells (MDCK, ATCC CCL 34); buffalo rat liver cells (BRL 3A, ATCC CRL 1442); human lung cells (W138, ATCC CCL 75); human liver cells (Hep G2, HB 8065); mouse mammary tumor (MMT 060562, ATCC CCL51); TRI cells (Mather et al., Annals N.Y. Acad. Sci., 383:44-68 [1982]); MRC 5 cells; FS4 cells; rat fibroblasts (208F cells); MDBK cells (bovine kidney cells); and a human hepatoma line (Hep G2).
  • the present invention also contemplates the transfection of plant protoplasts with integrating vectors at a low or high multiplicity of infection.
  • the present invention contemplates a plant cell or whole plant comprising at least one integrated integrating vector, preferably a retroviral vector, and most preferably a pseudotyped retroviral vector.
  • All plants that can be produced by regeneration from protoplasts can also be transfected using the process according to the invention (e.g., cultivated plants of the genera Solanum, Nicotiana, Brassica, Beta, Pisum, Phaseolus, Glycine, Helianthus, Allium, Avena, Hordeum, Oryzae, Setaria, Secale, Sorghum, Triticum, Zea, Musa, Cocos, Cydonia, Pyrus, Malus, Phoenix, Elaeis, Rubus, Fragaria, Prunus, Arachis, Panicum, Saccharum, Coffea, Camellia, Ananas, Vitis or Citrus).
  • protoplasts are produced in accordance with conventional methods (See, e.g., U.S.
  • Plant tissue may be dispersed in an appropriate medium having an appropriate osmotic potential (e.g., 3 to 8 wt. % of a sugar polyol) and one or more polysaccharide hydrolases (e.g., pectinase, cellulase, etc.), and the cell wall degradation allowed to proceed for a sufficient time to provide protoplasts. After filtration the protoplasts may be isolated by centrifugation and may then be resuspended for subsequent treatment or use.
  • an appropriate osmotic potential e.g., 3 to 8 wt. % of a sugar polyol
  • polysaccharide hydrolases e.g., pectinase, cellulase, etc.
  • the present invention also contemplates the use of amphibian and insect host cell lines.
  • suitable insect host cell lines include, but are not limited to, mosquito cell lines (e.g., ATCC CRL-1660).
  • suitable amphibian host cell lines include, but are not limited to, toad cell lines (e.g., ATCC CCL-102).
  • the host cell cultures are prepared in a medium suitable for the particular cell being cultured.
  • Commercially available media such as Ham's F10 (Sigma, St. Louis, Mo.), Minimal Essential Medium (MEM, Sigma), RPMI-1640 (Sigma), and Dulbecco's Modified Eagle's Medium (DMEM, Sigma) are exemplary nutrient solutions. Suitable media are also described in U.S. Pat. Nos. 4,767,704; 4,657,866; 4,927,762; 5,122,469 and 4,560,655; and PCT Publications WO 90/03430; and WO 87/00195 (each of which are incorporated herein by reference).
  • any of these media may be supplemented as necessary, with hormones and/or other growth factors (e.g., insulin, transferrin, or epidermal growth factor), salts (e.g., sodium chloride, calcium, magnesium, and phosphate), buffers (e.g., HEPES), nucleosides (e.g., adenosine and thymidine), antibiotics (e.g., gentamycin (gentamicin)), trace elements (i.e., inorganic compounds usually present at final concentrations in the micromolar range) lipids (e.g., linoleic or other fatty acids) and their suitable carriers, and glucose or an equivalent energy source. Any other necessary supplements may also be included at appropriate concentrations known to those skilled in the art.
  • the osmolality of the culture medium is generally about 290-330 mOsm.
  • the present invention also contemplates the use of a variety of culture systems (e.g., petri dishes, 96 well plates, roller bottles, and bioreactors) for the growth and expression of host cells.
  • the host cells can be cultured in a perfusion system.
  • Perfusion culture refers to providing a continuous flow of culture medium through a culture maintained at high cell density. The cells are suspended and do not require a solid support upon which to grow. Generally, fresh nutrients must be supplied continuously with concomitant removal of toxic metabolites and, ideally, selective removal of dead cells. Filtering, entrapment and micro-capsulation methods are all suitable for refreshing the culture environment at sufficient rates.
  • a fed batch culture procedure is employed.
  • the mammalian host cells and culture medium are supplied to a culturing vessel initially and additional culture nutrients are fed, continuously or in discrete increments, to the culture during culturing, with or without periodic cell and/or roduct harvest before termination of culture.
  • the fed batch culture is a semi-continuous fed batch culture in which the whole culture (including cells and medium) is removed from the growth vessel and replaced by fresh medium.
  • Fed batch culture is distinguished from simple batch culture in which all components for cell culturing (including the cells and all culture nutrients) are supplied to the culturing vessel at the start of the culturing process.
  • Fed batch culture can be further distinguished from perfusion culturing insofar as the supernate is not removed from the culturing vessel during the process (in perfusion culturing, the cells are restrained in the culture ( e.g., by filtration, encapsulation, anchoring to microcarriers etc.) and the culture medium is continuously or intermittently introduced and removed from the culturing vessel).
  • the cells of the culture may be propagated according to any scheme or routine suitable for the particular host cell and the particular production plan contemplated.
  • the present invention contemplates single step, as well as multiple step culture procedures.
  • a single step culture the host cells are inoculated into a culture environment and the processes of the instant invention are employed during a single production phase of the cell culture.
  • cells are cultivated in a number of steps or phases. For instance, cells may be grown in a first step or growth phase culture wherein cells, possibly removed from storage, are inoculated into a medium suitable for promoting growth and high viability. The cells may be maintained in the growth phase for a suitable period of time by the addition of fresh medium to the host cell culture.
  • Fed batch or continuous cell culture conditions are contemplated in order to enhance growth of the mammalian cells in the growth phase of the cell culture.
  • cells are grown under conditions and for a period of time that is optimized for growth.
  • Culture conditions such as temperature, pH, dissolved oxygen (dO 2 ) and the like, are those used with the particular host and are apparent to the ordinarily skilled artisan.
  • the pH is adjusted to a level between about 6.5 and 7.5 using either an acid (e.g., CO 2 ) or a base (e.g., Na 2 CO 3 or NaOH).
  • a suitable temperature range for culturing mammalian cells e.g., CHO cells
  • a suitable dO 2 is between 5-90% of air saturation.
  • the polypeptide of interest is recovered from the culture medium using well-established techniques.
  • the protein of interest is recovered from the culture medium as a secreted polypeptide (e.g., the secretion of the protein of interest is directed by a signal peptide sequence), although it also may be recovered from host cell lysates.
  • the culture medium or lysate is centrifuged to remove particulate cell debris.
  • the polypeptide is then purified from contaminant soluble proteins and polypeptides using any suitable method.
  • Suitable purificaiton methods include, but are not limited to fractionation on immunoaffinity or ion-exchange columns; ethanol precipitation; reverse phase HPLC; chromatography on silica or on a cation-exchange resin such as DEAE; chromatofocusing; SDS-PAGE; ammonium sulfate precipitation; gel filtration using (e.g., Sephadex G-75); and protein A Sepharose columns to remove contaminants such as IgG.
  • a protease inhibitor such as phenyl methyl sulfonyl fluoride (PMSF) also may be useful to inhibit proteolytic degradation during purification.
  • the protein of interest can be fused in frame to a marker sequence which allows for purification of the protein of interest.
  • Non-limiting examples of marker sequences include a hexahistidine tag which may be supplied by a vector, preferably a pQE-9 vector, and a hemagglutinin (HA) tag.
  • the HA tag corresponds to an epitope derived from the influenza hemagglutinin protein (See e.g., Wilson et al., Cell, 37:767 [1984]).
  • purification methods suitable for the polypeptide of interest may require modification to account for changes in the character of the polypeptide upon expression in recombinant cell culture.
  • the host cell utilized for expression of the protein of interest is part of a mammal.
  • the mammal is a transgenic bovine.
  • the transgenic bovine may be produced by any suitable method (See e.g., Chan et al., PNAS, 95:14028 [1998]; U.S. Pat. No. 5,741,957 (incorporated herein by reference); and Pursel et al., Science, 244:1281 [1989]).
  • the protein is expressed in the mammary gland of a bovine and secreted in the milk of the bovine.
  • proteins and signal sequences for tissue specific expression and secretion are utilized, including, but not limited to, bovine/human ⁇ -lactalbumin promoter and bovine ⁇ -lactalbumin signal sequence.
  • the protein of interest may be recovered from bovine milk using any suitable method, including but not limited to, those described above for the recovery of protein from cell cultures.
  • vectors of the present invention will find use in the production of other transgenic animals as well, including, but not limited to, mice, goats, pigs, birds and rabbits (See e.g., U.S Pat. Nos. 5,523,226; 5,453,457; 4,873,191; 4,736,866; each of which is herein incorporated by reference).
  • single vectors are utilized for the expression of two or more proteins, including individual subunits of multisubunit proteins.
  • two or more chains of an immunoglobulin e.g., one heavy chain (( ⁇ , ⁇ , ⁇ , ⁇ , or 68 ) and one light chain ( ⁇ or ⁇ )
  • IRES sequence e.g., one heavy chain (( ⁇ , ⁇ , ⁇ , ⁇ , or 68 ) and one light chain ( ⁇ or ⁇ )
  • the present invention is not limited to any particular vector. Indeed, the use of a variety of vectors is contemplated, including, but not limited to plasmids, cosmids, bacterial artificial chomosomes, yeast artificial chromosomes, adeno-associated virus vectors, and adenovirus vectors.
  • vectors include, but are not limited to, the following vectors: 1) Bacterial—pQE70, pQE60, pQE-9 (Qiagen), pBS, pD10, phagescript, psiX174, pbluescript SK, pBSKS, pNH8A, pNH16a, pNH18A, pNH46A (Stratagene); ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 (Pharmacia); and 2) Eukaryotic—pWLNEO, pSV2CAT, pOG44, PXT1, pSG (Stratagene) pSVK3, pBPV, pMSG, pSVL (Pharmacia).
  • mammalian expression vectors comprise an origin of replication, a suitable promoter and enhancer, and also any necessary ribosome binding sites, polyadenylation sites, splice donor and acceptor sites, transcriptional termination sequences, and 5′ flanking non-transcribed sequences.
  • DNA sequences derived from the SV40 splice, and polyadenylation sites may be used to provide the required non-transcribed genetic elements.
  • the DNA sequence in the expression vector is operatively linked to an appropriate expression control sequence(s) (promoter) to direct mRNA synthesis.
  • Promoters useful in the present invention include, but are not limited to, the LTR or SV40 promoter, the E. coli lac or trp, the phage lambda P L and P R , T3 and T7 promoters, and the cytomegalovirus (CMV) immediate early, herpes simplex virus (HSV) thymidine kinase, and mouse metallothionein-I promoters and other promoters known to control expression of gene in prokaryotic or eukaryotic cells or their viruses.
  • CMV cytomegalovirus
  • HSV herpes simplex virus
  • thymidine kinase thymidine kinase
  • recombinant expression vectors include origins of replication and selectable markers permitting transformation of the host cell (e.g., dihydrofolate reductase or neomycin resistance for eukaryotic cell culture, or tetracycline or ampicillin resistance in E. coli ).
  • transcription of the DNA encoding the polypeptides of the present invention by higher eukaryotes is increased by inserting an enhancer sequence into the vector.
  • Enhancers are cis-acting elements of DNA, usually about from 10 to 300 bp that act on a promoter to increase its transcription.
  • Enhancers useful in the present invention include, but are not limited to, the SV40 enhancer on the late side of the replication origin bp 100 to 270, a cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side of the replication origin, and adenovirus enhancers.
  • the expression vector also contains a ribosome binding site for translation initiation and a transcription terminator.
  • the vector may also include appropriate sequences for amplifying expression.
  • retroviral vectors are used to express immunoglobulins.
  • retroviral vectors for expression of immunoglobulins contain regulatory elements.
  • two immunoglobulins chains are expressed in the same retrovirus vector construct separated by an IRES sequence.
  • the two chains are separated by an IRES/ ⁇ -LA signal sequence.
  • the vector further contains RNA export elements.
  • the RNA export element is a WPRE.
  • the PPE element is at least one Mertz sequence.
  • the PPE element is mutated to remove start signals.
  • two PPE elements are placed in a head to tail array separated by a linker.
  • expression of immunoglobulins by the vectors of the current invention is controlled by a promoter.
  • expression is controlled by a CMV promoter, while in other embodiments, expression is controlled by a MMTV promoter.
  • expression is controlled by a hybrid bovine/human ⁇ -LA promoter.
  • heavy and light chains are expressed by the vectors of the current invention of a ratio of about 0.7:1.3. In preferred embodiments, heavy and light chains are expressed and a ratio of about 0.8:1.2. In particularly preferred embodiments, heavy and light chains are expressed at a ratio of about 0.9:1.1. In still more preferred embodiments, heavy and light chains are expressed at a ratio of about 1:1. In particularly preferred embodiments, the majority (e.g., greater that 90%, preferably greater than 95%, and most preferably greater than about 99%) of the heavy and light chains are correctly assembled in a ratio of 1:1 to form a functional (e.g., able to bind an antigen) antibody.
  • a functional e.g., able to bind an antigen
  • immunoglobulins are expressed in a host cell comprising the vectors and elements described above.
  • the vectors described in Example 1 are used to express a variety of immunoglobulins in a variety of cell lines. In general, this expression led to the formation of finctional, tetrameric immunoglobulins.
  • the vectors of the present invention are also useful for expressing G-protein coupled receptors (GPCRs) and other transmembrane proteins. It is contemplated that when these proteins are expressed, they are correctly inserted into the membrane in their native conformation. Thus, GPCRs and other transmembrane proteins may be purified as part of a membrane fraction or purified from the membranes by methods known in the art.
  • GPCRs G-protein coupled receptors
  • the vectors of the present invention are useful for co-expressing a protein of interest for which there is no assay or for which assays are difficult.
  • a protein of interest and a signal protein are arranged in a polycistronic sequence.
  • an IRES sequence separates the signal protein and protein of interest (e.g., a GPCR) and the genes encoding the signal protein and protein of interest are expressed as a single transcriptional unit.
  • the present invention is not limited to any particular signal protein. Indeed, the use of a variety of signal proteins for which easy assays exist is contemplated. These signal proteins include, but are not limited to, green fluorescent protein, luciferase, beta-galactosidase, and antibody heavy or light chains.
  • the present invention provides methods for indirectly detecting the expression of protein of interest comprising providing a host cell transfected with a vector encoding a polycistronic sequence, wherein the polycistronic sequence comprises a signal protein and a protein of interest operably linked by an IRES, and culturing the host cells under conditions such that the signal protein and protein of interest are produced, wherein the presence of the signal protein indicates the presence of the protein of interest.
  • M molar
  • mM millimolar
  • ⁇ M micromolar
  • nM nanomolar
  • mol molecular weight
  • mmol millimoles
  • ⁇ mol micromoles
  • nmol nanomoles
  • gm grams
  • mg milligrams
  • ⁇ g micrograms
  • pg picograms
  • L liters
  • ml milliliters
  • ⁇ l microliters
  • cm centimeters
  • mm millimeters
  • nm nanometers
  • RNA ribonucleic acid
  • PBS phosphate buffered saline
  • g gravity
  • OD optical density
  • HEPES N-[2- ⁇ Hydroxyethyl]piperazine-N-[2-ethanesulfonic acid]
  • HBS HBS buffered saline
  • PBS phosphate buffered saline
  • SDS sodium dodecylsulfate
  • the CMV MN14 vector (SEQ ID NO:4; MN14 antibody is described in U.S. Pat. No. 5,874,540, incorporated herein by reference) comprises the following elements, arranged in 5′ to 3′ order: CMV promoter; MN14 heavy chain signal peptide, MN14 antibody heavy chain; IRES from encephalomyocarditis virus; bovine ⁇ -lactalbumin signal peptide; MN 14 antibody light chain; and 3′ MoMuLV LTR.
  • the CMV MN14 vector further comprises a 5′ MoMuLV LTR, a MoMuLV extended viral packaging signal, and a neomycin phosphotransferase gene (these additional elements are provided in SEQ ID NO:7; the 5′ LTR is derived from Moloney Murine Sarcoma Virus in each of the constructs described herein, but is converted to the MoMuLV 5′ LTR when integrated).
  • This construct uses the 5′ MoMuLV LTR to control production of the neomycin phosphotransferase gene.
  • the expression of MN14 antibody is controlled by the CMV promoter.
  • the MN14 heavy chain gene and light chain gene are attached together by an IRES sequence.
  • the CMV promoter drives production of a mRNA containing the heavy chain gene and the light chain gene attached by the IRES.
  • Ribosomes attach to the mRNA at the CAP site and at the IRES sequence. This allows both heavy and light chain protein to be produced from a single mRNA.
  • the mRNA expression from the LTR as well as from the CMV promoter is terminated and poly adenylated in the 3′ LTR.
  • the construct was cloned by similar methods as described in section B below.
  • the IRES sequence (SEQ ID NO:3) comprises a fusion of the IRES from the plasmid pLXIN (Clontech) and the bovine ⁇ -lactalbumin signal peptide.
  • the initial ATG of the signal peptide was attached to the IRES to allow the most efficient translation initiation from the IRES.
  • the 3′ end of the signal peptide provides a multiple cloning site allowing easy attachment of any protein of interest to create a fusion protein with the signal peptide.
  • the IRES sequence can serve as a translational enhancer as well as creating a second translation initiation site that allows two proteins to be produced from a single mRNA.
  • the IRES-bovine ⁇ -lactalbumin signal peptide was constructed as follows. The portion of the plasmid pLXIN (Clontech, Palo Alto, Calif.) containing the ECMV IRES was PCR amplified using the following primers.
  • Primer 1 (SEQ ID NO: 35):
  • Primer 2 attaches a tail corresponding to the start of the bovine ⁇ -lactalbumin signal peptide coding region to the IRES sequence.
  • the second triplet codon of the ⁇ -lactalbumin signal peptide was mutated from ATG to GCC to allow efficient translation from the IRES sequence. This mutation results in a methionine to alanine change in the protein sequence. This mutation was performed because the IRES prefers an alanine as the second amino acid in the protein chain.
  • the resulting IRES PCR product contains an EcoRI site on the 5′ end of the fragment (just downstream of Primer 1 above).
  • the ⁇ -lactalbumin signal peptide containing sequence was PCR amplified from the ⁇ -LA Signal Peptide vector construct using the following primers.
  • Primer 3 attaches a tail corresponding to the 3′ end of the IRES sequence to the ⁇ -lactalbumin signal peptide coding region.
  • the second triplet codon of the bovine ⁇ -lactalbumin signal peptide was mutated to allow efficient translation from the IRES sequence.
  • the resulting signal peptide PCR fragment contains NaeI, NcoI, EcoRV, XbaI, BglII and XhoI sites on the 3′ end.
  • the resultant product of this reaction is a spliced fragment that contains the IRES attached to the full length ⁇ -lactalbumin signal peptide.
  • the ATG encoding the start of the signal peptide is placed at the same location as the ATG encoding the start of the neomycin phosphotransferase gene found in the vector pLXIN.
  • the fragment also contains the EcoRI site on the 5′ end and Nael, NcoI, EcoRV, XbaI, BglII and XhoI sites on the 3′ end.
  • the spliced IRES/ ⁇ -lactalbumin signal peptide PCR fragment was digested with EcoRI and XhoI.
  • the ⁇ -LA Signal Peptide vector construct was also digested with EcoRI and XhoI. These two fragments were ligated together to give the pIRES construct.
  • the IRES/ ⁇ -lactalbumin signal peptide portion of the pIRES vector was sequenced and found to contain mutations in the 5′ end of the IRES. These mutations occur in a long stretch of C's and were found in all clones that were isolated.
  • pLXIN DNA was digested with EcoRI and BsmFI.
  • the 500 bp band corresponding to a portion of the IRES sequence was isolated.
  • the mutated IRES/ ⁇ -lactalbumin signal peptide construct was also digested with EcoRI and BsmFI and the mutated IRES fragment was removed.
  • the IRES fragment from pLXIN was then substituted for the IRES fragment of the mutated IRES/ ⁇ -lactalbumin signal peptide construct.
  • the IRES/ ⁇ -LA signal peptide portion of resulting plasmid was then verified by DNA sequencing.
  • the CMV LL2 (SEQ ID NO:5; LL2 antibody is described in U.S. Pat. No. 6,187,287, incorporated herein by reference) construct comprises the following elements, arranged in 5′ to 3′ order: 5° CMV promoter (Clontech), LL2 heavy chain signal peptide, LL2 antibody heavy chain; IRES from encephalomyocarditis virus; bovine ⁇ -LA signal peptide; LL2 antibody light chain; and 3′ MoMuLV LTR.
  • the CMV LL2 vector further comprises a 5′ MoMuLV LTR, a MoMuLV extended viral packaging signal, and a neomycin phosphotransferase gene (these additional elements are provided in SEQ ID NO:7).
  • This construct uses the 5′ MoMuLV LTR to control production of the neomycin phosphotransferase gene.
  • the expression of LL2 antibody is controlled by the CMV promoter (Clontech).
  • the LL2 heavy chain gene and light chain gene are attached together by an IRES sequence.
  • the CMV promoter drives production of a mRNA containing the heavy chain gene and the light chain gene attached by the IRES.
  • Ribosomes attach to the mRNA at the CAP site and at the IRES sequence. This allows both heavy and light chain protein to be produced from a single mRNA.
  • the mRNA expression from the LTR as well as from the CMV promoter is terminated and poly adenylated in the 3′ LTR.
  • the IRES sequence (SEQ ID NO:3) comprises a fusion of the IRES from the plasmid pLXIN (Clontech) and the bovine alpha-lactalbumin signal peptide.
  • the initial ATG of the signal peptide was attached to the IRES to allow the most efficient translation initiation from the IRES.
  • the 3′ end of the signal peptide provides a multiple cloning site allowing easy attachment of any protein of interest to create a fusion protein with the signal peptide.
  • the IRES sequence can serve as a translational enhancer as well as creating a second translation initiation site that allows two proteins to be produced from a single mRNA.
  • the LL2 light chain gene was attached to the IRES ⁇ -lactalbumin signal peptide as follows.
  • the LL2 light chain was PCR amplified from the vector pCRLL2 using the following primers.
  • the Kozak sequence of the LL2 heavy chain gene was then modified.
  • the vector pCRMN14HC was digested with XhoI and AvrII to remove about a 400 bp fragment.
  • PCR was then used to amplify the same portion of the LL2 heavy chain construct that was removed by the XhoI-AvrII digestion. This amplification also mutated the 5′ end of the gene to add a better Kozak sequence to the clone.
  • the Kozak sequence was modified to resemble the typical IgG Kozak sequence.
  • the PCR primers are shown below.
  • the PCR product was digested with XhoI and AvrII and inserted back into the previously digested plasmid backbone.
  • the “good” Kozak sequence was then added to the light chain gene.
  • the “good” Kozak LL2 heavy chain gene construct was digested with EcoRI and the heavy chain gene containing fragment was isolated.
  • the IRES ⁇ -Lactalbumin Signal Peptide LL2 light chain gene construct was also digested with EcoRI.
  • the heavy chain gene was then cloned into the EcoRI site of IRES light chain construct. This resulted in the heavy chain gene being placed at the 5′ end of the IRES sequence.
  • LNCX retroviral backbone plasmid was digested with HindIII and ClaI. Two oligonucleotide primers were produced and annealed together to create an double stranded DNA multiple cloning site. The following primers were annealed together.
  • the double chain gene fragment was ligated into the retroviral backbone gene construct.
  • the double chain gene construct created above was digested with SalI and BglII and the double chain containing fragment was isolated.
  • the retroviral expression plasmid LNC-MCS was digested with XhoI and BglII. The double chain fragment was then cloned into the LNC-MCS retroviral expression backbone.
  • RNA splicing problem in the construct was corrected.
  • the construct was digested with NsiI.
  • the resulting fragment was then partially digested with EcoRI.
  • the fragments resulting from the partial digest that were approximately 9300 base pairs in size were gel purified.
  • a linker was created to mutate the splice donor site at the 3′ end of the LL2 heavy chain gene.
  • the linker was again created by annealing two oligonucleotide primers together to form the double stranded DNA linker. The two primers used to create the linker are shown below.
  • the MMTV MN14 (SEQ ID NO:6) construct comprises the following elements, arranged in 5′ to 3′ order: 5′ MMTV promoter; double mutated PPE sequence; MN 14 antibody heavy chain; IRES from encephalomyocarditis virus; bovine ⁇ LA signal peptide MN 14 antibody light chain; WPRE sequence; and 3′ MoMuLV LTR.
  • the MMTV MN14 vector further comprises a MoMuLV LTR, MoMuLV extended viral packaging signal; neomycin phosphotransferase gene located 5′ of the MMTV promoter (these additional elements are provided in SEQ ID NO: 7).
  • This construct uses the 5′ MoMuLV LTR to control production of the neomycin phosphotransferase gene.
  • the expression of MN14 antibody is controlled by the MMTV promoter (Pharmacia).
  • the MN14 heavy chain gene and light chain gene are attached together by an IRES/bovine ⁇ -LA signal peptide sequence (SEQ ID NO: 3).
  • the MMTV promoter drives production of a mRNA containing the heavy chain gene and the light chain gene attached by the IRES/bovine ⁇ -LA signal peptide sequence.
  • Ribosomes attach to the mRNA at the CAP site and at the IRES/bovine ⁇ -LA signal peptide sequence.
  • the PPE sequence is contained between the RNA CAP site and the start of the MN14 protein coding region
  • the WPRE is contained between the end of MN14 protein coding and the poly-adenylation site.
  • ATG sequences within the PPE element were mutated to prevent potential unwanted translation initiation. Two copies of this mutated sequence were used in a head to tail array. This sequence is placed just downstream of the promoter and upstream of the Kozak sequence and signal peptide-coding region.
  • the WPRE is isolated from woodchuck hepatitis virus and also aids in the export of mRNA from the nucleus and creating stability in the mRNA. If this sequence is included in the 3′ untranslated region of the RNA, level of protein expression from this RNA increases up to 10-fold.
  • the ⁇ -LA MN14 (SEQ ID NO:7) construct comprises the following elements, arranged in 5′ to 3′ order: 5′ MoMuLV LTR, MoMuLV extended viral packaging signal, neomycin phosphotransferase gene, bovine/human alpha-lactalbumin hybrid promoter, double mutated PPE element, MN14 heavy chain signal peptide, MN14 antibody heavy chain, IRES from encephalomyocarditis virus/bovine ⁇ LA signal peptide, MN14 antibody light chain, WPRE sequence; and 3′ MoMuLV LTR.
  • This construct uses the 5′ MoMuLV LTR to control production of the neomycin phosphotransferase gene.
  • the expression of MN14 antibody is controlled by the hybrid ⁇ -LA promoter (SEQ ID NO:1).
  • the MN14 heavy chain gene and light chain gene are attached 30 together by an IRES sequence/bovine ⁇ -LA signal peptide (SEQ ID NO:3).
  • the ⁇ -LA promoter drives production of a mRNA containing the heavy chain gene and the light chain gene attached by the IRES. Ribosomes attach to the mRNA at the CAP site and at the IRES sequence. This allows both heavy and light chain protein to be produced from a single mRNA.
  • the mutated PPE sequence (SEQ ID NO:2) is contained between the RNA CAP site and the start of the MN14 protein coding region. ATG sequences within the PPE element (SEQ ID NO:2) were mutated to prevent potential unwanted translation initiation. Two copies of this mutated sequence were used in a head to tail array. This sequence is placed just downstream of the promoter and upstream of the Kozak sequence and signal peptide-coding region.
  • the WPRE was isolated from woodchuck hepatitis virus and also aids in the export of mRNA from the nucleus and creating stability in the mRNA. If this sequence is included in the 3′ untranslated region of the RNA, level of protein expression from this RNA increases up to 10-fold.
  • the WPRE is contained between the end of MN14 protein coding and the poly-adenylation site. The mRNA expression from the LTR as well as from the bovine/human alpha-lactalbumin hybrid promoter is terminated and poly adenylated in the 3′ LTR.
  • the bovine/human alpha-lactalbumin hybrid promoter (SEQ ID NO:1) is a modular promoter/enhancer element derived from human and bovine alpha-lactalbumin promoter sequences.
  • the human portion of the promoter is from +15 relative to transcription start point (tsp) to ⁇ 600 relative to the tsp.
  • the bovine portion is then attached to the end of the human portion and corresponds to ⁇ 550 to ⁇ 2000 relative to the tsp.
  • the hybrid was developed to remove poly-adenylation signals that were present in the bovine promoter and hinder retroviral RNA production. It was also developed to contain genetic control elements that are present in the human gene, but not the bovine.
  • bovine/human ⁇ -lactalbumin promoter human genomic DNA was isolated and purified. A portion of the human ⁇ -lactalbumin promoter was PCR amplified using the following two primers:
  • Primer 1 (SEQ ID NO: 24):
  • This two primers created a NdeI site on the 5′ end of the PCR fragment and a EcoRI site on the 3′ end of the PCR fragment.
  • the human PCR fragment created using the above primers was double digested with the restriction enzymes NdeI and EcoRI.
  • the plasmid pKBaP-1 was also double digested with NdeI and EcoRI.
  • the plasmid pKBaP-1 contains the bovine ⁇ -lactalbumin 5′ flanking region attached to a multiple cloning site. This plasmid allows attachment of various genes to the bovine ⁇ -lactalbumin promoter.
  • the human fragment was ligated/substituted for the bovine fragment of the promoter that was removed from the pKBaP-1 plasmid during the double digestion.
  • the resulting plasmid was confirmed by DNA sequencing to be a hybrid of the Bovine and Human a-lactalbumin promoter/regulatory regions.
  • Attachment of the MN14 light chain gene to the IRES ⁇ -lactalbumin signal peptide was accomplished as follows.
  • the MN14 light chain was PCR amplified from the vector pCRMN14LC using the following primers.
  • Primer 1 (SEQ ID NO: 26): 5′CTACAGGTGTCCACGTCGACATCCAGCTGACCCAG 3′
  • Primer 2 (SEQ ID NO: 27): 5′ CTGCAGAATAGATCTCTAACACTCTCCCCTGTTG 3′
  • the vector pCRMN14HC was digested with XhoI and NruI to remove about a 500 bp fragment.
  • PCR was then used to amplify the same portion of the MN14 heavy chain construct that was removed by the XhoI-NruI digestion. This amplification also mutated the 5′ end of the gene to add a better Kozak sequence to the clone.
  • the Kozak sequence was modified to resemble the typical IgG Kozak sequence.
  • the PCR primers are shown below.
  • Primer 1 (SEQ ID NO: 28):
  • the PCR product was digested with XhoI and NruI and inserted back into the previously digested plasmid backbone.
  • the “good” Kozak MN14 heavy chain gene construct was digested with EcoRI and the heavy chain gene containing fragment was isolated.
  • the IRES ⁇ -Lactalbumin Signal Peptide MN14 light chain gene construct was also digested with EcoRI.
  • the heavy chain gene was then cloned into the EcoRI site of IRES light chain construct. This resulted in the heavy chain gene being placed at the 5′ end of the IRES sequence.
  • a multiple cloning site was then added to the LNCX retroviral backbone plasmid.
  • the LNCX plasmid was digested with HindIII and ClaI.
  • Two oligonucleotide primers were produced and annealed together to create an double stranded DNA multiple cloning site. The following primers were annealed together.
  • Primer 1 (SEQ ID NO: 30):
  • the double chain gene fragment was then inserted into a retroviral backbone gene construct.
  • the double chain gene construct created in step 3 was digested with SailI and BglII and the double chain containing fragment was isolated.
  • the retroviral expression plasmid LNC-MCS was digested with XhoI and BglII. The double chain fragment was then cloned into the LNC-MCS retroviral expression backbone.
  • RNA splicing problem in the construct was repaired.
  • the construct was digested with NsiI.
  • the resulting fragment was then partially digested with EcoRI.
  • the fragments resulting from the partial digest that were approximately 9300 base pairs in size, were gel purified.
  • a linker was created to mutate the splice donor site at the 3′ end of the MN14 heavy chain gene.
  • the linker was again created by annealing two oligonucleotide primers together to form the double stranded DNA linker. The two primers used to create the linker are shown below.
  • Primer 1 (SEQ ID NO: 32):
  • the mutated double chain fragment was inserted into the ⁇ -Lactalbumin expression retroviral backbone LN ⁇ -LA-Mertz-MCS.
  • the gene construct produced above was digested with BamHI and BglII and the mutated double chain gene containing fragment was isolated.
  • the LN ⁇ -LA-Mertz-MCS retroviral backbone plasmid was digested with BglII.
  • the BamHI/BglII fragment was then inserted into the retroviral backbone plasmid.
  • a WPRE element was then inserted into the gene construct.
  • the plasmid BluescriptII SK+WPRE-B11 was digested with BamHI and HincII to remove the WPRE element and the element was isolated.
  • the vector created above was digested with BglII and HpaI.
  • the WPRE fragment was ligated into the BglII and HpaI sites to create the final gene construct.
  • the ⁇ -LA Bot (SEQ ID NO:8, botulinum toxin antibody) construct comprises the following elements, arranged in 5′ to 3′ order: bovine/human alpha-lactalbumin hybrid promoter, mutated PPE element, cc49 signal peptide, botulinum toxin antibody light chain, IRES from encephalomyocarditis virus/bovine ⁇ -LA signal peptide, botulinum toxin antibody heavy chain, WPRE sequence, and 3′ MoMuLV LTR.
  • the ⁇ -LA botulinum toxin antibody vector further comprises a 5′ MoMuLV LTR, a MoMuLV extended viral packaging signal, and a neomycin phosphotransferase gene (these additional elements are provided in SEQ ID NO: 7).
  • This construct uses the 5′ MoMuLV LTR to control production of the neomycin phosphotransferase gene.
  • the expression of botulinum toxin antibody is controlled by the hybrid a-LA promoter.
  • the botulinum toxin antibody light chain gene and heavy chain gene are attached together by an IRES/bovine ⁇ -LA signal peptide sequence.
  • the bovine/human alpha-lactalbumin hybrid promoter drives production of a mRNA containing the light chain gene and the heavy chain gene attached by the IRES. Ribosomes attach to the mRNA at the CAP site and at the IRES sequence. This allows both light and heavy chain protein to be produced from a single mRNA.
  • the mutated PPE sequence (SEQ ID NO:2) is contained between the RNA CAP site and the start of the MN14 protein coding region. ATG sequences within the PPE element (SEQ ID NO:2) were mutated to prevent potential unwanted translation initiation. Two copies of this mutated sequence were used in a head to tail array. This sequence was placed just downstream of the promoter and upstream of the Kozak sequence and signal peptide-coding region.
  • the WPRE was isolated from woodchuck hepatitis virus and also aids in the export of mRNA from the nucleus and creating stability in the niRNA. If this sequence is included in the 3′ untranslated region of the RNA, level of protein expression from this RNA increases up to 10-fold.
  • the WPRE is contained between the end of MN14 protein coding and the poly-adenylation site. The mRNA expression from the LTR as well as from the bovine/human alpha-lactalbumin hybrid promoter is terminated and poly adenylated in the 3′ LTR.
  • the bovine/human ⁇ -lactalbumin hybrid promoter (SEQ ID NO:1) is a modular promoter/enhancer element derived from human and bovine ⁇ -lactalbumin promoter sequences.
  • the human portion of the promoter is from +15 relative to transcription start point to ⁇ 600 relative to the tsp.
  • the bovine portion is then attached to the end of the human portion and corresponds to ⁇ 550 to ⁇ 2000 relative to the tsp.
  • the hybrid was developed to remove poly-adenylation signals that were present in the bovine promoter and hinder retroviral RNA production. It was also developed to contain genetic control elements that are present in the human gene, but not the bovine. Likewise, the construct contains control elements present in the bovine but not in the human.
  • the LSRNL (SEQ ID NO:9) construct comprises the following elements, arranged in 5′ to 3′ order: 5′ MoMuLV LTR, MoMuLV viral packaging signal; hepatitis B surface antigen; RSV promoter; neomycin phosphotransferase gene; and 3′ MoMuLV LTR.
  • This construct uses the 5′ MoMuLV LTR to control production of the Hepatitis B surface antigen gene.
  • the expression of the neomycin phosphotransferase gene is controlled by the RSV promoter.
  • the mRNA expression from the LTR as well as from the RSV promoter is terminated and poly adenylated in the 3′ LTR.
  • the ⁇ -LA cc49IL2 (SEQ ID NO:10; the cc49 antibody is described in U.S. Pat. Nos. 5,512,443; 5,993,813; and 5,892,019; each of which is herein incorporated by reference) construct comprises the following elements, arranged in 5′ to 3′ order: 5′ bovine/human ⁇ -lactalbumin hybrid promoter; cc49-IL2 coding region; and 3′ MoMuLV LTR.
  • This gene construct expresses a fusion protein of the single chain antibody cc49 attached to Interleukin-2. Expression of the fusion protein is controlled by the bovine/human ⁇ -lactalbumin hybrid promoter.
  • the bovine/human ⁇ -lactalbumin hybrid promoter (SEQ ID NO:1) is a modular promoter/enhancer element derived from human and bovine alpha-lactalbumin promoter sequences.
  • the human portion of the promoter is from +15 relative to transcription start point to ⁇ 600 relative to the tsp.
  • the bovine portion is then attached to the end of the human portion and corresponds to ⁇ 550 to ⁇ 2000 relative to the tsp.
  • the hybrid was developed to remove poly-adenylation signals that were present in the bovine promoter and hinder retroviral RNA production. It was also developed to contain genetic control elements that are present in the human gene, but not the bovine. Likewise, the construct contains control elements present in the bovine but not in the human.
  • the 3′ viral LTR provide the poly-adenylation sequence for the mRNA.
  • the ⁇ -LA YP (SEQ ID NO: 11) construct comprises the following elements, arranged in 5′ to 3′ order: 5′ bovine/human alpha-lactalbumin hybrid promoter; double mutated PPE sequence; bovine ⁇ LA signal peptide; Yersenia pestis antibody heavy chain Fab coding region; EMCV IRES/bovine ⁇ -LA signal peptide; Yersenia pestis antibody light chain Fab coding region; WPRE sequence; 3′ MoMuLV LTR.
  • This gene construct will cause the expression of Yersenia pestis mouse Fab antibody.
  • the expression of the gene construct is controlled by the bovine/human ⁇ -lactalbumin hybrid promoter.
  • the PPE sequence and the WPRE sequence aid in moving the mRNA from the nucleus to the cytoplasm.
  • the IRES sequence allows both the heavy and the light chain genes to be translated from the same mRNA.
  • the 3′ viral LTR provides the poly-adenylation sequence for the mRNA.
  • the mutated PPE sequence (SEQ ID NO:2) is contained between the RNA CAP site and the start of the MN14 protein coding region.
  • ATG sequences within the PPE element (SEQ ID NO:2) were mutated (bases 4, 112, 131, and 238 of SEQ ID NO: 2 were changed from a G to a T) to prevent potential unwanted translation initiation.
  • Two copies of this mutated sequence were used in a head to tail array. This sequence was placed just downstream of the promoter and upstream of the Kozak sequence and signal peptide-coding region.
  • the WPRE was isolated from woodchuck hepatitis virus and also aids in the export of mRNA from the nucleus and creating stability in the mRNA. If this sequence is included in the 3′ untranslated region of the RNA, level of protein expression from this RNA increases up to 10-fold.
  • the WPRE is contained between the end of MN14 protein coding and the poly-adenylation site. The mRNA expression from the LTR as well as from the bovine/human alpha-lactalbumin hybrid promoter is terminated and poly adenylated in the 3′ LTR.
  • the bovine/human alpha-lactalbumin hybrid promoter (SEQ ID NO:1) is a modular promoter/enhancer element derived from human and bovine alpha-lactalbumin promoter sequences.
  • the human portion of the promoter is from +15 relative to transcription start point to ⁇ 600 relative to the tsp.
  • the bovine portion is then attached to the end of the human portion and corresponds to ⁇ 550 to ⁇ 2000 relative to the tsp.
  • the hybrid was developed to remove poly-adenylation signals that were present in the bovine promoter and hinder retroviral RNA production. It was also developed to contain genetic control elements that are present in the human gene, but not the bovine. Likewise, the construct contains control elements present in the bovine but not in the human.
  • Examples 2-5 describe the production of pseudotyped retroviral vectors. These methods are generally applicable to the production of the vectors described above.
  • the expression of the fusogenic VSV G protein on the surface of cells results in syncytium formation and cell death. Therefore, in order to produce retroviral particles containing the VSV G protein as the membrane-associated protein a two-step approach was taken. First, stable cell lines expressing the gag and pol proteins from MoMLV at high levels were generated (e.g., 293GP SD cells). The stable cell line which expresses the gag and pol proteins produces noninfectious viral particles lacking a membrane-associated protein (e.g., an envelope protein).
  • a membrane-associated protein e.g., an envelope protein
  • the stable cell line was then co-transfected, using the calcium phosphate precipitation, with VSV-G and gene of interest plasmid DNAs.
  • the pseudotyped vector generated was used to infect 293GP SD cells to produce stably transformed cell lines.
  • Stable cell lines can be transiently transfected with a plasmid capable of directing the high level expression of the VSV G protein (see below).
  • the transiently transfected cells produce VSV G-pseudotyped retroviral vectors which can be collected from the cells over a period of 3 to 4 days before the producing cells die as a result of syncytium formation.
  • the first step in the production of VSV G-pseudotyped retroviral vectors, the generation of stable cell lines expressing the MoMLV gag and pol proteins is described below.
  • the human adenovirus Ad-5-transformed embryonal kidney cell line 293 (ATCC CRL 1573) was cotransfected with the pCMVgag-pol and the gene encoding for phleomycin.
  • pCMV gag-pol contains the MoMLV gag and pol genes under the control of the CMV promoter (pCMV gag-pol is available from the ATCC).
  • the plasmid DNA was introduced into the 293 cells using calcium phosphate co-precipitation (Graham and Van der Eb, Virol. 52:456 [1973]). Approximately 5 ⁇ 10 5 293 cells were plated into a 100 mm tissue culture plate the day before the DNA co-precipitate was added. Stable transformants were selected by growth in DMEM-high glucose medium containing 10% FCS and 10 ⁇ g/ml phleomycin (selective medium). Colonies which grew in the selective medium were screened for extracellular reverse transcriptase activity (Goff et al., J. Virol. 38:239 [1981]) and intracellular p30gag expression.
  • the presence of p30gag expression was determined by Western blotting using a goat-anti p30 antibody (NCI antiserum 77S000087).
  • a clone which exhibited stable expression of the retroviral genes was selected. This clone was named 293GP SD (293 gag-pol-San Diego).
  • VSV G protein pseudotyped retrovirus In order to produce VSV G protein pseudotyped retrovirus the following steps were taken.
  • the 293GP SD cell line was co-transfected with VSV-G plasmid and DNA plasmid of interest. This co-transfection generates the infectious particles used to infect 293 GP SD cells to generate the packaging cell lines.
  • This Example describes the production of pseudotyped LNBOTDC virus. This general method may be used to produce any of the vectors described in Example 1.
  • the packaging cell line, 293GP SD was grown in alpha-MEM-high glucose medium containing 10% FCS
  • the titer of the pseudo-typed virus may be determined using either 208F cells (Quade, Virol. 98:461 [1979]) or NIH/3T3 cells (ATCC CRL 1658); 208F and NIH/3T3 cells are grown in DMEM-high glucose medium containing 10% CS.
  • the plasmid LNBOTDC contains the gene encoding BOTD under the transcriptional control of cytomegalovirus intermediate-early promoter followed by the gene encoding neomycin phosphotransferase (Neo) under the transcriptional control of the LTR promoter.
  • the plasmid pHCMV-G contains the VSV G gene under the transcriptional control of the human cytomegalovirus intermediate-early promoter (Yee et al., Meth. Cell Biol. 43:99 [1994]).
  • LNBOTDC DNA (SEQ ID NO: 13) was co-transfected with pHCMV-G DNA into the packaging line 293GP SD to produce LNBOTDC virus. The resulting LNBOTDC virus was then used to infect 293GP SD cells to transform the cells. The procedure for producing pseudotyped LNBOTDC virus was carried out as described (Yee et al., Meth. Cell Biol. 43:99 [1994].
  • the CMV regulatory sequences control the expression of the botulinum toxin antibody heavy and light chain genes.
  • the IRES sequence allows both the heavy and the light chain genes to be translated from the same mRNA.
  • the 3′ viral LTR provides the poly-adenylation sequence for the mRNA.
  • Both heavy and light chain protein for botulinum toxin antibody are produced from this signal mRNA.
  • the heavy and light chain proteins also appear to be formed in an equal molar ratio to each other.
  • Polybrene was added to the medium following addition to cells.
  • the virus containing medium was allowed to remain on the 293GP SD cells for 24 hours. Following the 16 hour infection period (on day 5), the medium was removed from the 293GP SD cells and was replaced with fresh medium containing 400 ⁇ g/ml G418 (GIBCO/BRL). The medium was changed approximately every 3 days until G418-resistant colonies appeared approximately two weeks later.
  • the G418-resistant 293 colonies were plated as single cells in 96 wells. Sixty to one hundred G418-resistant colonies were screened for the expression of the BOTDC antibody in order to identify high producing clones. The top 10 clones in 96-well plates were transferred 6-well plates and allowed to grow to confluency.
  • the top 10 clones were then expanded to screen for high titer production. Based on protein expression and titer production, 5 clonal cell lines were selected. One line was designated the master cell bank and the other 4 as backup cell lines. Pseudotyped vector was generated as follows. Approximately 1 ⁇ 10 6 293GP SD /LNBOTDC cells were placed into a 75 cm 2 tissue culture flask. Twenty-four hours later, the cells were transfected with 25 ⁇ g of pHCMV-G plasmid DNA using calcium phosphate co-precipitation. Six to eight hours after the calcium-DNA precipitate was applied to the cells, the DNA solution was replaced with fresh culture medium (lacking G41 8).
  • the pseudotyped LNBOTDC virus generated from the transfected 293GP SD /LNBOTDC cells can be collected at least once a day between 24 and 96 hr after transfection.
  • the highest virus titer was generated approximately 48 to 72 hr after initial pHCMV-G transfection. While syncytium formation became visible about 48 hr after transfection in the majority of the transfected cells, the cells continued to generate pseudotyped virus for at least an additional 48 hr as long as the cells remained attached to the tissue culture plate.
  • the collected culture medium containing the VSV G-pseudotyped LNBOTDC virus was pooled, filtered through a 0.45 ⁇ m filter and stored at ⁇ 80° C. or concentrated immediately and then stored at ⁇ 80° C.
  • the titer of the VSV G-pseudotyped LNBOTDC virus was then determined as follows. Approximately 5 ⁇ 10 4 rat 208F fibroblasts cells were plated into 6 well plates. Twenty-fours hours after plating, the cells were infected with serial dilutions of the LNBOTDC virus-containing culture medium in the presence of 8 ⁇ g/ml polybrene. Twenty four hours after infection with virus, the medium was replaced with fresh medium containing 400 ⁇ g/ml G418 and selection was continued for 14 days until G418-resistant colonies became visible. Viral titers were typically about 0.5 to 5.0 ⁇ 10 6 colony forming units (cfu)/ml. The titer of the virus stock could be concentrated to a titer of greater than 10 9 cfu/ml as described below.
  • VSV G-pseudotyped LNBOTDC viruses were concentrated to a high titer by one cycle of ultracentrifugation. However, two cycles can be performed for further concentration.
  • the frozen culture medium collected as described in Example 2 which contained pseudotyped LNBOTDC virus was thawed in a 37° C. water bath and was then transferred to Oakridge centrifuge tubes (50 ml Oakridge tubes with sealing caps, Nalge Nunc International) previously sterilized by autoclaving. The virus was sedimented in a JA20 rotor (Beckman) at 48,000 ⁇ g (20,000 rpm) at 4° C. for 120 min.
  • the culture medium was then removed from the tubes in a biosafety hood and the media remaining in the tubes was aspirated to remove the supernatent.
  • the virus pellet was resuspended to 0.5 to 1% of the original volume of culture medium DMEM.
  • the resuspended virus pellet was incubated overnight at 4° C. without swirling.
  • the virus pellet could be dispersed with gentle pipetting after the overnight incubation without significant loss of infectious virus.
  • the titer of the virus stock was routinely increased 100- to 300-fold after one round of ultracentrifugation. The efficiency of recovery of infectious virus varied between 30 and 100%.
  • the virus stock was then subjected to low speed centrifugation in a microfuge for 5 min at 4° C. to remove any visible cell debris or aggregated virions that were not resuspended under the above conditions. It was noted that if the virus stock is not to be used for injection into oocytes or embryos, this centrifugation step may be omitted.
  • the virus stock can be subjected to another round of ultracentrifugation to further concentrate the virus stock.
  • the resuspended virus from the first round of centrifugation is pooled and pelleted by a second round of ultracentrifugation which is performed as described above.
  • Viral titers are increased approximately 2000-fold after the second round of ultracentrifugation (titers of the pseudotyped LNBOTDC virus are typically greater than or equal to 1 ⁇ 10 9 cfu/ml after the second round of ultracentrifugation).
  • the titers of the pre- and post-centrifugation fluids were determined by infection of 208F cells (NIH 3T3 or bovine mammary epithelial cells can also be employed) followed by selection of G418-resistant colonies as described above in Example 2.
  • the concentrated pseudotyped retroviruses were resuspended in 0.1 ⁇ HBS (2.5 mM HEPES, pH 7.12, 14 mM NaCl, 75 ⁇ M Na 2 HPO 4 -H 2 O) and 18 pl aliquots were placed in 0.5 ml vials (Eppendorf) and stored at ⁇ 80° C. until used.
  • the titer of the concentrated vector was determined by diluting 1 ⁇ l of the concentrated virus 10 ⁇ 7 - or 10 ⁇ 8 -fold with 0.1 ⁇ HBS.
  • the diluted virus solution was then used to infect 208F and bovine mammary epithelial cells and viral titers were determined as described in Example 2.
  • This Example describes the production of antibody MN14 from cells transfected with a high number of integrating vectors.
  • Pseudotyped vector were made from the packaging cell lines for the following vectors: CMV MN14, ⁇ -LA MN14, and MMTV MN14.
  • Rat fibroblasts 208F cells
  • MDBK cells bovine kidney cells
  • bovine mammary epithelial cells were transfected at a multiplicity of infection of 1000.
  • One thousand cells were plated in a T25 flask and 10 6 colony forming units (CFU's) of vector in 3 ml media was incubated with the cells. The duration of the infection was 24 hr, followed by a media change. Following transfection, the cells were allowed to grow and become confluent.
  • the cell lines were grown to confluency in T25 flasks and 5 ml of media was changed daily. The media was assayed daily for the presence of MN14. All of the MN14 produced is active (an ELISA to detect human IgG gave the exact same values as the CEA binding ELISA) and Western blotting has shown that the heavy and light chains are produced at a ratio that appears to be a 1:1 ratio. In addition, a non-denaturing Western blot indicated that what appeared to be 100% of the antibody complexes were correctly formed (See FIG.
  • Lane 1 Lane 1, 85 ng control Mn14; Lane 2, bovine mammary cell line, ⁇ -LA promoter; Lane 3, bovine mammary cell line, CMV promoter; Lane 4, bovine kidney cell line, ⁇ -LA promoter; Lane 5, bovine kidney cell line, CMV promoter; Lane 6, 208 cell line, ⁇ -LA promoter; Lane 7, 208 cell line, CMV promoter)).
  • FIG. 2 is a graph showing the production of MN14 over time for four cell lines.
  • the Y axis shows MN14 production in ng/ml of media.
  • the X-axis shows the day of media collection for the experiment.
  • Four sets of data are shown on the graph.
  • the comparisons are between the CMV and ⁇ -LA promoter and between the 208 cells and the bovine mammary cells.
  • the bovine mammary cell line exhibited the highest expression, followed by the 208F cells and MDBK cells.
  • the CMV driven construct demonstrated the highest level of expression, followed by the ⁇ -LA driven gene construct and the MMTV construct.
  • the level of daily production of the CMV construct was 4.5 ⁇ g/ml of media (22.5 mg/day in a T25 flask). The level of expression subsequently increased slowly to 40 ⁇ g/day as the cells became very densely confluent over the subsequent week.
  • 2.7 L of media from an ⁇ -lac-MN14 packaging cell line was processed by affinity chromatography to produce a purified stock of MN14.
  • FIG. 3 is a western blot of a 15% SDS-PAGE gel run under denaturing conditions in order to separate the heavy and light chains of the MN14 antibody.
  • Lane 1 shows MN14 from bovine mammary cell line, hybrid ⁇ -LA promoter;
  • lane 2 shows MN14 from bovine mammary cell line, CMV promoter;
  • lane 3 shows MN14 from bovine kidney cell line, hybrid ⁇ LA promoter;
  • lane 4 shows MN14 from bovine kidney cell line, CMV promoter;
  • lane 5 shows MN14 from rat fibroblast cell line, hybrid ⁇ -LA promoter;
  • lane 6 shows MN14 from rat fibroblast, CMV promoter.
  • the results show that the heavy and light chains are produced in a ratio of approximately 1:1.
  • This Example describes the quantitation of the amount of protein produced per cell in cell cultures produced according to the invention.
  • Various cells 208F cells, MDBK cells, and bovine mammary cells
  • Various cells 208F cells, MDBK cells, and bovine mammary cells
  • Three different vectors were used to infect the three cells types (CMV-MN14, MMTV-MN14, and ⁇ -LA-MN14) at an MOI of 1000 (titers: 2.8 ⁇ 10 6 , 4.9 ⁇ 10 6 , and 4.3 ⁇ 10 6 , respectively).
  • Media was collected approximately every 24 hours from all cells. Following one month of media collection, the 208F and MDBK cells were discarded due to poor health and low MN14 expression.
  • the cells were passaged to T25 flasks and collection of media from the bovine mammary cells was continued for approximately 2 months with continued expression of MN14. After two months in T25 flasks, the cells with CMV promoters were producing 22.5 pg/cell/day and the cells with ⁇ LA promoters were producing 2.5 pg MN14/cell/day.
  • roller bottles (850 cm 2 ) were seeded to scale-up production and to determine if MN14 expression was stable following multiple passages.
  • Two roller bottles were seeded with bovine mammary cells expressing MN14 from a CMV promoter and two roller bottles were seeded with bovine mammary cells expressing MN14 from the ⁇ -LA promoter.
  • the cultures reached confluency after approximately two weeks and continue to express MN14.
  • Roller bottle expression is shown in Table 1 below.
  • This Example demonstrates the expression of antibody LL2 by bovine mammary cells and 293 human kidney fibroblast cells.
  • Bovine mammary cells were infected with vector CMV LL2 (7.85 ⁇ 10 7 CFU/ml) at MOI's of 1000 and 10,000 and plated in 25cm 2 culture dishes. None of the cells survived transfection at the MOI of 10,000. At 20% confluency, 250 ng/ml of LL2 was present in the media. Active LL2 antibody was produced by both cell types. Non-denaturing and denaturing western analysis demonstrated that all the antibody produced is active and correctly assembled in approximately a 1:1 ratio of heavy:light chain.
  • This Example demonstrates the expression of botulinum toxin antibody in bovine mammary cells.
  • Bovine mammary cells were infected with vector ⁇ -LA Bot (2.2 ⁇ 10 2 CFU/ml) and plated in 25 cm 2 culture dishes. At 100% confluency, 6 ng/ml of botulinum toxin antibody was present in the media.
  • This Example demonstrates the expression of Hepatitis B Surface Antigen antibody in bovine mammary cells.
  • Bovine mammary cells were infected with vector LSRNL (350 CFU/ml) and plated in 25 cm 2 culture dishes. At 100% confluency, 20 ng/ml of Hepatitis B Surface Antigen was present in the media.
  • This Example demonstrates the expression of cc49IL2 in bovine mammary cells and human kidney fibroblast cells.
  • Bovine mammary cells were infected with vector LSRNL (3.1 ⁇ 10 5 CFU/ml) at a MOI of 1000 and plated in 25 cm 2 culture dishes. At 100% confluency, 10 ⁇ g/ml of cc49IL2 was present in the media.
  • Human kidney fibroblast (293) cells were infected with the ⁇ -LA cc49IL2 vector. Active cc49-IL2 fusion protein was produced by the cells.
  • This Example demonstrates the production of Yersinea pestis antibody by bovine mammary epithelial cells and human kidney fibroblast cells (293 cells). Cells lines were infected with the ⁇ -LA YP vector. Both of the cell lines produced YP antibody. All of the antibody is active and the heavy and light chains are produced in a ratio approximating 1:1.
  • This Example demonstrates the expression of multiple proteins in bovine mammary cells.
  • Mammary cells producing MN14 infected with CMV-MN14 vector
  • cc49IL2 vector 3.1 ⁇ 10 5 CFU/ml
  • 1000 cells were plated in 25 cm 2 culture plates.
  • the cells expressed MN14 at 2.5 ⁇ g/ml and cc49IL2 at 5 ⁇ g/ml.
  • This Example demonstrates the expression of multiple proteins in bovine mammary cells.
  • Mammary cells producing MN14 infected with CMV-MN14 vector
  • LSNRL vector 100 CFU/ml
  • 1000 cells were plated in 25 cm 2 culture plates.
  • the cells expressed MN14 at 2.5 ⁇ g/ml and hepatitis surface antigen at 150 ng/ml.
  • This Example demonstrates the expression of multiple proteins in bovine mammary cells.
  • Mammary cells producing hepatitis B surface antigen (infected with LSRNL vector) were infected with cc49IL2 vector at an MOI of 1000, and 1000 cells were plated in 25 cm 2 culture plates. At 100% confluency, the cells expressed MN14 at 2.4 and hepatitis B surface antigen at 13.
  • This Example demonstrates the culture of transfected cells in roller bottle cultures.
  • 208F cells and bovine mammary cells were plated in 25 cm 2 culture dishes at 1000 cells/25 cm 2 .
  • LSRNL or ⁇ -LA Bot vectors were used to infect each cell line at a MOI of 1000. Following one month of culture and media collection, the 208F cells were discarded due to poor growth and plating.
  • the bovine mammary cells infected with ⁇ -LA Bot were discarded due to low protein expression.
  • the bovine mammary cells infected with LSRNL were passaged to seed roller bottles (850 cm 2 ). Approximately 20 ng/ml hepatitis type B surface antigen was produced in the roller bottle cultures.
  • This example describes the expression of a G-Protein Coupled Receptor protein (GPCR) from a retroviral vector.
  • GPCR G-Protein Coupled Receptor protein
  • This example also describes the expression of a signal protein from an IRES as a marker for expression of a difficult to assay protein or a protein that has no assay such as a GPCR.
  • the gene construct (SEQ ID NO: 34; FIG. 17) comprises a G-protein-coupled receptor followed by the IRES-signal peptide-antibody light chain cloned into the MCS of pLBCX retroviral backbone.
  • pLBCX contains the EM7 (T7) promoter, Blasticidin gene and SV40 polyA in place of the Neomycin resistance gene from pLNCX.
  • the gene construct was used to produce a replication defective retroviral packaging cell line and this cell line was used to produce replication defective retroviral vector.
  • the vector produced from this cell line was then used to infect 293GP cells (human embryonic kidney cells). After infection, the cells were placed under Blasticidin selection and single cell Blasticidin resistant clones were isolated. The clones were screened for expression of antibody light chain. The top 12 light chain expressing clones were selected. These 12 light chain expressing clones were then screened for expression of the GPCR using a ligand binding assay. All twelve of the samples also expressed the receptor protein.
  • the clonal cell lines and there expression are shown in Table 2. TABLE 2 Cell Clone Antibody Light GPCR Number Chain Expression Expression 4 + + 8 + + 13 + + 19 + + + 20 + + 22 + + 24 + + + 27 + + 30 + + + 45 + + + 46 + + 50 + + + +

Abstract

The present invention provides novel regulatory elements and vectors for the expression of one or more proteins in a host cell. The present invention also provides methods for expressing one or more proteins, such as antibodies, in a host cell. These methods utilize the novel regulatory elements and vectors of the present invention for the expression of proteins in a host cell. The host cells are used for producing various protein products, including but not limited to pharmaceutical proteins, antibodies, variants of proteins for use in screening assays, and for direct use in high throughput screening.

Description

  • This application claims priority to provisional application 60/215,851, filed Jul. 3, 2000.[0001]
  • FIELD OF THE INVENTION
  • The present invention relates to novel regulatory elements and vectors for the expression of one or more proteins in a host cell. [0002]
  • BACKGROUND OF THE INVENTION
  • Methods for expression of recombinant proteins in bacterial host are widespread and offer ease of use and purification of the recombinant product. However, use of these systems for the expression of eukaryotic proteins is often limited by problems of insolubility and lack of proper post-transcription and post-translational processing (see, e.g., U.S. Pat. No. 5,721,121, incorporated herein by reference). Thus, eukaryotic expression systems are generally used for the expression of eukaryotic proteins. In particular, the pharmaceutical biotechnology industry relies heavily on the production of recombinant proteins in mammalian cells. These recombinantly produced proteins are essential to the therapeutic treatment of many diseases and conditions. In many cases, the market for these proteins exceeds a billion dollars a year. Examples of proteins produced recombinantly in mammalian cells include erythropoietin, factor VIII, factor IX, and insulin. In addition, recombinant antibodies are often used as therapeutic agents. Clinical applications of recombinantly produced proteins, in particular antibodies, often require large amounts of highly purified proteins. Proteins are generally produced in either mammalian cell culture or in transgenic animals. [0003]
  • Vectors for transferring the gene of interest into mammalian cells are widely available, including plasmids, retroviral vectors, and adenoviral vectors. Retroviral vectors are widely used as vehicles for delivery of genes into mammalian cells (See e.g., Vile and Russell, British Medical Bulletin, 51:12 [1995]). However, current methods for creating mammalian cell lines for expression of recombinant proteins suffer from several drawbacks. (See, e.g., Mielke et al., Biochem. 35:2239-52 [1996]). Episomal systems allow for high expression levels of the recombinant protein, but are frequently only stable for a short time period (See, e.g., Klehr and Bode, Mol. Genet. (Life Sci. Adv.) 7:47-52 [1988]). Mammalian cell lines containing integrated exogenous genes are somewhat more stable, but there is increasing evidence that stability depends on the presence of only a few copies or even a single copy of the exogenous gene. Vectors are often unstable, resulting in a decrease in the level of protein expression over time. [0004]
  • Based on overall product yield, expression of recombinant proteins in animals results in higher yields, relative to expression in cell culture (See e.g., Werner et al., Arzneimittelforshcung, 48:870 [1998]; Pollock et al., J. Immunol. Methods, 231:147 [1999]). However, expression in transgenic animals is limited by methods of producing transgenic mammals, variation in production and purity, and the life span of the animal. [0005]
  • Thus, despite continued efforts in the field, vectors for high level, continuous expression of one or more proteins in a host cell remain needed in the art. [0006]
  • SUMMARY OF THE INVENTION
  • The present invention relates to novel regulatory elements and vectors for the expression of one or more proteins in a host cell. [0007]
  • In some embodiments, the present invention provides a hybrid α-lactalbumin promoter comprising at least one portion derived from a first mammalian α-lactalbumin promoter and at least one portion derived from a second mammalian α-lactalbumin promoter. The present invention is not limited to portions derived from any particular α-lactalbumin promoter. Indeed, portions from a variety of α-lactalbumin promoters are contemplated, including, but not limited to bovine, human, ovine, caprine, and murine α-lactalbumin promoters. In other embodiments, the present invention provides a nucleic acid comprising a nucleic acid sequence selected from the group consisting of SEQ ID NO:1 and sequences hybridizable to SEQ ID NO:1 under low stringency conditions, wherein the nucleic acid contains sequences derived from at least two mammalian sources and causes mammary specific gene expression. In still other embodiments, the present invention provides a nucleic acid sequence encoding a hybrid bovine/human alpha lactalbumin (αLA) promoter/enhancer (i.e., SEQ ID NO:1) and sequences that are hybridizable to a hybrid bovine/human α-LA promoter under low to high stringency conditions. In preferred embodiments, these sequences drive the expression of an exogenous gene in the mammary gland of a transgenic animal. In some embodiments, the hybridizable sequence comprises human and bovine elements. In other embodiments, the present invention provides a vector containing the nucleic acid sequence of hybrid bovine/human α-LA promoter. In some embodiments, the vector is a retroviral vector. In still further embodiments, the present invention provides a host cell containing a vector containing a hybrid bovine/human α-LA promoter. [0008]
  • The present invention also provides a nucleic acid encoding a mutant RNA export element (PPE element; SEQ ID NO:2) and sequences that are hybridizable to a mutant PPE element. In some embodiments, the sequences hybridizable to a mutant PPE element contain ATG sequences that have been mutated at at least one of the positions corresponding to [0009] nucleic acid residues 4, 112, 131, and 238 of the wild-type PPE element. In preferred embodiments, these sequences enhance the export from the nucleus of the RNA to which they are operably linked. In other embodiments, the present invention provides a vector containing the nucleic acid sequence of the mutant PPE element. In some embodiments, the vector is a retroviral vector. In still further embodiments, the present invention provides a host cell containing a vector that contains a mutant PPE element.
  • The present invention also provides a nucleic acid encoding an IRES coding sequence and a signal peptide coding sequence, wherein said IRES and signal peptide coding sequences are adjacent to one another. In some embodiments, the IRES/signal peptide sequence comprises SEQ ID NO:3 or SEQ ID NO:12 and sequences that are hybridizable to these sequences under low stringency conditions. In preferred embodiments, these sequences interact with a ribosome and provide for the secretion of proteins to which they are operably linked. The present invention is not limited to any particular signal sequence peptide. Indeed, it is contemplated that a variety of signal peptides find use in the present invention. In some embodiments, the signal peptide sequence is selected from alpha-casein, human growth hormone, or α-lactalbumin signal peptide sequences. In other embodiments, the present invention provides a vector containing the nucleic acid sequence of the IRES/signal peptide sequence. In some embodiments, the vector is a retroviral vector. In still further embodiments, the present invention provides a host cell containing a vector that contains a IRES/signal peptide sequence. [0010]
  • The present invention also provides methods for producing a protein of interest. In some embodiments, the methods comprise providing a host cell and a vector containing at least one exogenous gene operably linked to a bovine/human hybrid α-lactalbumin promoter and introducing the vector to the host cell under conditions such that expression of the protein encoded by the exogenous gene is expressed. In some embodiments, the vector further contains a mutant RNA export element. In other embodiments, the vector contains at least two exogenous genes. In still further embodiments, the two or more exogenous genes are arranged in a polycistronic sequence separated by an internal ribosome entry site/bovine α-lactalbumin signal peptide. [0011]
  • The present invention also provides methods for expressing at least two proteins in a polycistronic sequence. In some embodiments, the proteins are unrelated, while in other embodiments, the proteins are subunits of a multisubunit protein. In some preferred embodiments, the present invention provides methods for producing an immunoglobulin including providing a host cell and a vector comprising a first exogenous gene and a second exogenous gene, wherein the first exogenous gene encodes a first immunoglobulin chain and wherein the second exogenous gene encodes a second immunoglobulin chain, and wherein the first and the second genes are separated by an internal ribosome entry site, and introducing the vector to the host cell under conditions such the first immunoglobulin chain and the second immunoglobulin chain encoded by the first and second exogenous genes are expressed. In some embodiments, the first immunoglobulin chain is an immunoglobulin light chain (e.g., λ or κ) and the second immunoglobulin chain is an immunoglobulin heavy chain (e.g., γ, α, μ, δ, or ε). In other embodiments, the first immunoglobulin chain is an immunoglobulin heavy chain (e.g., γ, α, μ, δ, or ε) and the second immunoglobulin chain is an immunoglobulin light chain (e.g., λ or κ). In some embodiments, the vector is a retroviral vector. In other embodiments, the vector further contains a bovine α-lactalbumin signal peptide. In still further embodiments, the vector further contains a bovine/human hybrid α-lactalbumin promoter. In yet other embodiments, the first immunoglobulin chain and the second immunoglobulin chain are expressed at a ratio of about 0.9:1.1 to 1:1. The present invention also provides immunoglobulins produced by the methods described herein. The present invention is not limited to the use of any particular vector. Indeed, it is contemplated that a variety of vectors find use in the present invention, including, but not limited to plasmid and retroviral vectors. In some preferred embodiments, the retroviral vector is pseudotyped. [0012]
  • In still further embodiments, the present invention provides methods of indirectly detecting the expression of a protein of interest comprising providing a host cell transduced or transfected with a vector encoding a polycistronic sequence, wherein the polycistronic sequence comprises a signal protein and a protein of interest operably linked by an IRES, and culturing the host cells under conditions such that the signal protein and protein of interest are produced, wherein the presence of the signal protein indicates the presence of the protein of interest. The methods of the present invention are not limited to the expression of any particular protein of interest. Indeed, the expression of a variety of proteins of interest is contemplated, including, but not limited to, G-protein coupled receptors. The present invention is not limited to the use of any particular signal protein. Indeed, the use of variety of signal proteins is contemplated, including, but not limited to, immunoglobulin heavy and light chains, beta-galactosidase, beta-lactamase, green fluorescent protein, and luciferase. In particularly preferred embodiments, expression of the signal protein and protein of interest is driven by the same promoter and the signal protein and protein of interest are transcribed as a single transcriptional unit.[0013]
  • DESCRIPTION OF THE FIGURES
  • FIG. 1 is a Western blot of a 15% SDS-PAGE gel run under denaturing conditions and probed with anti-human IgG (Fc) and anti-human IgG (kappa). [0014]
  • FIG. 2 is a graph of MN14 expression over time. [0015]
  • FIG. 3 is a Western blot of a 15% PAGE run under non-denaturing conditions and probed with anti-human IgG (Fc) and anti-human IgG (Kappa). [0016]
  • FIG. 4 provides the sequence for the hybrid human-bovine alpha-lactalbumin promoter (SEQ ID NO:1). [0017]
  • FIG. 5 provides the sequence for the mutated PPE sequence (SEQ ID NO:2). [0018]
  • FIG. 6 provides the sequence for the IRES-Signal peptide sequence (SEQ ID NO:3). [0019]
  • FIGS. 7[0020] a and 7 b provide the sequence for CMV MN14 vector (SEQ ID NO:4).
  • FIGS. 8[0021] a and 8 b provide the sequence for the CMV LL2 vector (SEQ ID NO:5).
  • FIGS. 9[0022] a-c provide the sequence for the MMTV MN14 vector (SEQ ID NO:6).
  • FIGS. 10[0023] a-d provide the sequence for the alpha-lactalbumin MM14 Vector (SEQ ID NO:7).
  • FIGS. 11[0024] a-c provide the sequence for the alpha-lactalbumin Bot vector (SEQ ID NO:8).
  • FIGS. 12[0025] a-b provide the sequence for the LSRNL vector (SEQ ID NO:9).
  • FIGS. 13[0026] a-b provide the sequence for the alpha-lactalbumin cc49IL2 vector (SEQ ID NO:10).
  • FIGS. 14[0027] a-c provides the sequence for the alpha-lactalbumin YP vector (SEQ ID NO:11).
  • FIG. 15 provides the sequence for the IRES-Casein signal peptide sequence (SEQ ID NO:12). [0028]
  • FIGS. 16[0029] a-c provide the sequence for the LNBOTDC vector (SEQ ID NO:13).
  • FIGS. 17[0030] a-d provide the sequence of a retroviral vector that expresses a G-Protein coupled receptor and antibody light chain.
  • DEFINITIONS
  • To facilitate understanding of the invention, a number of terms are defined below. [0031]
  • As used herein, the term “host cell” refers to any eukaryotic cell (e.g., mammalian cells, avian cells, amphibian cells, plant cells, fish cells, and insect cells), whether located in vitro or in vivo. [0032]
  • As used herein, the term “cell culture” refers to any in vitro culture of cells. Included within this term are continuous cell lines (e.g., with an immortal phenotype), primary cell cultures, finite cell lines (e.g., non-transformed cells), and any other cell population maintained in vitro, including oocytes and embryos. [0033]
  • As used herein, the term “vector” refers to any genetic element, such as a plasmid, phage, transposon, cosmid, chromosome, virus, virion, etc., which is capable of replication when associated with the proper control elements and which can transfer gene sequences between cells. Thus, the term includes cloning and expression vehicles, as well as viral vectors. [0034]
  • As used herein, the term “integrating vector” refers to a vector whose integration or insertion into a nucleic acid (e.g., a chromosome) is accomplished via an integrase. Examples of “integrating vectors” include, but are not limited to, retroviral vectors, transposons, and adeno associated virus vectors. [0035]
  • As used herein, the term “integrated” refers to a vector that is stably inserted into the genome (i.e., into a chromosome) of a host cell. [0036]
  • As used herein, the term “multiplicity of infection” or “MOI” refers to the ratio of integrating vectors:host cells used during transfection or transduction of host cells. For example, if 1,000,000 vectors are used to transduce 100,000 host cells, the multiplicity of infection is 10. The use of this term is not limited to events involving transduction, but instead encompasses introduction of a vector into a host by methods such as lipofection, microinjection, calcium phosphate precipitation, and electroporation. [0037]
  • As used herein, the term “genome” refers to the genetic material (e.g., chromosomes) of an organism. [0038]
  • The term “nucleotide sequence of interest” refers to any nucleotide sequence (e.g., RNA or DNA), the manipulation of which may be deemed desirable for any reason (e.g., treat disease, confer improved qualities, expression of a protein of interest in a host cell, etc.), by one of ordinary skill in the art. Such nucleotide sequences include, but are not limited to, coding sequences of structural genes (e.g., reporter genes, selection marker genes, oncogenes, drug resistance genes, growth factors, etc.), and non-coding regulatory sequences which do not encode an mRNA or protein product (e.g., promoter sequence, polyadenylation sequence, termination sequence, enhancer sequence, etc.). [0039]
  • As used herein, the term “protein of interest” refers to a protein encoded by a nucleic acid of interest. [0040]
  • As used herein, the term “signal protein” refers to a protein that is co-expressed with a protein of interest and which, when detected by a suitable assay, provides indirect evidence of expression of the protein of interest. Examples of signal protein useful in the present invention include, but are not limited to, immunoglobulin heavy and light chains, beta-galactosidase, beta-lactamase, green fluorescent protein, and luciferase. [0041]
  • As used herein, the term “exogenous gene” refers to a gene that is not naturally present in a host organism or cell, or is artificially introduced into a host organism or cell. [0042]
  • The term “gene” refers to a nucleic acid (e.g., DNA or RNA) sequence that comprises coding sequences necessary for the production of a polypeptide or precursor (e.g., proinsulin). The polypeptide can be encoded by a full length coding sequence or by any portion of the coding sequence so long as the desired activity or functional properties (e.g., enzymatic activity, ligand binding, signal transduction, etc.) of the full-length or fragment are retained. The term also encompasses the coding region of a structural gene and includes sequences located adjacent to the coding region on both the 5[0043] 40 and 3′ ends for a distance of about 1 kb or more on either end such that the gene corresponds to the length of the full-length mRNA. The sequences that are located 5′ of the coding region and which are present on the mRNA are referred to as 5′ untranslated sequences. The sequences that are located 3′ or downstream of the coding region and which are present on the mRNA are referred to as 3′ untranslated sequences. The term “gene” encompasses both cDNA and genomic forms of a gene. A genomic form or clone of a gene contains the coding region interrupted with non-coding sequences termed “introns” or “intervening regions” or “intervening sequences.” Introns are segments of a gene which are transcribed into nuclear RNA (hnRNA); introns may contain regulatory elements such as enhancers. Introns are removed or “spliced out” from the nuclear or primary transcript; introns therefore are absent in the messenger RNA (mRNA) transcript. The mRNA functions during translation to specify the sequence or order of amino acids in a nascent polypeptide.
  • As used herein, the term “gene expression” refers to the process of converting genetic information encoded in a gene into RNA (e.g., mRNA, rRNA, tRNA, or snRNA) through “transcription” of the gene (i.e., via the enzymatic action of an RNA polymerase), and for protein encoding genes, into protein through “translation” of mRNA. Gene expression can be regulated at many stages in the process. “Up-regulation” or “activation” refers to regulation that increases the production of gene expression products (i.e., RNA or protein), while “down-regulation” or “repression” refers to regulation that decrease production. Molecules (e.g., transcription factors) that are involved in up-regulation or down-regulation are often called “activators” and “repressors,” respectively. [0044]
  • Where “amino acid sequence” is recited herein to refer to an amino acid sequence of a naturally occurring protein molecule, “amino acid sequence” and like terms, such as “polypeptide” or “protein” are not meant to limit the amino acid sequence to the complete, native amino acid sequence associated with the recited protein molecule. [0045]
  • As used herein, the terms “nucleic acid molecule encoding,” “DNA sequence encoding,” “DNA encoding,” “RNA sequence encoding,” and “RNA encoding” refer to the order or sequence of deoxyribonucleotides or ribonucleotides along a strand of deoxyribonucleic acid or ribonucleic acid. The order of these deoxyribonucleotides or ribonucleotides determines the order of amino acids along the polypeptide (protein) chain. The DNA or RNA sequence thus codes for the amino acid sequence. [0046]
  • As used herein, the term “variant,” when used in reference to a protein, refers to proteins encoded by partially homologous nucleic acids so that the amino acid sequence of the proteins varies. As used herein, the term “variant” encompasses proteins encoded by homologous genes having both conservative and nonconservative amino acid substitutions that do not result in a change in protein function, as well as proteins encoded by homologous genes having amino acid substitutions that cause decreased (e.g., null mutations) protein function or increased protein function. [0047]
  • As used herein, the terms “complementary” or “complementarity” are used in reference to polynucleotides (i.e., a sequence of nucleotides) related by the base-pairing rules. For example, for the sequence “A-G-T,” is complementary to the sequence “T-C-A.”Complementarity may be “partial,” in which only some of the nucleic acids' bases are matched according to the base pairing rules. Or, there may be “complete” or “total” complementarity between the nucleic acids. The degree of complementarity between nucleic acid strands has significant effects on the efficiency and strength of hybridization between nucleic acid strands. This is of particular importance in amplification reactions, as well as detection methods that depend upon binding between nucleic acids. [0048]
  • The terms “homology” and “percent identity” when used in relation to nucleic acids refers to a degree of complementarity. There may be partial homology (i.e., partial identity) or complete homology (i.e., complete identity). A partially complementary sequence is one that at least partially inhibits a completely complementary sequence from hybridizing to a target nucleic acid sequence and is referred to using the functional term “substantially homologous.” The inhibition of hybridization of the completely complementary sequence to the target sequence may be examined using a hybridization assay (Southern or Northern blot, solution hybridization and the like) under conditions of low stringency. A substantially homologous sequence or probe (i.e., an oligonucleotide which is capable of hybridizing to another oligonucleotide of interest) will compete for and inhibit the binding (i.e., the hybridization) of a completely homologous sequence to a target sequence under conditions of low stringency. This is not to say that conditions of low stringency are such that non-specific binding is permitted; low stringency conditions require that the binding of two sequences to one another be a specific (i.e., selective) interaction. The absence of non-specific binding may be tested by the use of a second target which lacks even a partial degree of complementarity (e.g., less than about 30% identity); in the absence of non-specific binding the probe will not hybridize to the second non-complementary target. [0049]
  • The art knows well that numerous equivalent conditions may be employed to comprise low stringency conditions; factors such as the length and nature (DNA, RNA, base composition) of the probe and nature of the target (DNA, RNA, base composition, present in solution or immobilized, etc.) and the concentration of the salts and other components (e.g., the presence or absence of formamide, dextran sulfate, polyethylene glycol) are considered and the hybridization solution may be varied to generate conditions of low stringency hybridization different from, but equivalent to, the above listed conditions. In addition, the art knows conditions that promote hybridization under conditions of high stringency (e.g., increasing the temperature of the hybridization and/or wash steps, the use of formamide in the hybridization solution, etc.). [0050]
  • When used in reference to a double-stranded nucleic acid sequence such as a cDNA or genomic clone, the term “substantially homologous” refers to any probe that can hybridize to either or both strands of the double-stranded nucleic acid sequence under conditions of low stringency as described above. [0051]
  • When used in reference to a single-stranded nucleic acid sequence, the term “substantially homologous” refers to any probe that can hybridize (i.e., it is the complement of) the single-stranded nucleic acid sequence under conditions of low stringency as described above. [0052]
  • As used herein, the term “hybridization” is used in reference to the pairing of complementary nucleic acids. Hybridization and the strength of hybridization (i.e., the strength of the association between the nucleic acids) is impacted by such factors as the degree of complementary between the nucleic acids, stringency of the conditions involved, the T[0053] m of the formed hybrid, and the G:C ratio within the nucleic acids. A single molecule that contains pairing of complementary nucleic acids within its structure is said to be “self-hybridized.”
  • As used herein, the term “T[0054] m” is used in reference to the “melting temperature” of a nucleic acid. The melting temperature is the temperature at which a population of double-stranded nucleic acid molecules becomes half dissociated into single strands. The equation for calculating the Tm of nucleic acids is well known in the art. As indicated by standard references, a simple estimate of the Tm value may be calculated by the equation: Tm=81.5+0.41(% G+C), when a nucleic acid is in aqueous solution at 1 M NaCl (See e.g., Anderson and Young, Quantitative Filter Hybridization, in Nucleic Acid Hybridization [1985]). Other references include more sophisticated computations that take structural as well as sequence characteristics into account for the calculation of Tm.
  • As used herein the term “stringency” is used in reference to the conditions of temperature, ionic strength, and the presence of other compounds such as organic solvents, under which nucleic acid hybridizations are conducted. With “high stringency” conditions, nucleic acid base pairing will occur only between nucleic acid fragments that have a high frequency of complementary base sequences. Thus, conditions of “weak” or “low” stringency are often required with nucleic acids that are derived from organisms that are genetically diverse, as the frequency of complementary sequences is usually less. [0055]
  • “High stringency conditions” when used in reference to nucleic acid hybridization comprise conditions equivalent to binding or hybridization at 42° C. in a solution consisting of 5×SSPE (43.8 g/l NaCl, 6.9 g/l NaH[0056] 2PO4.H2O and 1.85 g/l EDTA, pH adjusted to 7.4 with NaOH), 0.5% SDS, 5×Denhardt's reagent and 100 μg/ml denatured salmon sperm DNA followed by washing in a solution comprising 0.1×SSPE, 1.0% SDS at 42° C. when a probe of about 500 nucleotides in length is employed.
  • “Medium stringency conditions” when used in reference to nucleic acid hybridization comprise conditions equivalent to binding or hybridization at 42° C. in a solution consisting of 5×SSPE (43.8 g/l NaCl, 6.9 g/l NaH[0057] 2PO4.H2O and 1.85 g/l EDTA, pH adjusted to 7.4 with NaOH), 0.5% SDS, 5×Denhardt's reagent and 100 μg/ml denatured salmon sperm DNA followed by washing in a solution comprising 1.0×SSPE, 1.0% SDS at 42° C. when a probe of about 500 nucleotides in length is employed.
  • “Low stringency conditions” comprise conditions equivalent to binding or hybridization at 42° C. in a solution consisting of 5×SSPE (43.8 g/l NaCl, 6.9 g/l NaH[0058] 2PO4.H2O and 1.85 g/l EDTA, pH adjusted to 7.4 with NaOH), 0.1% SDS, 5×Denhardt's reagent [50×Denhardt's contains per 500 ml: 5 g Ficoll (Type 400, Pharamcia), 5 g BSA (Fraction V; Sigma)] and 100 μg/ml denatured salmon sperm DNA followed by washing in a solution comprising 5×SSPE, 0.1% SDS at 42° C. when a probe of about 500 nucleotides in length is employed.
  • A gene may produce multiple RNA species that are generated by differential splicing of the primary RNA transcript. cDNAs that are splice variants of the same gene will contain regions of sequence identity or complete homology (representing the presence of the same exon or portion of the same exon on both cDNAs) and regions of complete non-identity (for example, representing the presence of exon “A” on [0059] cDNA 1 wherein cDNA 2 contains exon “B” instead). Because the two cDNAs contain regions of sequence identity they will both hybridize to a probe derived from the entire gene or portions of the gene containing sequences found on both cDNAs; the two splice variants are therefore substantially homologous to such a probe and to each other.
  • The terms “in operable combination,” “in operable order,” and “operably linked” as used herein refer to the linkage of nucleic acid sequences in such a manner that a nucleic acid molecule capable of directing the transcription of a given gene and/or the synthesis of a desired protein molecule is produced. The term also refers to the linkage of amino acid sequences in such a manner so that a functional protein is produced. [0060]
  • As used herein, the term “selectable marker” refers to a gene that encodes an enzymatic activity that confers the ability to grow in medium lacking what would otherwise be an essential nutrient (e.g. the HIS3 gene in yeast cells); in addition, a selectable marker may confer resistance to an antibiotic or drug upon the cell in which the selectable marker is expressed. Selectable markers may be “dominant”; a dominant selectable marker encodes an enzymatic activity that can be detected in any eukaryotic cell line. Examples of dominant selectable markers include the [0061] bacterial aminoglycoside 3′ phosphotransferase gene (also referred to as the neo gene) that confers resistance to the drug G418 in mammalian cells, the bacterial hygromycin G phosphotransferase (hyg) gene that confers resistance to the antibiotic hygromycin and the bacterial xanthine-guanine phosphoribosyl transferase gene (also referred to as the gpt gene) that confers the ability to grow in the presence of mycophenolic acid. Other selectable markers are not dominant in that their use must be in conjunction with a cell line that lacks the relevant enzyme activity. Examples of non-dominant selectable markers include the thymidine kinase (tk) gene that is used in conjunction with tk cell lines, the CAD gene which is used in conjunction with CAD-deficient cells and the mammalian hypoxanthine-guanine phosphoribosyl transferase (hprt) gene which is used in conjunction with hprt cell lines. A review of the use of selectable markers in mammalian cell lines is provided in Sambrook, J. et al., Molecular Cloning: A Laboratory Manual, 2nd ed., Cold Spring Harbor Laboratory Press, New York (1989) pp.16.9-16.15.
  • As used herein, the term “regulatory element” refers to a genetic element which controls some aspect of the expression of nucleic acid sequences. For example, a promoter is a regulatory element that facilitates the initiation of transcription of an operably linked coding region. Other regulatory elements are splicing signals, polyadenylation signals, termination signals, RNA export elements, internal ribosome entry sites, etc. (defined infra). [0062]
  • Transcriptional control signals in eukaryotes comprise “promoter” and “enhancer” elements. Promoters and enhancers consist of short arrays of DNA sequences that interact specifically with cellular proteins involved in transcription (Maniatis et al., Science 236:1237 [1987]). Promoter and enhancer elements have been isolated from a variety of eukaryotic sources including genes in yeast, insect and mammalian cells, and viruses (analogous control elements, i.e., promoters, are also found in prokaryotes). The selection of a particular promoter and enhancer depends on what cell type is to be used to express the protein of interest. Some eukaryotic promoters and enhancers have a broad host range while others are functional in a limited subset of cell types (for review see, Voss et al., Trends Biochem. Sci., 11:287 [1986]; and Maniatis et al., supra). For example, the SV40 early gene enhancer is very active in a wide variety of cell types from many mammalian species and has been widely used for the expression of proteins in mammalian cells (Dijkema et al., EMBO J. 4:761 [1985]). Two other examples of promoter/enhancer elements active in a broad range of mammalian cell types are those from the human elongation factor 1α gene (Uetsuki et al., J. Biol. Chem., 264:5791 [1989]; Kim et al., Gene 91:217 [1990]; and Mizushima and Nagata, Nuc. Acids. Res., 18:5322 [1990]) and the long terminal repeats of the Rous sarcoma virus (Gorman et al, Proc. Natl. Acad. Sci. USA 79:6777 [1982]) and the human cytomegalovirus (Boshart et al., Cell 41:521 [1985]). [0063]
  • As used herein, the term “promoter/enhancer” denotes a segment of DNA which contains sequences capable of providing both promoter and enhancer functions (i.e., the functions provided by a promoter element and an enhancer element, see above for a discussion of these functions). For example, the long terminal repeats of retroviruses contain both promoter and enhancer functions. The enhancer/promoter may be “endogenous” or “exogenous” or “heterologous.” An “endogenous” enhancer/promoter is one which is naturally linked with a given gene in the genome. An “exogenous” or “heterologous” enhancer/promoter is one which is placed in juxtaposition to a gene by means of genetic manipulation (i.e., molecular biological techniques such as cloning and recombination) such that transcription of that gene is directed by the linked enhancer/promoter. [0064]
  • Regulatory elements may be tissue specific or cell specific. The term “tissue specific” as it applies to a regulatory element refers to a regulatory element that is capable of directing selective expression of a nucleotide sequence of interest to a specific type of tissue (e.g., liver) in the relative absence of expression of the same nucleotide sequence of interest in a different type of tissue (e.g., lung). [0065]
  • Tissue specificity of a regulatory element may be evaluated by, for example, operably linking a reporter gene to a promoter sequence (which is not tissue-specific) and to the regulatory element to generate a reporter construct, introducing the reporter construct into the genome of an animal such that the reporter construct is integrated into every tissue of the resulting transgenic animal, and detecting the expression of the reporter gene (e.g., detecting mRNA, protein, or the activity of a protein encoded by the reporter gene) in different tissues of the transgenic animal. The detection of a greater level of expression of the reporter gene in one or more tissues relative to the level of expression of the reporter gene in other tissues shows that the regulatory element is “specific” for the tissues in which greater levels of expression are detected. Thus, the term “tissue-specific” (e.g., liver-specific) as used herein is a relative term that does not require absolute specificity of expression. In other words, the term “tissue-specific” does not require that one tissue have extremely high levels of expression and another tissue have no expression. It is sufficient that expression is greater in one tissue than another. By contrast, “strict” or “absolute” tissue-specific expression is meant to indicate expression in a single tissue type (e.g., liver) with no detectable expression in other tissues. [0066]
  • The term “cell type specific” as applied to a regulatory element refers to a regulatory element which is capable of directing selective expression of a nucleotide sequence of interest in a specific type of cell in the relative absence of expression of the same nucleotide sequence of interest in a different type of cell within the same tissue. The term “cell type specific” when applied to a regulatory element also means a regulatory element capable of promoting selective expression of a nucleotide sequence of interest in a region within a single tissue. [0067]
  • Cell type specificity of a regulatory element may be assessed using methods well known in the art (e.g., immunohistochemical staining and/or Northern blot analysis). Briefly, for immunohistochemical staining, tissue sections are embedded in paraffin, and paraffin sections are reacted with a primary antibody specific for the polypeptide product encoded by the nucleotide sequence of interest whose expression is regulated by the regulatory element. A labeled (e.g., peroxidase conjugated) secondary antibody specific for the primary antibody is allowed to bind to the sectioned tissue and specific binding detected (e.g., with avidin/biotin) by microscopy. Briefly, for Northern blot analysis, RNA is isolated from cells and electrophoresed on agarose gels to fractionate the RNA according to size followed by transfer of the RNA from the gel to a solid support (e.g., nitrocellulose or a nylon membrane). The immobilized RNA is then probed with a labeled oligo-deoxyribonucleotide probe or DNA probe to detect RNA species complementary to the probe used. Northern blots are a standard tool of molecular biologists. [0068]
  • The term “promoter,” “promoter element,” or “promoter sequence” as used herein, refers to a DNA sequence which when ligated to a nucleotide sequence of interest is capable of controlling the transcription of the nucleotide sequence of interest into mRNA. A promoter is typically, though not necessarily, located 5′ (i.e., upstream) of a nucleotide sequence of interest whose transcription into mRNA it controls, and provides a site for specific binding by RNA polymerase and other transcription factors for initiation of transcription. [0069]
  • Promoters may be constitutive or regulatable. The term “constitutive” when made in reference to a promoter means that the promoter is capable of directing transcription of an operably linked nucleic acid sequence in the absence of a stimulus (e.g., heat shock, chemicals, etc.). In contrast, a “regulatable” promoter is one which is capable of directing a level of transcription of an operably linked nucleic acid sequence in the presence of a stimulus (e.g., heat shock, chemicals, etc.) which is different from the level of transcription of the operably linked nucleic acid sequence in the absence of the stimulus. [0070]
  • The presence of “splicing signals” on an expression vector often results in higher levels of expression of the recombinant transcript. Splicing signals mediate the removal of introns from the primary RNA transcript and consist of a splice donor and acceptor site (Sambrook et al., [0071] Molecular Cloning: A Laboratory Manual, 2nd ed., Cold Spring Harbor Laboratory Press, New York [1989], pp. 16.7-16.8). A commonly used splice donor and acceptor site is the splice junction from the 16S RNA of SV40.
  • Efficient expression of recombinant DNA sequences in eukaryotic cells requires expression of signals directing the efficient termination and polyadenylation of the resulting transcript. Transcription termination signals are generally found downstream of the polyadenylation signal and are a few hundred nucleotides in length. The term “poly A site” or “poly A sequence” as used herein denotes a DNA sequence that directs both the termination and polyadenylation of the nascent RNA transcript. Efficient polyadenylation of the recombinant transcript is desirable as transcripts lacking a poly A tail are unstable and are rapidly degraded. The poly A signal utilized in an expression vector may be “heterologous” or “endogenous.” An endogenous poly A signal is one that is found naturally at the 3′ end of the coding region of a given gene in the genome. A heterologous poly A signal is one that is isolated from one gene and placed 3′ of another gene. A commonly used heterologous poly A signal is the SV40 poly A signal. The SV40 poly A signal is contained on a 237 bp BamHI/BclI restriction fragment and directs both termination and polyadenylation (Sambrook, supra, at 16.6-16.7). [0072]
  • Eukaryotic expression vectors may also contain “viral replicons” or “viral origins of replication.” Viral replicons are viral DNA sequences that allow for the extrachromosomal replication of a vector in a host cell expressing the appropriate replication factors. Vectors that contain either the SV40 or polyoma virus origin of replication replicate to high “copy number” (up to 10[0073] 4 copies/cell) in cells that express the appropriate viral T antigen. Vectors that contain the replicons from bovine papillomavirus or Epstein-Barr virus replicate extrachromosomally at “low copy number” (˜100 copies/cell). However, it is not intended that expression vectors be limited to any particular viral origin of replication.
  • As used herein, the term “long terminal repeat” of “LTR” refers to transcriptional control elements located in or isolated from the [0074] U3 region 5′ and 3′ of a retroviral genome. As is known in the art, long terminal repeats may be used as control elements in retroviral vectors, or isolated from the retroviral genome and used to control expression from other types of vectors.
  • As used herein, the term “secretion signal” refers to any DNA sequence which when operably linked to a recombinant DNA sequence encodes a signal peptide which is capable of causing the secretion of the recombinant polypeptide. In general, the signal peptides comprise a series of about 15 to 30 hydrophobic amino acid residues (See, e.g., Zwizinski et al., J. Biol. Chem. 255(16): 7973-77 [1980], Gray et al., Gene 39(2): 247-54 [1985], and Martial et al., Science 205: 602-607 [1979]). Such secretion signal sequences are preferably derived from genes encoding polypeptides secreted from the cell type targeted for tissue-specific expression (e.g., secreted milk proteins for expression in and secretion from mammary secretory cells). Secretory DNA sequences, however, are not limited to such sequences. Secretory DNA sequences from proteins secreted from many cell types and organisms may also be used (e.g., the secretion signals for t-PA, serum albumin, lactoferrin, and growth hormone, and secretion signals from microbial genes encoding secreted polypeptides such as from yeast, filamentous fungi, and bacteria). [0075]
  • As used herein, the terms “RNA export element” or “Pre-mRNA Processing Enhancer (PPE)” refer to 3′ and 5′ cis-acting post-transcriptional regulatory elements that enhance export of RNA from the nucleus. “PPE” elements include, but are not limited to Mertz sequences (described in U.S. Pat. Nos. 5,914,267 and 5,686,120, all of which are incorporated herein by reference) and woodchuck mRNA processing enhancer (WPRE; WO99/14310 and U.S. Pat. No. 6,136,597, each of which is incorporated herein by reference). [0076]
  • As used herein, the term “polycistronic” refers to an mRNA encoding more than polypeptide chain (See, e.g., WO 93/03143, WO 88/05486, and European Pat. No. 117058, all of which are incorporated herein by reference). Likewise, the term “arranged in polycistronic sequence” refers to the arrangement of genes encoding two different polypeptide chains in a single mRNA. [0077]
  • As used herein, the term “internal ribosome entry site” or “IRES” refers to a sequence located between polycistronic genes that permits the production of the expression product originating from the second gene by internal initiation of the translation of the dicistronic mRNA. Examples of internal ribosome entry sites include, but are not limited to, those derived from foot and mouth disease virus (FDV), encephalomyocarditis virus, poliovirus and RDV (Scheper et al, Biochem. 76: 801-809 [1994]; Meyer et al., J. Virol. 69: 2819-2824 [1995]; Jang et al., 1988, J. Virol. 62: 2636-2643 [1998]; Haller et al., J. Virol. 66: 5075-5086 [1995]). Vectors incorporating IRES's may be assembled as is known in the art. For example, a retroviral vector containing a polycistronic sequence may contain the following elements in operable association: nucleotide polylinker, gene of interest, an internal ribosome entry site and a mammalian selectable marker or another gene of interest. The polycistronic cassette is situated within the retroviral vector between the 5′ LTR and the 3′ LTR at a position such that transcription from the 5′ LTR promoter transcribes the polycistronic message cassette. The transcription of the polycistronic message cassette may also be driven by an internal promoter (e.g., cytomegalovirus promoter) or an inducible promoter, which may be preferable depending on the use. The polycistronic message cassette can further comprise a cDNA or genomic DNA (gDNA) sequence operatively associated within the polylinker. Any mammalian selectable marker can be utilized as the polycistronic message cassette mammalian selectable marker. Such mammalian selectable markers are well known to those of skill in the art and can include, but are not limited to, kanamycin/G418, hygromycin B or mycophenolic acid resistance markers. [0078]
  • As used herein, the term “retrovirus” refers to a retroviral particle which is capable of entering a cell (i.e., the particle contains a membrane-associated protein such as an envelope protein or a viral G glycoprotein which can bind to the host cell surface and facilitate entry of the viral particle into the cytoplasm of the host cell) and integrating the retroviral genome (as a double-stranded provirus) into the genome of the host cell. The term “retrovirus” encompasses Oncovirinae (e.g., Moloney murine leukemia virus (MoMOLV), Moloney murine sarcoma virus (MoMSV), and Mouse mammary tumor virus (MMTV), Spumavirinae, amd Lentivirinae (e.g., Human immunodeficiency virus, Simian immunodeficiency virus, Equine infection anemia virus, and Caprine arthritis-encephalitis virus; See, e.g., U.S. Pat. Nos. 5,994,136 and 6,013,516, both of which are incorporated herein by reference). [0079]
  • As used herein, the term “retroviral vector” refers to a retrovirus that has been modified to express a gene of interest. Retroviral vectors can be used to transfer genes efficiently into host cells by exploiting the viral infectious process. Foreign or heterologous genes cloned (i.e., inserted using molecular biological techniques) into the retroviral genome can be delivered efficiently to host cells which are susceptible to infection by the retrovirus. Through well known genetic manipulations, the replicative capacity of the retroviral genome can be destroyed. The resulting replication-defective vectors can be used to introduce new genetic material to a cell but they are unable to replicate. A helper virus or packaging cell line can be used to permit vector particle assembly and egress from the cell. Such retroviral vectors comprise a replication-deficient retroviral genome containing a nucleic acid sequence encoding at least one gene of interest (i.e., a polycistronic nucleic acid sequence can encode more than one gene of interest), a 5′ retroviral long terminal repeat (5′ LTR); and a 3′ retroviral long terminal repeat (3′ LTR). [0080]
  • The term “pseudotyped retroviral vector” refers to a retroviral vector containing a heterologous membrane protein. The term “membrane-associated protein” refers to a protein (e.g., a viral envelope glycoprotein or the G proteins of viruses in the Rhabdoviridae family such as VSV, Piry, Chandipura and Mokola) which are associated with the membrane surrounding a viral particle; these membrane-associated proteins mediate the entry of the viral particle into the host cell. The membrane associated protein may bind to specific cell surface protein receptors, as is the case for retroviral envelope proteins or the membrane-associated protein may interact with a phospholipid component of the plasma membrane of the host cell, as is the case for the G proteins derived from members of the Rhabdoviridae family. [0081]
  • The term “heterologous membrane-associated protein” refers to a membrane-associated protein which is derived from a virus which is not a member of the same viral class or family as that from which the nucleocapsid protein of the vector particle is derived. “Viral class or family” refers to the taxonomic rank of class or family, as assigned by the International Committee on Taxonomy of Viruses. [0082]
  • The term “Rhabdoviridae” refers to a family of enveloped RNA viruses that infect animals, including humans, and plants. The Rhabdoviridae family encompasses the genus Vesiculovirus which includes vesicular stomatitis virus (VSV), Cocal virus, Piry virus, Chandipura virus, and Spring viremia of carp virus (sequences encoding the Spring viremia of carp virus are available under GenBank accession number U18101). The G proteins of viruses in the Vesiculovirus genera are virally-encoded integral membrane proteins that form externally projecting homotrimeric spike glycoproteins complexes that are required for receptor binding and membrane fusion. The G proteins of viruses in the Vesiculovirus genera have a covalently bound palmititic acid (C[0083] 16) moiety. The amino acid sequences of the G proteins from the Vesiculoviruses are fairly well conserved. For example, the Piry virus G protein share about 38% identity and about 55% similarity with the VSV G proteins (several strains of VSV are known, e.g., Indiana, New Jersey, Orsay, San Juan, etc., and their G proteins are highly homologous). The Chandipura virus G protein and the VSV G proteins share about 37% identity and 52% similarity. Given the high degree of conservation (amino acid sequence) and the related functional characteristics (e.g., binding of the virus to the host cell and fusion of membranes, including syncytia formation) of the G proteins of the Vesiculoviruses, the G proteins from non-VSV Vesiculoviruses may be used in place of the VSV G protein for the pseudotyping of viral particles. The G proteins of the Lyssa viruses (another genera within the Rhabdoviridae family) also share a fair degree of conservation with the VSV G proteins and function in a similar manner (e.g., mediate fusion of membranes) and therefore may be used in place of the VSV G protein for the pseudotyping of viral particles. The Lyssa viruses include the Mokola virus and the Rabies viruses (several strains of Rabies virus are known and their G proteins have been cloned and sequenced). The Mokola virus G protein shares stretches of homology (particularly over the extracellular and transmembrane domains) with the VSV G proteins which show about 31% identity and 48% similarity with the VSV G proteins. Preferred G proteins share at least 25% identity, preferably at least 30% identity and most preferably at least 35% identity with the VSV G proteins. The VSV G protein from which New Jersey strain (the sequence of this G protein is provided in GenBank accession numbers M27165 and M21557) is employed as the reference VSV G protein.
  • As used herein, the term “lentivirus vector” refers to retroviral vectors derived from the Lentiviridae family (e.g., human immunodeficiency virus, simian immunodeficiency virus, equine infectious anemia virus, and caprine arthritis-encephalitis virus) that are capable of integrating into non-dividing cells (See, e.g., U.S. Pat. Nos. 5,994,136 and 6,013,516, both of which are incorporated herein by reference). [0084]
  • The term “pseudotyped lentivirus vector” refers to lentivirus vector containing a heterologous membrane protein (e.g., a viral envelope glycoprotein or the G proteins of viruses in the Rhabdoviridae family such as VSV, Piry, Chandipura and Mokola). [0085]
  • As used herein, the term “transposon” refers to transposable elements (e.g., Tn5, Tn7, and Tn10) that can move or transpose from one position to another in a genome. In general, the transposition is controlled by a transposase. The term “transposon vector,” as used herein, refers to a vector encoding a nucleic acid of interest flanked by the terminal ends of transposon. Examples of transposon vectors include, but are not limited to, those described in U.S. Pat. Nos. 6,027,722; 5,958,775; 5,968,785; 5,965,443; and 5,719,055, all of which are incorporated herein by reference. [0086]
  • As used herein, the term “adeno-associated virus (AAV) vector” refers to a vector derived from an adeno-associated virus serotype, including without limitation, AAV-1, AAV-2, AAV-3, AAV-4, AAV-5, AAVX7, etc. AAV vectors can have one or more of the AAV wild-type genes deleted in whole or part, preferably the rep and/or cap genes, but retain functional flanking ITR sequences. [0087]
  • AAV vectors can be constructed using recombinant techniques that are known in the art to include one or more heterologous nucleotide sequences flanked on both ends (5′ and 3′) with functional AAV ITRs. In the practice of the invention, an AAV vector can include at least one AAV ITR and a suitable promoter sequence positioned upstream of the heterologous nucleotide sequence and at least one AAV ITR positioned downstream of the heterologous sequence. A “recombinant AAV vector plasmid” refers to one type of recombinant AAV vector wherein the vector comprises a plasmid. As with AAV vectors in general, 5′ and 3′ ITRs flank the selected heterologous nucleotide sequence. [0088]
  • AAV vectors can also include transcription sequences such as polyadenylation sites, as well as selectable markers or reporter genes, enhancer sequences, and other control elements which allow for the induction of transcription. Such control elements are described above. [0089]
  • As used herein, the term “AAV virion” refers to a complete virus particle. An AAV virion may be a wild type AAV virus particle (comprising a linear, single-stranded AAV nucleic acid genome associated with an AAV capsid, i.e., a protein coat), or a recombinant AAV virus particle (described below). In this regard, single-stranded AAV nucleic acid molecules (either the sense/coding strand or the antisense/anticoding strand as those terms are generally defined) can be packaged into an AAV virion; both the sense and the antisense strands are equally infectious. [0090]
  • As used herein, the term “recombinant AAV virion” or “rAAV” is defined as an infectious, replication-defective virus composed of an AAV protein shell encapsulating (i.e., surrounding with a protein coat) a heterologous nucleotide sequence, which in turn is flanked 5′ and 3′ by AV. ITRs. A number of techniques for constructing recombinant AV. virions are known in the art (See, e.g., U.S. Pat. No. 5,173,414; WO 92/01070; WO 93/03769; Lebkowski et al., Molec. Cell. Biol. 8:3988-3996 [1988]; Vincent et al., Vaccines 90 [1990] (Cold Spring Harbor Laboratory Press); Carter, Current Opinion in Biotechnology 3:533-539 [1992]; Muzyczka, Current Topics in Microbiol. and Immunol. 158:97-129 [1992]; Kotin, Human Gene Therapy 5:793-801 [1994]; Shelling and Smith, Gene Therapy 1:165-169 [1994]; and Zhou et al., J. Exp. Med. 179:1867-1875 [1994], all of which are incorportaed herein by reference). [0091]
  • Suitable nucleotide sequences for use in AAV vectors (and, indeed, any of the vectors described herein) include any functionally relevant nucleotide sequence. Thus, the AAV vectors of the present invention can comprise any desired gene that encodes a protein that is defective or missing from a target cell genome or that encodes a non-native protein having a desired biological or therapeutic effect (e.g., an antiviral function), or the sequence can correspond to a molecule having an antisense or ribozyme function. Suitable genes include those used for the treatment of inflammatory diseases, autoimmune, chronic and infectious diseases, including such disorders as AIDS, cancer, neurological diseases, cardiovascular disease, hypercholestemia; various blood disorders including various anemias, thalasemias and hemophilia; genetic defects such as cystic fibrosis, Gaucher's Disease, adenosine deaminase (ADA) deficiency, emphysema, etc. A number of antisense oligonucleotides (e.g., short oligonucleotides complementary to sequences around the translational initiation site (AUG codon) of an mRNA) that are useful in antisense therapy for cancer and for viral diseases have been described in the art. (See, e.g., Han et al., Proc. Natl. Acad. Sci. USA 88:4313-4317 [1991]; Uhlmann et al., Chem. Rev. 90:543-584 [1990]; Helene et al., Biochim. Biophys. Acta. 1049:99-125 [1990]; Agarwal et al., Proc. Natl. Acad. Sci. USA 85:7079-7083 [1989]; and Heikkila et al., Nature 328:445-449 [1987]). For a discussion of suitable ribozymes, see, e.g., Cech et al (1992) J. Biol. Chem. 267:17479-17482 and U.S. Pat. No. 5,225,347, incorporated herein by reference. [0092]
  • By “adeno-associated virus inverted terminal repeats” or “AAV ITRs” is meant the art-recognized palindromic regions found at each end of the AAV genome which function together in cis as origins of DNA replication and as packaging signals for the virus. For use with the present invention, flanking AAV ITRs are positioned 5′ and 3′ of one or more selected heterologous nucleotide sequences and, together with the rep coding region or the Rep expression product, provide for the integration of the selected sequences into the genome of a target cell. [0093]
  • The nucleotide sequences of AAV ITR regions are known (See, e.g., Kotin, Human Gene Therapy 5:793-801 [1994]; Bems, K. I. “Parvoviridae and their Replication” in Fundamental Virology, 2nd Edition, (B. N. Fields and D. M. Knipe, eds.) for the AAV-2 sequence. As used herein, an “AAV ITR” need not have the wild-type nucleotide sequence depicted, but may be altered, e.g., by the insertion, deletion or substitution of nucleotides. Additionally, the AAV ITR may be derived from any of several AAV serotypes, including without limitation, AAV-1, AAV-2, AAV-3, AAV-4, AAV-5, AAVX7, etc. The 5′ and 3′ ITRs which flank a selected heterologous nucleotide sequence need not necessarily be identical or derived from the same AAV serotype or isolate, so long as they function as intended, i.e., to allow for the integration of the associated heterologous sequence into the target cell genome when the rep gene is present (either on the same or on a different vector), or when the Rep expression product is present in the target cell. [0094]
  • As used herein the term, the term “in vitro” refers to an artificial environment and to processes or reactions that occur within an artificial environment. In vitro environments can consist of, but are not limited to, test tubes and cell cultures. The term “in vivo” refers to the natural environment (e.g., an animal or a cell) and to processes or reaction that occur within a natural environment. [0095]
  • As used herein, the term “clonally derived” refers to a cell line that it derived from a single cell. [0096]
  • As used herein, the term “non-clonally derived” refers to a cell line that is derived from more than one cell. [0097]
  • As used herein, the term “passage” refers to the process of diluting a culture of cells that has grown to a particular density or confluency (e.g., 70% or 80% confluent), and then allowing the diluted cells to regrow to the particular density or confluency desired (e.g., by replating the cells or establishing a new roller bottle culture with the cells. [0098]
  • As used herein, the term “stable,” when used in reference to genome, refers to the stable maintenance of the information content of the genome from one generation to the next, or, in the particular case of a cell line, from one passage to the next. Accordingly, a genome is considered to be stable if no gross changes occur in the genome (e.g., a gene is deleted or a chromosomal translocation occurs). The term “stable” does not exclude subtle changes that may occur to the genome such as point mutations. [0099]
  • As used herein, the term “response,” when used in reference to an assay, refers to the generation of a detectable signal (e.g., accumulation of reporter protein, increase in ion concentration, accumulation of a detectable chemical product). [0100]
  • As used herein, the term “membrane receptor protein” refers to membrane spanning proteins that bind a ligand (e.g., a hormone or neurotransmitter). As is known in the art, protein phosphorylation is a common regulatory mechanism used by cells to selectively modify proteins carrying regulatory signals from outside the cell to the nucleus. The proteins that execute these biochemical modifications are a group of enzymes known as protein kinases. They may further be defined by the substrate residue that they target for phosphorylation. One group of protein kinases are the tyrosine kinases (TKs) which selectively phosphorylate a target protein on its tyrosine residues. Some tyrosine kinases are membrane-bound receptors (RTKs), and, upon activation by a ligand, can autophosphorylate as well as modify substrates. The initiation of sequential phosphorylation by ligand stimulation is a paradigm that underlies the action of such effectors as, for example, epidermal growth factor (EGF), insulin, platelet-derived growth factor (PDGF), and fibroblast growth factor (FGF). The receptors for these ligands are tyrosine kinases and provide the interface between the binding of a ligand (hormone, growth factor) to a target cell and the transmission of a signal into the cell by the activation of one or more biochemical pathways. Ligand binding to a receptor tyrosine kinase activates its intrinsic enzymatic activity (See, e.g., Ullrich and Schlessinger, Cell 61:203-212 [1990]). Tyrosine kinases can also be cytoplasmic, non-receptor-type enzymes and act as a downstream component of a signal transduction pathway. [0101]
  • As used herein, the term “signal transduction protein” refers to a proteins that are activated or otherwise effected by ligand binding to a membrane receptor protein or some other stimulus. Examples of signal transduction protein include adenyl cyclase, phospholipase C, and G-proteins. Many membrane receptor proteins are coupled to G-proteins (i.e., G-protein coupled receptors (GPCRs); for a review, see Neer, 1995, Cell 80:249-257 [1995]). Typically, GPCRs contain seven transmembrane domains. Putative GPCRs can be identified on the basis of sequence homology to known GPCRs. [0102]
  • GPCRs mediate signal transduction across a cell membrane upon the binding of a ligand to an extracellular portion of a GPCR. The intracellular portion of a GPCR interacts with a G-protein to modulate signal transduction from outside to inside a cell. A GPCR is therefore said to be “coupled” to a G-protein. G-proteins are composed of three polypeptide subunits: an α subunit, which binds and hydrolyses GTP, and a dimeric βγ subunit. In the basal, inactive state, the G-protein exists as a heterotrimer of the α and βγ subunits. When the G-protein is inactive, guanosine diphosphate (GDP) is associated with the α subunit of the G-protein. When a GPCR is bound and activated by a ligand, the GPCR binds to the G-protein heterotrimer and decreases the affinity of the Gα subunit for GDP. In its active state, the G subunit exchanges GDP for guanine triphosphate (GTP) and active Gα subunit disassociates from both the receptor and the dimeric βγ subunit. The disassociated, active Gα subunit transduces signals to effectors that are “downstream” in the G-protein signalling pathway within the cell. Eventually, the G-protein's endogenous GTPase activity returns active G subunit to its inactive state, in which it is associated with GDP and the dimeric βγ subunit. [0103]
  • Numerous members of the heterotrimeric G-protein family have been cloned, including more than 20 genes encoding various Gα subunits. The various G subunits have been categorized into four families, on the basis of amino acid sequences and functional homology. These four families are termed Gα[0104] s, Gαi, Gαq, and Gα12. Functionally, these four families differ with respect to the intracellular signaling pathways that they activate and the GPCR to which they couple.
  • For example, certain GPCRs normally couple with Gα[0105] s and, through Gαs, these GPCRs stimulate adenylyl cyclase activity. Other GPCRs normally couple with GGαq, and through GGαq, these GPCRs can activate phospholipase C (PLC), such as the β isoform of phospholipase C (i.e., PLCβ, Stermweis and Smrcka, Trends in Biochem. Sci. 17:502-506 [1992]).
  • As used herein, the term “immunoglobulin” refers to proteins which bind a specific antigen. Immunoglobulins include, but are not limited to, polyclonal, monoclonal, chimeric, and humanized antibodies, Fab fragments, F(ab′)2 fragments, and includes immunoglobulins of the following classes: IgG, IgA, IgM, IgD, IbE, and secreted immunoglobulins (sIg). Immunoglobulins generally comprise two identical heavy chains (γ, α, μ, δ, or ε) and two light chains (κ or λ). [0106]
  • As used herein, the term “antigen binding protein” refers to proteins which bind to a specific antigen. “Antigen binding proteins” include, but are not limited to, immunoglobulins, including polyclonal, monoclonal, chimeric, and humanized antibodies; Fab fragments, F(ab′)2 fragments, and Fab expression libraries; and single chain antibodies. Various procedures known in the art are used for the production of polyclonal antibodies. For the production of an antibody, various host animals can be immunized by injection with the peptide corresponding to the desired epitope including but not limited to rabbits, mice, rats, sheep, goats, etc. In a preferred embodiment, the peptide is conjugated to an immunogenic carrier (e.g., diphtheria toxoid, bovine serum albumin (BSA), or keyhole limpet hemocyanin (KLH)). Various adjuvants are used to increase the immunological response, depending on the host species, including but not limited to Freund's (complete and incomplete), mineral gels such as aluminum hydroxide, surface active substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole limpet hemocyanins, dinitrophenol, and potentially useful human adjuvants such as BCG (Bacille Calmette-Guerin) and [0107] Corynebacterium parvum.
  • For preparation of monoclonal antibodies, any technique that provides for the production of antibody molecules by continuous cell lines in culture may be used (See, e.g., Harlow and Lane, [0108] Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.). These include, but are not limited to, the hybridoma technique originally developed by Köhler and Milstein (Köhler and Milstein, Nature 256:495-497 [1975]), as well as the trioma technique, the human B-cell hybridoma technique (See e.g., Kozbor et al. Immunol. Today 4:72 [1983]), and the EBV-hybridoma technique to produce human monoclonal antibodies (Cole et al., in Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96 [1985]).
  • According to the invention, techniques described for the production of single chain antibodies (U.S. Pat. No. 4,946,778; herein incorporated by reference) can be adapted to produce specific single chain antibodies as desired. An additional embodiment of the invention utilizes the techniques known in the art for the construction of Fab expression libraries (Huse et al., Science 246:1275-1281 [1989]) to allow rapid and easy identification of monoclonal Fab fragments with the desired specificity. [0109]
  • Antibody fragments that contain the idiotype (antigen binding region) of the antibody molecule can be generated by known techniques. For example, such fragments include but are not limited to: the F(ab′)2 fragment that can be produced by pepsin digestion of an antibody molecule; the Fab′ fragments that can be generated by reducing the disulfide bridges of an F(ab′)2 fragment, and the Fab fragments that can be generated by treating an antibody molecule with papain and a reducing agent. [0110]
  • Genes encoding antigen binding proteins can be isolated by methods known in the art. In the production of antibodies, screening for the desired antibody can be accomplished by techniques known in the art (e.g., radioimmunoassay, ELISA (enzyme-linked immunosorbant assay), “sandwich” immunoassays, immunoradiometric assays, gel diffusion precipitin reactions, immunodiffusion assays, in situ immunoassays (using colloidal gold, enzyme or radioisotope labels, for example), Western Blots, precipitation reactions, agglutination assays (e.g., gel agglutination assays, hemagglutination assays, etc.), complement fixation assays, immunofluorescence assays, protein A assays, and immunoelectrophoresis assays, etc.) etc. [0111]
  • As used herein, the term “reporter gene” refers to a gene encoding a protein that may be assayed. Examples of reporter genes include, but are not limited to, luciferase (See, e.g., deWet et al., Mol. Cell. Biol. 7:725 [1987] and U.S. Pat Nos.,6,074,859; 5,976,796; 5,674,713; and 5,618,682; all of which are incorporated herein by reference), green fluorescent protein (e.g., GenBank Accession Number U43284; a number of GFP variants are commercially available from CLONTECH Laboratories, Palo Alto, Calif.), chloramphenicol acetyltransferase, β-galactosidase, alkaline phosphatase, and horse radish peroxidase. [0112]
  • As used herein, the term “purified” refers to molecules, either nucleic or amino acid sequences, that are removed from their natural environment, isolated or separated. An “isolated nucleic acid sequence” is therefore a purified nucleic acid sequence. “Substantially purified” molecules are at least 60% free, preferably at least 75% free, and more preferably at least 90% free from other components with which they are naturally associated. [0113]
  • DETAILED DESCRIPTION OF THE INVENTION
  • The present invention provides novel regulatory sequences for use in expression vectors. In some embodiments, the present invention provides retroviral expression vectors containing novel regulatory elements. In addition, in still other embodiments, the present invention provides methods for expressing proteins of interest in host cells. In particularly preferred embodiments, the present invention provides methods for expressing two chains of a multisubunit protein (e.g., a heavy chain and a light chain of an immunoglobulin or the subunits of follicle stimulating hormone) in a nearly equal ratio. These methods take advantage of the novel regulatory sequences and vectors of the present invention to solve problems in the prior art. [0114]
  • I. Components of Retroviral Expression Vectors [0115]
  • In particularly preferred embodiments, the retroviral vectors of the present invention include the following elements in operable association: a) a 5′ LTR; b) a packaging signal; c) a 3′ LTR, and d) a nucleic acid encoding a protein of interest located between the 5′ and 3′ LTRs. In addition, in some preferred embodiments, novel compositions, including, but not limited to those described below are included in expression vectors in order to aid in the expression, secretion and purification of proteins of interest. The following novel elements are described in more detail below: bovine/human hybrid alpha-lactalbumin (α-LA) promoter (A); mutant RNA export element (B); and internal ribosome entry site (C). [0116]
  • A. Bovine/Human Hybrid Alpha Lactalbumin Promoter [0117]
  • In some embodiments, the present invention provides a hybrid α-lactalbumin (α-LA) promoter. It is contemplated that the hybrid promoter may be constructed from portions of any two or more mammalian α-lactalbumin promoters (e.g., human, bovine, goat, sheep, rabbit, or mouse α-lactalbumin promoters among others; see, e.g., GenBank Accession numbers AF124257; AF123893; AX067504; Soulier et al., Transgenic Res. 8(1):23-31 (1999); McKee et al., Nat. Biotech. 16(7):647-51 (1998); Lubon et al., Biochem. J. 256(2):391-6 (1988); and U.S. Pat. No. 5,530,177). In some embodiments, the portion of at least one of the promoters contributing to the hybrid is at least 50 nucleotides in length, while in preferred embodiments, the portion of at least one of the promoters contributing to the hybrid is at least 100 nucleotides in length, while in particularly preferred embodiments, the portion of at least one of the promoters contributing to the hybrid is at least 500 nucleotides length, with the portion of the at least one other promoter contributing to the hybrid being of similar or longer length. Once constructed, the hybrid promoters can be assayed for functionality by operably linking the promoter to a reporter gene such as beta-galactosidase, green fluorescent protein, or luciferase, creating a transgenic animal such as transgenic mouse or bovine that comprises the resulting construct, and assaying various tissues of the resulting transgenic animal to determine the specificity of expression from the hybrid promoter. In preferred embodiments, expression from the hybrid promoter is substantially specific to the mammary gland, and in particular to mammary epithelial cells, with no or only trace levels of expression of in other tissues. [0118]
  • In particularly preferred embodiments, the hybrid promoter is a bovine/human hybrid α-lactalbumin (α-LA) promoter (SEQ ID NO: 1). The human portion of the promoter was derived from human genomic DNA and contains bases from +15 relative to the transcription start point to −600 relative to the transcription start point. The bovine portion is attached to the end of the human portion and corresponds to bases −550 to −2000 relative to the transcription start point. [0119]
  • The hybrid promoter preferably used in the present invention utilizes a region of the human promoter that contained an internal poly-adenylation signal. The internal poly-adenylation signal was removed by mutation. The mutation was at base 2012 and involved a change from A to T. The present invention is not limited to any particular mechanism of action. Indeed, an understanding of the mechanism is not required to practice the present invention. Nevertheless, it is contemplated the removal of poly-adenylation signals improves retroviral RNA production by eliminating premature mRNA termination problems. In addition, it is contemplated that additional enhancer regions exist in the human, but not the bovine sequence. The hybrid promoter was constructed to take advantage of these additional sequences. Likewise, the hybrid promoter contains bovine elements that may or may not be found in the human promoter. [0120]
  • B. RNA Export Element [0121]
  • In some embodiments, the present invention comprises a mutant RNA export element (pre-mRNA processing element (PPE), Mertz sequence, or WPRE; See, e.g., U.S. Pat. Nos. 5,914,267 and 5,686,120 and PCT Publication WO99/14310, all of which are incorporated herein by reference). The present invention is not limited to any particular mechanism of action. Indeed, an understanding of the mechanism is not required to practice the present invention. Nevertheless, it is contemplated that the use of RNA export elements allows or facilitates high levels of expression of the protein of interest without incorporating splice signals or introns in the nucleic acid sequence encoding the protein of interest. [0122]
  • In some embodiments, a mutated PPE element is utilized. In some particularly preferred embodiments, the PPE sequence is mutated to remove internal ATG sequences. The present invention is not limited to any particular mechanism of action. Indeed, an understanding of the mechanism is not required to practice the present invention. Nevertheless, it is contemplated that the removal of internal start sequences prevents potential unwanted translation initiation. In some embodiments utilizing a mutated PPE sequence, bases 4, 112, 131, and 238 of SEQ ID NO: 2 were changed from a G to a T. In all cases, these changes resulted in and ATG start codon being mutated to an ATT codon. In some embodiments, the mutated PPE sequence is placed in the 5′ untranslated region (UTR) of the mRNA encoding the gene of interest. In other embodiments, the mutated PPE sequence is placed in the 3′ UTR of the mRNA encoding the gene of interest. In some preferred embodiments, two mutated PPE sequences separated by a linker are placed in a head to tail array (See e.g., SEQ ID NO:2). It has been shown that two copies of the sequence cause a more dramatic effect on mRNA export. In other embodiments, 2-20 copies of the mutated PPE sequence are placed in the mRNA encoding the gene of interest. [0123]
  • Functional variants of the above sequences are easily identified by operably linking the variant sequence to a test gene in a vector, transfecting a host cell with the vector, and analyzing the host cell for expression of the test gene. Suitable test genes, host cells, and vectors are disclosed in the examples. [0124]
  • C. Internal Ribosome Entry Site [0125]
  • In some embodiments, the present invention comprises an internal ribosome entry site (IRES)/signal peptide sequence (e.g., SEQ ID NOs:3 and 12). The present invention contemplates that a variety of signal sequences may be fused with a variety of IRES sequences. Suitable signal sequences include those from α-lactalbumin, casein, tissue plasminogen activator, serum albumin, lactoferrin, and lactoferrin (See, e.g., Zwizinski et al., J. Biol. Chem. 255(16): 7973-77 [1980], Gray et al., Gene 39(2): 247-54 [1985], and Martial et al., Science 205: 602-607 [1979]). Such secretion signal sequences are preferably derived from genes encoding polypeptides secreted from the cell type targeted for tissue-specific expression (e.g., secreted milk proteins for expression in and secretion from mammary secretory cells). Suitable IRES sequences include, but are not limited, to those derived from foot and mouth disease virus (FDV), encephalomyocarditis virus, poliovirus and RDV (Scheper et al., Biochem. 76: 801-809 [1994]; Meyer et al., J. Virol. 69: 2819-2824 [1995]; Jang et al., 1988, J. Virol. 62: 2636-2643 [1998]; Haller et al., J. Virol. 66: 5075-5086 [1995]). Functional IRES/signal peptide sequences may be identified by operably linking two genes with the sequence and an appropriate promoter, transfecting a host cell with the construct, and assaying the host cell for production the proteins encoding by the two genes. Suitable genes, vector constructs, and host cells for such screening are provided in the examples. In preferred embodiments, the coding sequences for the IRES and signal peptide are adjacent to one another, with no intervening coding sequences (i.e., that may be separated by noncoding sequences in some instances). [0126]
  • The present invention is not limited to any particular mechanism of action. Indeed, an understanding of the mechanism is not required to practice the present invention. The IRES allows translation of the gene to start at the IRES sequence, thereby resulting in the expression of two genes of interest in the same construct. The bovine α-lactalbumin signal peptide or casein signal peptide causes extracellular secretion of expressed protein products. [0127]
  • In some embodiments, the initial ATG of the signal peptide is attached to the IRES in order to allow the most efficient translation initiation from the IRES. In some embodiments, the second codon of the signal peptide is mutated from an ATG to a GCC, changing the second amino acid of the α-lactalbumin signal peptide from a methionine to an alanine. The present invention is not limited to any particular mechanism of action. Indeed, an understanding of the mechanism is not required to practice the present invention. Nevertheless, it is contemplated that this mutation facilitates more efficient translation initiation by the IRES. In some embodiments, the (IRES)/signal peptide is inserted into a vector between two genes of interest. In these embodiments, the (IRES)/signal peptide creates a second translation initiation site, allowing for the expression of two polypeptides from the same expression vector. In other words, a single transcript is produced that encodes two different polypeptides (e.g., the heavy and light chains of an immunoglobulin). [0128]
  • In some embodiments, the signal peptide is derived from α-lactalbumin. In other embodiments, the present invention comprises an internal ribosome entry site (IRES)/modified bovine α-S1 Casein signal peptide fusion protein (SEQ ID NO:12). The present invention is not limited to any particular mechanism of action. Indeed, an understanding of the mechanism is not required to practice the present invention. The IRES allows translation of the gene to start at the IRES sequence, allowing the expression of two genes of interest in the same construct. The bovine αS1 casein signal peptide causes secretion of expressed protein products. [0129]
  • In some embodiments the second codon of the bovine α-S1 casein signal peptide is mutated from a AAA to a GCC. The mutation results in the second codon of the signal peptide being changed from an alanine to a lysine. In some embodiments, the third codon of the signal peptide is mutated from a CTT to a TTG, a change which does not result and an amino acid substitution. The present invention is not limited to any particular mechanism of action. Indeed, an understanding of the mechanism is not required to practice the present invention. Nevertheless, it is contemplated that this mutation allows more efficient translation initiation by the IRES. [0130]
  • II. Retroviral Expression Vectors [0131]
  • In some embodiments, the present invention comprises retroviral expression vectors. Retroviruses (family Retroviridae) are generally divided into three groups: the spumaviruses (e.g., human foamy virus); the lentiviruses (e.g., human immunodeficiency virus and sheep visna virus), and the oncoviruses (e.g., MLV and Rous sarcoma virus). [0132]
  • Retroviruses are enveloped (i.e., surrounded by a host cell-derived lipid bilayer membrane) single-stranded RNA viruses which infect animal cells. When a retrovirus infects a cell, its RNA genome is converted into a double-stranded linear DNA form (i.e., it is reverse transcribed). The DNA form of the virus is then integrated into the host cell genome as a provirus. The provirus serves as a template for the production of additional viral genomes and viral mRNAs. Mature viral particles containing two copies of genomic RNA bud from the surface of the infected cell. The viral particle comprises the genomic RNA, reverse transcriptase and other pol gene products inside the viral capsid (containing the viral gag gene products) which is surrounded by a lipid bilayer membrane derived from the host cell containing the viral envelope glycoproteins (also referred to as membrane-associated proteins). [0133]
  • The genomic organization of numerous retroviruses is well known to the art and this has allowed the adaptation of the retroviral genome to produce retroviral vectors. The production of a recombinant retroviral vector carrying a gene of interest is typically achieved in two stages. [0134]
  • First, the gene of interest is inserted into a retroviral vector which contains the sequences necessary for the efficient expression of the gene of interest (including promoter and/or enhancer elements which may be provided by the viral long terminal repeats (LTRs) or by an internal promoter/enhancer and relevant splicing signals), sequences required for the efficient packaging of the viral RNA into infectious virions (e.g., the packaging signal (Psi), the tRNA primer binding site (−PBS), the 3′ regulatory sequences required for reverse transcription (+PBS)) and the viral LTRs. The LTRs contain sequences required for the association of viral genomic RNA, reverse transcriptase and integrase functions, and sequences involved in directing the expression of the genomic RNA to be packaged in viral particles. For safety reasons, many recombinant retroviral vectors lack functional copies of the genes which are essential for viral replication (these essential genes are either deleted or disabled); therefore, the resulting virus is said to be “replication defective”. [0135]
  • Second, following the construction of the recombinant vector, the vector DNA is introduced into a packaging cell line. Packaging cell lines provide viral proteins required in trans for the packaging of the viral genomic RNA into viral particles having the desired host range (i.e., the viral-encoded gag, pol and env proteins). The host range is controlled, in part, by the type of envelope gene product expressed on the surface of the viral particle. Packaging cell lines may express ecotrophic, amphotropic or xenotropic envelope gene products. Alternatively, the packaging cell line may lack sequences encoding a viral envelope (env) protein. In this case the packaging cell line will package the viral genome into particles lacking a membrane-associated protein (e.g., an env protein). In order to produce viral particles containing a membrane associated protein which will permit entry of the virus into a cell, the packaging cell line containing the retroviral sequences is commonly transfected with sequences encoding a membrane-associated protein (e.g., the G protein of vesicular stomatitis virus (VSV)). The transfected packaging cell will then produce viral particles which contain the membrane-associated protein expressed by the transfected packaging cell line; these viral particles which contain viral genomic RNA derived from one virus encapsidated by the envelope proteins of another virus are said to be “pseudotyped virus particles”. [0136]
  • The retroviral vectors of the present invention can be further modified to include additional regulatory sequences. As described above, the retroviral vectors of the present invention include the following elements in operable association: a) a 5′ LTR; b) a packaging signal; c) a 3′ LTR; and d) a nucleic acid encoding a protein of interest located between the 5′ and 3′ LTRs. In some embodiments of the present invention, the nucleic acid of interest may be arranged in opposite orientation to the 5′ LTR when transcription from an internal promoter is desired. Suitable internal promoters include, but are not limited to, the alpha-lactalbumin promoter, the CMV promoter, and the thymidine kinase promoter. [0137]
  • In other embodiments of the present invention, where secretion of the protein of interest is desired, the vectors are modified by including a signal peptide sequence in operable association with the protein of interest. The sequences of several suitable signal peptides are known in the art, including, but not limited to, those derived from tissue plasminogen activator, human growth hormone, lactoferrin, alpha S1-casein, and alpha-lactalbumin. [0138]
  • In other embodiments of the present invention, the vectors are modified by incorporating one or more of the elements described above, including, but not limited to, an RNA export element, a PPE element, and an IRES/bovine α-lactalbumin signal sequence. [0139]
  • The retroviral vectors of the present invention may further comprise a selectable marker which facilitates selection of transformed cells. A number of selectable markers known in the art find use in the present invention, including, but not limited to the [0140] bacterial aminoglycoside 3′ phosphotransferase gene (also referred to as the “neo gene”) that confers resistance to the drug G418 in mammalian cells, the bacterial hygromycin G phosphotransferase (hyg) gene that confers resistance to the antibiotic hygromycin, and the bacterial xanthine-guanine phosphoribosyl transferase gene (also referred to as the “gpt gene”) that confers the ability to grow in the presence of mycophenolic acid. In some embodiments, the selectable marker gene is provided as part of a polycistronic sequence also encoding the protein of interest.
  • In still other embodiments of the present invention, the retroviral vectors may comprise recombination elements recognized by a recombination system (e.g., the cre/loxP or flp recombinase systems: See, e.g., Hoess et al., Nucleic Acids Res., 14:2287 [1986], O'Gorman et al., Science 251:1351 [1991], van Deursen et al., Proc. Natl. Acad. Sci. USA 92:7376 [1995], and U.S. Pat. No. 6,025,192, incorporated herein by reference). After integration of the vectors into the genome of the host cell, the host cell can be transiently transfected (e.g., by electroporation, lipofection, or microinjection) with either a recombinase enzyme (e.g., Cre recombinase) or a nucleic acid sequence encoding the recombinase enzyme and one or more nucleic acid sequences encoding a protein of interest flanked by sequences recognized by the recombination enzyme so that the nucleic acid sequence of interest is inserted into the integrated vector. [0141]
  • Viral vectors, including recombinant retroviral vectors, provide a more efficient means of transferring genes into cells, as compared to other techniques such as calcium phosphate-DNA co-precipitation or DEAE-dextran-mediated transfection, electroporation or microinjection of nucleic acids. Nonetheless, the present invention is not limited to any particular mechanism. Indeed, an understanding of the mechanism is not required to practice the present invention. Nevertheless, it is believed that the efficiency of viral transfer is due in part to the fact that the transfer of nucleic acid is a receptor-mediated process (i.e., the virus binds to a specific receptor protein on the surface of the target cell). In addition, once inside a cell, the virally transferred nucleic acid integrates in controlled manner. This is in contrast to nucleic acids transferred by other means (e.g., calcium phosphate-DNA co-precipitation), which are typically subject to rearrangement and degradation. [0142]
  • Example 1, below, describes several illustrative examples of retroviral vectors of the current invention. However, it is not intended that the present invention be limited to the vectors described in Example 1. Indeed, any suitable retroviral vectors containing the novel elements of the present invention are contemplated. Furthermore, the elements described above find use in other vectors such as AAV vectors, transposon vectors, plasmids, bacterial artificial chromosomes, and yeast artificial chromosomes. [0143]
  • III. Expression of Proteins [0144]
  • In some embodiments of the present invention, the vectors and regulatory elements described above find use in the expression of one or more proteins. The present invention is not limited to the production of any particular protein. Indeed, the production of a wide variety of proteins is contemplated, including, but not limited to, erythropoietin, alpha-interferon, alpha-1 proteinase inhibitor, angiogenin, antithrombin III, beta-acid decarboxylase, human growth hormone, bovine growth hormone, porcine growth hormone, human serum albumin, beta-interferon, calf intestine alkaline phosphatase, cystic fibrosis transmembrane regulator, Factor VIII, Factor IX, Factor X, insulin, lactoferrin, tissue plasminogen activator, myelin basic protein, insulin, proinsulin, prolactin, hepatitis B antigen, immunoglobulins, monoclonal antibody CTLA4 Ig, Tag 72 monoclonal antibody, Tag 72 single chain antigen binding protein, protein C, cytokines and their receptors (e.g., tumor necrosis factor alpha and beta), growth hormone releasing factor, parathyroid hormone, thyroid stimulating hormone, lipoproteins, alpha-1-antitrypsin, follicle stimulating hormone, calcitonin, luteinizing hormone, glucagon, von Willebrands factor, atrial natriuretic factor, lung surfactant, urokinase, bombesin, thrombin, hemopoietic growth factor, enkephalinase, human macrophage inflammatory protein (MIP-1-alpha), serum albumins (e.g., mullerian-inhibiting substance), relaxin A-chain, relaxin B-chain, prorelaxin, mouse gonadotropin-associated peptide, beta-lactamase, DNase, inhibin, activin, vascular endothelial growth factor (VEGF), receptors for hormones or growth factors, integrin, protein A or D, rheumatoid factors, neurotrophic factors (e.g., bone-derived neurotrophic factor (BDNF)), neurotrophin-3, -4, -5, or -6 (NT-3, NT-4, NT-5, or NT-6), nerve growth factors (e.g., NGF-beta), platelet-derived growth factor (PDGF), fibroblast growth factors (e.g., aFGF and bFGF), epidermal growth factor (EGF), transforming growth factor (TGF) (e.g., TGF-alpha and TGF-beta, including TGF-β1, TGF-β2, TGF-β3, TGF-β4, or TGF-β5), insulin-like growth factor-I and -II (IGF-I and IGF-II), des(1-3)-IGF-I (brain IGF-I), insulin like growth factor binding proteins; CD proteins (e.g., CD-3, CD-4, CD-8, and CD-19), osteoinductive factors, immunotoxins, bone morphogenetic protein (BMP); interferons (e.g., interferon-alpha, -beta, and -gamma), colony stimulating factors (CSFs) ( e.g., M-CSF, GM-CSF, and G-CSF), interleukins (IL) ( e.g., IL-1 to IL-10), superoxide dismutase, T-cell receptors, surface membrane proteins, decay accelerating factor, viral antigens (e.g., a portion of the AIDS envelope), transport proteins, homing receptors, addressing, regulatory proteins, antibodies, chimeric proteins (e.g., immunoadhesins), and fragments of any of the above-listed polypeptides. One skilled in the art recognizes that the nucleic acid sequences for these proteins and their homologs are available from public databases (e.g., Gen Bank). [0145]
  • In some embodiments, the vectors of the present invention are used to express more than one exogenous protein. For example, host cells may be transfected with vectors encoding different proteins of interest (e.g., cotransfection with one vector encoding a first protein of interest and a second vector encoding a second protein of interest). In other embodiments, more than one protein is expressed by arranging the nucleic acids encoding the different proteins of interest in a polycistronic sequence (e.g., bicistronic or tricistronic sequences). This arrangement is especially useful when expression of the different proteins of interest in a 1:1 molar ratio is desired (e.g., expression of the light and heavy chains of an immunoglobulin molecule). [0146]
  • A. Expression of Protein in Cell Culture [0147]
  • In some embodiments of the present invention, proteins are expressed in cell culture. In some embodiments, retroviral vectors are used to express protein in mammalian tissue culture host cells, including, but not limited to, rat fibroblast cells, bovine kidney cells, and human kidney cells, while in some preferred embodiments, protein is expressed in bovine mammary cells. The host cells are cultured according to methods known in the art; suitable culture conditions for mammalian cells are well known in the art (See e.g., J. Immunol. Methods 56:221 [1983], [0148] Animal Cell Culture: A Practical Approach 2nd Ed., Rickwood, D. and Hames, B. D., eds. Oxford University Press, New York [1992]).
  • The present invention contemplates the transfection of a variety of host cells with integrating vectors. A number of mammalian host cell lines are known in the art. In general, these host cells are capable of growth and survival when placed in either monolayer culture or in suspension culture in a medium containing the appropriate nutrients and growth factors, as is described in more detail below. Typically, the cells are capable of expressing and secreting large quantities of a particular protein of interest into the culture medium. Examples of suitable mammalian host cells include, but are not limited to Chinese hamster ovary cells (CHO-K1, ATCC CC1-61); bovine mammary epithelial cells (ATCC CRL 10274; bovine mammary epithelial cells); monkey kidney CV1 line transformed by SV40 (COS-7, ATCC CRL 1651); human embryonic kidney line (293 or 293 cells subcloned for growth in suspension culture; see, e.g., Graham et al., J. Gen Virol., 36:59 [1977]); baby hamster kidney cells (BHK, ATCC CCL 10); mouse sertoli cells (TM4, Mather, Biol. Reprod. 23:243-251 [1980]); monkey kidney cells (CV1 ATCC CCL 70); African green monkey kidney cells (VERO-76, ATCC CRL-1587); human cervical carcinoma cells (HELA, ATCC CCL 2); canine kidney cells (MDCK, ATCC CCL 34); buffalo rat liver cells (BRL 3A, ATCC CRL 1442); human lung cells (W138, ATCC CCL 75); human liver cells (Hep G2, HB 8065); mouse mammary tumor (MMT 060562, ATCC CCL51); TRI cells (Mather et al., Annals N.Y. Acad. Sci., 383:44-68 [1982]); MRC 5 cells; FS4 cells; rat fibroblasts (208F cells); MDBK cells (bovine kidney cells); and a human hepatoma line (Hep G2). [0149]
  • In addition to mammalian cell lines, the present invention also contemplates the transfection of plant protoplasts with integrating vectors at a low or high multiplicity of infection. For example, the present invention contemplates a plant cell or whole plant comprising at least one integrated integrating vector, preferably a retroviral vector, and most preferably a pseudotyped retroviral vector. All plants that can be produced by regeneration from protoplasts can also be transfected using the process according to the invention (e.g., cultivated plants of the genera Solanum, Nicotiana, Brassica, Beta, Pisum, Phaseolus, Glycine, Helianthus, Allium, Avena, Hordeum, Oryzae, Setaria, Secale, Sorghum, Triticum, Zea, Musa, Cocos, Cydonia, Pyrus, Malus, Phoenix, Elaeis, Rubus, Fragaria, Prunus, Arachis, Panicum, Saccharum, Coffea, Camellia, Ananas, Vitis or Citrus). In general, protoplasts are produced in accordance with conventional methods (See, e.g., U.S. Pat. Nos. 4,743,548; 4,677,066, 5,149,645; and 5,508,184; all of which are incorporated herein by reference). Plant tissue may be dispersed in an appropriate medium having an appropriate osmotic potential (e.g., 3 to 8 wt. % of a sugar polyol) and one or more polysaccharide hydrolases (e.g., pectinase, cellulase, etc.), and the cell wall degradation allowed to proceed for a sufficient time to provide protoplasts. After filtration the protoplasts may be isolated by centrifugation and may then be resuspended for subsequent treatment or use. Regeneration of protoplasts kept in culture to whole plants is performed by methods known in the art (See, e.g., Evans et al., [0150] Handbook of Plant Cell Culture, 1: 124-176, MacMillan Publishing Co., New York [1983]; Binding, Plant Protoplasts, p. 21-37, CRC Press, Boca Raton [1985],) and Potrykus and Shillito, Methods in Enzymology, Vol. 118, Plant Molecular Biology, A. and H. Weissbach eds., Academic Press, Orlando [1986]).
  • The present invention also contemplates the use of amphibian and insect host cell lines. Examples of suitable insect host cell lines include, but are not limited to, mosquito cell lines (e.g., ATCC CRL-1660). Examples of suitable amphibian host cell lines include, but are not limited to, toad cell lines (e.g., ATCC CCL-102). [0151]
  • In preferred embodiments of the present invention, the host cell cultures are prepared in a medium suitable for the particular cell being cultured. Commercially available media such as Ham's F10 (Sigma, St. Louis, Mo.), Minimal Essential Medium (MEM, Sigma), RPMI-1640 (Sigma), and Dulbecco's Modified Eagle's Medium (DMEM, Sigma) are exemplary nutrient solutions. Suitable media are also described in U.S. Pat. Nos. 4,767,704; 4,657,866; 4,927,762; 5,122,469 and 4,560,655; and PCT Publications WO 90/03430; and WO 87/00195 (each of which are incorporated herein by reference). Any of these media may be supplemented as necessary, with hormones and/or other growth factors (e.g., insulin, transferrin, or epidermal growth factor), salts (e.g., sodium chloride, calcium, magnesium, and phosphate), buffers (e.g., HEPES), nucleosides (e.g., adenosine and thymidine), antibiotics (e.g., gentamycin (gentamicin)), trace elements (i.e., inorganic compounds usually present at final concentrations in the micromolar range) lipids (e.g., linoleic or other fatty acids) and their suitable carriers, and glucose or an equivalent energy source. Any other necessary supplements may also be included at appropriate concentrations known to those skilled in the art. For mammalian cell culture, the osmolality of the culture medium is generally about 290-330 mOsm. [0152]
  • The present invention also contemplates the use of a variety of culture systems (e.g., petri dishes, 96 well plates, roller bottles, and bioreactors) for the growth and expression of host cells. For example, the host cells can be cultured in a perfusion system. Perfusion culture refers to providing a continuous flow of culture medium through a culture maintained at high cell density. The cells are suspended and do not require a solid support upon which to grow. Generally, fresh nutrients must be supplied continuously with concomitant removal of toxic metabolites and, ideally, selective removal of dead cells. Filtering, entrapment and micro-capsulation methods are all suitable for refreshing the culture environment at sufficient rates. [0153]
  • In alternative embodiments, a fed batch culture procedure is employed. In the preferred fed batch culture method the mammalian host cells and culture medium are supplied to a culturing vessel initially and additional culture nutrients are fed, continuously or in discrete increments, to the culture during culturing, with or without periodic cell and/or roduct harvest before termination of culture. In some embodiments, the fed batch culture is a semi-continuous fed batch culture in which the whole culture (including cells and medium) is removed from the growth vessel and replaced by fresh medium. Fed batch culture is distinguished from simple batch culture in which all components for cell culturing (including the cells and all culture nutrients) are supplied to the culturing vessel at the start of the culturing process. Fed batch culture can be further distinguished from perfusion culturing insofar as the supernate is not removed from the culturing vessel during the process (in perfusion culturing, the cells are restrained in the culture ( e.g., by filtration, encapsulation, anchoring to microcarriers etc.) and the culture medium is continuously or intermittently introduced and removed from the culturing vessel). [0154]
  • Further, the cells of the culture may be propagated according to any scheme or routine suitable for the particular host cell and the particular production plan contemplated. [0155]
  • Therefore, the present invention contemplates single step, as well as multiple step culture procedures. In a single step culture, the host cells are inoculated into a culture environment and the processes of the instant invention are employed during a single production phase of the cell culture. In the multi-stage culture procedure, cells are cultivated in a number of steps or phases. For instance, cells may be grown in a first step or growth phase culture wherein cells, possibly removed from storage, are inoculated into a medium suitable for promoting growth and high viability. The cells may be maintained in the growth phase for a suitable period of time by the addition of fresh medium to the host cell culture. [0156]
  • Fed batch or continuous cell culture conditions are contemplated in order to enhance growth of the mammalian cells in the growth phase of the cell culture. In the growth phase, cells are grown under conditions and for a period of time that is optimized for growth. Culture conditions, such as temperature, pH, dissolved oxygen (dO[0157] 2) and the like, are those used with the particular host and are apparent to the ordinarily skilled artisan. Generally, the pH is adjusted to a level between about 6.5 and 7.5 using either an acid (e.g., CO2) or a base (e.g., Na2CO3 or NaOH). A suitable temperature range for culturing mammalian cells (e.g., CHO cells) is between about 30° to 38° C. and a suitable dO2 is between 5-90% of air saturation.
  • Following the polypeptide production phase, the polypeptide of interest is recovered from the culture medium using well-established techniques. Preferably, the protein of interest is recovered from the culture medium as a secreted polypeptide (e.g., the secretion of the protein of interest is directed by a signal peptide sequence), although it also may be recovered from host cell lysates. As a first step, the culture medium or lysate is centrifuged to remove particulate cell debris. The polypeptide is then purified from contaminant soluble proteins and polypeptides using any suitable method. Suitable purificaiton methods include, but are not limited to fractionation on immunoaffinity or ion-exchange columns; ethanol precipitation; reverse phase HPLC; chromatography on silica or on a cation-exchange resin such as DEAE; chromatofocusing; SDS-PAGE; ammonium sulfate precipitation; gel filtration using (e.g., Sephadex G-75); and protein A Sepharose columns to remove contaminants such as IgG. A protease inhibitor such as phenyl methyl sulfonyl fluoride (PMSF) also may be useful to inhibit proteolytic degradation during purification. Additionally, the protein of interest can be fused in frame to a marker sequence which allows for purification of the protein of interest. Non-limiting examples of marker sequences include a hexahistidine tag which may be supplied by a vector, preferably a pQE-9 vector, and a hemagglutinin (HA) tag. The HA tag corresponds to an epitope derived from the influenza hemagglutinin protein (See e.g., Wilson et al., Cell, 37:767 [1984]). One skilled in the art appreciates that purification methods suitable for the polypeptide of interest may require modification to account for changes in the character of the polypeptide upon expression in recombinant cell culture. [0158]
  • B. Expression of Proteins in Animals [0159]
  • In some embodiments of the present invention, the host cell utilized for expression of the protein of interest is part of a mammal. In preferred embodiments, the mammal is a transgenic bovine. The transgenic bovine may be produced by any suitable method (See e.g., Chan et al., PNAS, 95:14028 [1998]; U.S. Pat. No. 5,741,957 (incorporated herein by reference); and Pursel et al., Science, 244:1281 [1989]). In particularly preferred embodiments, the protein is expressed in the mammary gland of a bovine and secreted in the milk of the bovine. In embodiments where proteins are expressed in the milk of a bovine, proteins and signal sequences for tissue specific expression and secretion are utilized, including, but not limited to, bovine/human α-lactalbumin promoter and bovine α-lactalbumin signal sequence. The protein of interest may be recovered from bovine milk using any suitable method, including but not limited to, those described above for the recovery of protein from cell cultures. [0160]
  • Those skilled in the art recognize that the vectors of the present invention will find use in the production of other transgenic animals as well, including, but not limited to, mice, goats, pigs, birds and rabbits (See e.g., U.S Pat. Nos. 5,523,226; 5,453,457; 4,873,191; 4,736,866; each of which is herein incorporated by reference). [0161]
  • C. Expression of Antibodies [0162]
  • In some embodiments of the present invention, single vectors are utilized for the expression of two or more proteins, including individual subunits of multisubunit proteins. In some embodiments, two or more chains of an immunoglobulin (e.g., one heavy chain ((γ, α, μ, δ, or [0163] 68 ) and one light chain (κ or λ)), separated by an IRES sequence, are expressed from the same vector as single transcriptional unit. The present invention is not limited to any particular vector. Indeed, the use of a variety of vectors is contemplated, including, but not limited to plasmids, cosmids, bacterial artificial chomosomes, yeast artificial chromosomes, adeno-associated virus vectors, and adenovirus vectors. Large numbers of suitable vectors are known to those of skill in the art, and are commercially available. Such vectors include, but are not limited to, the following vectors: 1) Bacterial—pQE70, pQE60, pQE-9 (Qiagen), pBS, pD10, phagescript, psiX174, pbluescript SK, pBSKS, pNH8A, pNH16a, pNH18A, pNH46A (Stratagene); ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 (Pharmacia); and 2) Eukaryotic—pWLNEO, pSV2CAT, pOG44, PXT1, pSG (Stratagene) pSVK3, pBPV, pMSG, pSVL (Pharmacia). Any other plasmid or vector may be used as long as they are replicable and viable in the host. In some preferred embodiments of the present invention, mammalian expression vectors comprise an origin of replication, a suitable promoter and enhancer, and also any necessary ribosome binding sites, polyadenylation sites, splice donor and acceptor sites, transcriptional termination sequences, and 5′ flanking non-transcribed sequences. In other embodiments, DNA sequences derived from the SV40 splice, and polyadenylation sites may be used to provide the required non-transcribed genetic elements.
  • In certain embodiments of the present invention, the DNA sequence in the expression vector is operatively linked to an appropriate expression control sequence(s) (promoter) to direct mRNA synthesis. Promoters useful in the present invention include, but are not limited to, the LTR or SV40 promoter, the [0164] E. coli lac or trp, the phage lambda PL and PR, T3 and T7 promoters, and the cytomegalovirus (CMV) immediate early, herpes simplex virus (HSV) thymidine kinase, and mouse metallothionein-I promoters and other promoters known to control expression of gene in prokaryotic or eukaryotic cells or their viruses. In other embodiments of the present invention, recombinant expression vectors include origins of replication and selectable markers permitting transformation of the host cell (e.g., dihydrofolate reductase or neomycin resistance for eukaryotic cell culture, or tetracycline or ampicillin resistance in E. coli).
  • In some embodiments of the present invention, transcription of the DNA encoding the polypeptides of the present invention by higher eukaryotes is increased by inserting an enhancer sequence into the vector. Enhancers are cis-acting elements of DNA, usually about from 10 to 300 bp that act on a promoter to increase its transcription. Enhancers useful in the present invention include, but are not limited to, the SV40 enhancer on the late side of the replication origin bp 100 to 270, a cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side of the replication origin, and adenovirus enhancers. [0165]
  • In other embodiments, the expression vector also contains a ribosome binding site for translation initiation and a transcription terminator. In still other embodiments of the present invention, the vector may also include appropriate sequences for amplifying expression. [0166]
  • In some particularly preferred embodiments, retroviral vectors are used to express immunoglobulins. In some embodiments, retroviral vectors for expression of immunoglobulins contain regulatory elements. In some preferred embodiments of the present invention, two immunoglobulins chains are expressed in the same retrovirus vector construct separated by an IRES sequence. In some particularly preferred embodiments, the two chains are separated by an IRES/α-LA signal sequence. In other embodiments, the vector further contains RNA export elements. In further embodiments, the RNA export element is a WPRE. In still other embodiments, the PPE element is at least one Mertz sequence. In some preferred embodiments, the PPE element is mutated to remove start signals. In other preferred embodiments, two PPE elements are placed in a head to tail array separated by a linker. [0167]
  • In preferred embodiments, expression of immunoglobulins by the vectors of the current invention is controlled by a promoter. In some embodiments, expression is controlled by a CMV promoter, while in other embodiments, expression is controlled by a MMTV promoter. In some preferred embodiments, expression is controlled by a hybrid bovine/human α-LA promoter. [0168]
  • In some embodiments of the present invention, heavy and light chains are expressed by the vectors of the current invention of a ratio of about 0.7:1.3. In preferred embodiments, heavy and light chains are expressed and a ratio of about 0.8:1.2. In particularly preferred embodiments, heavy and light chains are expressed at a ratio of about 0.9:1.1. In still more preferred embodiments, heavy and light chains are expressed at a ratio of about 1:1. In particularly preferred embodiments, the majority (e.g., greater that 90%, preferably greater than 95%, and most preferably greater than about 99%) of the heavy and light chains are correctly assembled in a ratio of 1:1 to form a functional (e.g., able to bind an antigen) antibody. [0169]
  • In illustrative examples of the present invention, immunoglobulins are expressed in a host cell comprising the vectors and elements described above. In some illustrative examples (See e.g., Examples 6, 8, and 12), the vectors described in Example 1 are used to express a variety of immunoglobulins in a variety of cell lines. In general, this expression led to the formation of finctional, tetrameric immunoglobulins. [0170]
  • D. Expression of Other Proteins [0171]
  • The vectors of the present invention are also useful for expressing G-protein coupled receptors (GPCRs) and other transmembrane proteins. It is contemplated that when these proteins are expressed, they are correctly inserted into the membrane in their native conformation. Thus, GPCRs and other transmembrane proteins may be purified as part of a membrane fraction or purified from the membranes by methods known in the art. [0172]
  • Furthermore, the vectors of the present invention are useful for co-expressing a protein of interest for which there is no assay or for which assays are difficult. In this system, a protein of interest and a signal protein are arranged in a polycistronic sequence. Preferably, an IRES sequence separates the signal protein and protein of interest (e.g., a GPCR) and the genes encoding the signal protein and protein of interest are expressed as a single transcriptional unit. The present invention is not limited to any particular signal protein. Indeed, the use of a variety of signal proteins for which easy assays exist is contemplated. These signal proteins include, but are not limited to, green fluorescent protein, luciferase, beta-galactosidase, and antibody heavy or light chains. It is contemplated that when the signal protein and protein of interest are co-expressed from a polycistronic sequence, the presence of the signal protein is indicative of the presence of the protein of interest. Accordingly, in some embodiments, the present invention provides methods for indirectly detecting the expression of protein of interest comprising providing a host cell transfected with a vector encoding a polycistronic sequence, wherein the polycistronic sequence comprises a signal protein and a protein of interest operably linked by an IRES, and culturing the host cells under conditions such that the signal protein and protein of interest are produced, wherein the presence of the signal protein indicates the presence of the protein of interest. [0173]
  • Experimental [0174]
  • The following examples serve to illustrate certain preferred embodiments and aspects of the present invention and are not to be construed as limiting the scope thereof. [0175]
  • In the experimental disclosure which follows, the following abbreviations apply: M (molar); mM (millimolar); μM (micromolar); nM (nanomolar); mol (moles); mmol (millimoles); μmol (micromoles); nmol (nanomoles); gm (grams); mg (milligrams); μg (micrograms);pg (picograms); L (liters); ml (milliliters); μl (microliters); cm (centimeters); mm (millimeters); μm (micrometers); nm (nanometers); °C. (degrees Centigrade); AMP (adenosine 5′-monophosphate); BSA (bovine serum albumin); cDNA (copy or complimentary DNA); CS (calf serum); DNA (deoxyribonucleic acid); ssDNA (single stranded DNA); dsDNA (double stranded DNA); dNTP (deoxyribonucleotide triphosphate); LH (luteinizing hormone); NIH (National Institutes of Health, Besthesda, MD); RNA (ribonucleic acid); PBS (phosphate buffered saline); g (gravity); OD (optical density); HEPES (N-[2-αHydroxyethyl]piperazine-N-[2-ethanesulfonic acid]); HBS (HEPES buffered saline); PBS (phosphate buffered saline); SDS (sodium dodecylsulfate); Tris-HCl (tris[Hydroxymethyl]aminomethane-hydrochloride); Klenow (DNA polymerase I large (Klenow) fragment); rpm (revolutions per minute); EGTA (ethylene glycol-bis(β-aminoethyl ether) N, N, N′, N′-tetraacetic acid); EDTA (ethylenediaminetetracetic acid); bla (β-lactamase or ampicillin-resistance gene); ORI (plasmid origin of replication); lacI (lac repressor); X-gal (5-bromo-4-chloro-3-indolyl-β-D-galactoside); ATCC (American Type Culture Collection, Rockville, Md.); GIBCO/BRL (GIBCO/BRL, Grand Island, N.Y.); Perkin-Elmer (Perkin-Elmer, Norwalk, Conn.); and Sigam (Sigma Chemical Company, St. Louis, Mo.). [0176]
  • EXAMPLE 1 Vector Construction
  • The following Example describes the construction of vectors used in the experiments below. [0177]
  • A. CMV MN14 [0178]
  • The CMV MN14 vector (SEQ ID NO:4; MN14 antibody is described in U.S. Pat. No. 5,874,540, incorporated herein by reference) comprises the following elements, arranged in 5′ to 3′ order: CMV promoter; MN14 heavy chain signal peptide, MN14 antibody heavy chain; IRES from encephalomyocarditis virus; bovine α-lactalbumin signal peptide; MN 14 antibody light chain; and 3′ MoMuLV LTR. In addition to sequences described in SEQ ID NO: 4, the CMV MN14 vector further comprises a 5′ MoMuLV LTR, a MoMuLV extended viral packaging signal, and a neomycin phosphotransferase gene (these additional elements are provided in SEQ ID NO:7; the 5′ LTR is derived from Moloney Murine Sarcoma Virus in each of the constructs described herein, but is converted to the [0179] MoMuLV 5′ LTR when integrated).
  • This construct uses the 5′ MoMuLV LTR to control production of the neomycin phosphotransferase gene. The expression of MN14 antibody is controlled by the CMV promoter. The MN14 heavy chain gene and light chain gene are attached together by an IRES sequence. The CMV promoter drives production of a mRNA containing the heavy chain gene and the light chain gene attached by the IRES. Ribosomes attach to the mRNA at the CAP site and at the IRES sequence. This allows both heavy and light chain protein to be produced from a single mRNA. The mRNA expression from the LTR as well as from the CMV promoter is terminated and poly adenylated in the 3′ LTR. The construct was cloned by similar methods as described in section B below. [0180]
  • The IRES sequence (SEQ ID NO:3) comprises a fusion of the IRES from the plasmid pLXIN (Clontech) and the bovine α-lactalbumin signal peptide. The initial ATG of the signal peptide was attached to the IRES to allow the most efficient translation initiation from the IRES. The 3′ end of the signal peptide provides a multiple cloning site allowing easy attachment of any protein of interest to create a fusion protein with the signal peptide. The IRES sequence can serve as a translational enhancer as well as creating a second translation initiation site that allows two proteins to be produced from a single mRNA. [0181]
  • The IRES-bovine α-lactalbumin signal peptide was constructed as follows. The portion of the plasmid pLXIN (Clontech, Palo Alto, Calif.) containing the ECMV IRES was PCR amplified using the following primers. [0182]
  • Primer 1 (SEQ ID NO: 35): [0183]
  • 5′ [0184] GATCCACTAGTAACGGCCGCCAGAATTCGC 3′
  • Primer 2 (SEQ ID NO: 36): [0185]
  • 5′ [0186] CAGAGAGACAAAGGAGGCCATATTATCATCGTGTTTTTCAAAG 3′
  • [0187] Primer 2 attaches a tail corresponding to the start of the bovine α-lactalbumin signal peptide coding region to the IRES sequence. In addition, the second triplet codon of the α-lactalbumin signal peptide was mutated from ATG to GCC to allow efficient translation from the IRES sequence. This mutation results in a methionine to alanine change in the protein sequence. This mutation was performed because the IRES prefers an alanine as the second amino acid in the protein chain. The resulting IRES PCR product contains an EcoRI site on the 5′ end of the fragment (just downstream of Primer 1 above).
  • Next, the α-lactalbumin signal peptide containing sequence was PCR amplified from the α-LA Signal Peptide vector construct using the following primers. [0188]
  • Primer 3 (SEQ ID NO: 14): [0189]
  • 5′ [0190] CTTTGAAAAACACGATGATAATATGGCCTCCTTTGTCTCTCTG 3′
  • Primer 4 (SEQ ID NO: 15): [0191]
  • 5′ [0192] TTCGCGAGCTCGAGATCTAGATATCCCATG 3′
  • [0193] Primer 3 attaches a tail corresponding to the 3′ end of the IRES sequence to the α-lactalbumin signal peptide coding region. As stated above, the second triplet codon of the bovine α-lactalbumin signal peptide was mutated to allow efficient translation from the IRES sequence. The resulting signal peptide PCR fragment contains NaeI, NcoI, EcoRV, XbaI, BglII and XhoI sites on the 3′ end.
  • After the IRES and signal peptide were amplified individually using the primers shown above, the two reaction products were mixed and PCR was performed using [0194] primer 1 and primer 4. The resultant product of this reaction is a spliced fragment that contains the IRES attached to the full length α-lactalbumin signal peptide. The ATG encoding the start of the signal peptide is placed at the same location as the ATG encoding the start of the neomycin phosphotransferase gene found in the vector pLXIN. The fragment also contains the EcoRI site on the 5′ end and Nael, NcoI, EcoRV, XbaI, BglII and XhoI sites on the 3′ end.
  • The spliced IRES/α-lactalbumin signal peptide PCR fragment was digested with EcoRI and XhoI. The α-LA Signal Peptide vector construct was also digested with EcoRI and XhoI. These two fragments were ligated together to give the pIRES construct. [0195]
  • The IRES/α-lactalbumin signal peptide portion of the pIRES vector was sequenced and found to contain mutations in the 5′ end of the IRES. These mutations occur in a long stretch of C's and were found in all clones that were isolated. [0196]
  • To repair this problem, pLXIN DNA was digested with EcoRI and BsmFI. The 500 bp band corresponding to a portion of the IRES sequence was isolated. The mutated IRES/α-lactalbumin signal peptide construct was also digested with EcoRI and BsmFI and the mutated IRES fragment was removed. The IRES fragment from pLXIN was then substituted for the IRES fragment of the mutated IRES/α-lactalbumin signal peptide construct. The IRES/α-LA signal peptide portion of resulting plasmid was then verified by DNA sequencing. [0197]
  • The resulting construct was found to have a number of sequence differences when compared to the expected pLXIN sequence obtained from Clontech. We also sequenced the IRES portion of pLXIN purchased from Clontech to verify its sequence. The differences from the expected sequence also appear to be present in the pLXIN plasmid that we obtained from Clontech. Four sequence differences were identified: [0198]
  • bp 347 T—was G in pLXIN sequence [0199]
  • bp 786-788 ACG—was GC in LXIN sequence. [0200]
  • B. CMV LL2 [0201]
  • The CMV LL2 (SEQ ID NO:5; LL2 antibody is described in U.S. Pat. No. 6,187,287, incorporated herein by reference) construct comprises the following elements, arranged in 5′ to 3′ order: 5° CMV promoter (Clontech), LL2 heavy chain signal peptide, LL2 antibody heavy chain; IRES from encephalomyocarditis virus; bovine α-LA signal peptide; LL2 antibody light chain; and 3′ MoMuLV LTR. In addition to sequences described in SEQ ID NO:5, the CMV LL2 vector further comprises a 5′ MoMuLV LTR, a MoMuLV extended viral packaging signal, and a neomycin phosphotransferase gene (these additional elements are provided in SEQ ID NO:7). [0202]
  • This construct uses the 5′ MoMuLV LTR to control production of the neomycin phosphotransferase gene. The expression of LL2 antibody is controlled by the CMV promoter (Clontech). The LL2 heavy chain gene and light chain gene are attached together by an IRES sequence. The CMV promoter drives production of a mRNA containing the heavy chain gene and the light chain gene attached by the IRES. Ribosomes attach to the mRNA at the CAP site and at the IRES sequence. This allows both heavy and light chain protein to be produced from a single mRNA. The mRNA expression from the LTR as well as from the CMV promoter is terminated and poly adenylated in the 3′ LTR. [0203]
  • The IRES sequence (SEQ ID NO:3) comprises a fusion of the IRES from the plasmid pLXIN (Clontech) and the bovine alpha-lactalbumin signal peptide. The initial ATG of the signal peptide was attached to the IRES to allow the most efficient translation initiation from the IRES. The 3′ end of the signal peptide provides a multiple cloning site allowing easy attachment of any protein of interest to create a fusion protein with the signal peptide. The IRES sequence can serve as a translational enhancer as well as creating a second translation initiation site that allows two proteins to be produced from a single mRNA. [0204]
  • The LL2 light chain gene was attached to the IRES α-lactalbumin signal peptide as follows. The LL2 light chain was PCR amplified from the vector pCRLL2 using the following primers. [0205]
  • Primer 1 (SEQ ID NO: 16): [0206]
  • 5′ [0207] CTACAGGTGTCCACGTCGACATCCAGCTGACCCAG 3′
  • Primer 2 (SEQ ID NO: 17): [0208]
  • 5′ [0209] CTGCAGAATAGATCTCTAACACTCTCCCCTGTTG 3′
  • These primers add a HincII site right at the start of the coding region for mature LL2 light chain. Digestion of the PCR product with HincII gives a blunt end fragment starting with the initial GAC encoding mature LL2 on the 5′ end. [0210] Primer 2 adds a BglII site to the 3′ end of the gene right after the stop codon. The resulting PCR product was digested with HincII and BglII and cloned directly into the IRES-Signal Peptide plasmid that was digested with Nael and BglII.
  • The Kozak sequence of the LL2 heavy chain gene was then modified. The vector pCRMN14HC was digested with XhoI and AvrII to remove about a 400 bp fragment. PCR was then used to amplify the same portion of the LL2 heavy chain construct that was removed by the XhoI-AvrII digestion. This amplification also mutated the 5′ end of the gene to add a better Kozak sequence to the clone. The Kozak sequence was modified to resemble the typical IgG Kozak sequence. The PCR primers are shown below. [0211]
  • Primer 1 (SEQ ID NO: 18): [0212]
  • 5′[0213] CAGTGTGATCTCGAGAATTCAGGACCTCACCATGGGATGGAGCTGTATCAT 3′
  • Primer 2 (SEQ ID NO: 19): [0214]
  • 5′[0215] AGGCTGTATTGGTGGATTCGTCT 3′
  • The PCR product was digested with XhoI and AvrII and inserted back into the previously digested plasmid backbone. [0216]
  • The “good” Kozak sequence was then added to the light chain gene. The “good” Kozak LL2 heavy chain gene construct was digested with EcoRI and the heavy chain gene containing fragment was isolated. The IRES α-Lactalbumin Signal Peptide LL2 light chain gene construct was also digested with EcoRI. The heavy chain gene was then cloned into the EcoRI site of IRES light chain construct. This resulted in the heavy chain gene being placed at the 5′ end of the IRES sequence. [0217]
  • Next, a multiple cloning site was added into the LNCX retroviral backbone plasmid. The LNCX plasmid was digested with HindIII and ClaI. Two oligonucleotide primers were produced and annealed together to create an double stranded DNA multiple cloning site. The following primers were annealed together. [0218]
  • Primer 1 (SEQ ID NO: 20): [0219]
  • 5′[0220] AGCTTCTCGAGTTAACAGATCTAGGCCTCCTAGGTCGACAT 3′
  • Primer 2 (SEQ ID NO: 21): 5′[0221]
  • CGATGTCGACCTAGGAGGCCTAGATCTGTTAACTCGAGA 3′[0222]
  • After annealing, the multiple cloning site was ligated into LNCX to create LNC-MCS. [0223]
  • Next, the double chain gene fragment was ligated into the retroviral backbone gene construct. The double chain gene construct created above was digested with SalI and BglII and the double chain containing fragment was isolated. The retroviral expression plasmid LNC-MCS was digested with XhoI and BglII. The double chain fragment was then cloned into the LNC-MCS retroviral expression backbone. [0224]
  • Next, an RNA splicing problem in the construct was corrected. The construct was digested with NsiI. The resulting fragment was then partially digested with EcoRI. The fragments resulting from the partial digest that were approximately 9300 base pairs in size were gel purified. A linker was created to mutate the splice donor site at the 3′ end of the LL2 heavy chain gene. The linker was again created by annealing two oligonucleotide primers together to form the double stranded DNA linker. The two primers used to create the linker are shown below. [0225]
  • Primer 1 (SEQ ID NO: 22): [0226]
  • 5′[0227] CGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCCGGGAAATGAAAGCCG 3′
  • Primer 2 (SEQ ID NO: 23): [0228]
  • 5′[0229] AATTCGGCTTTCATTTCCCGGGAGACAGGGAGAGGCTCTTCTGCGTGTAGTGGTTGTGCAGAGCCTCGTGCA 3′
  • After annealing the linker was substituted for the original NsiI/EcoRI fragment that was removed during the partial digestion. [0230]
  • C. MMTV MN14 [0231]
  • The MMTV MN14 (SEQ ID NO:6) construct comprises the following elements, arranged in 5′ to 3′ order: 5′ MMTV promoter; double mutated PPE sequence; MN 14 antibody heavy chain; IRES from encephalomyocarditis virus; bovine αLA signal peptide MN 14 antibody light chain; WPRE sequence; and 3′ MoMuLV LTR. In addition to the sequences described in SEQ ID NO:6, the MMTV MN14 vector further comprises a MoMuLV LTR, MoMuLV extended viral packaging signal; neomycin phosphotransferase gene located 5′ of the MMTV promoter (these additional elements are provided in SEQ ID NO: 7). [0232]
  • This construct uses the 5′ MoMuLV LTR to control production of the neomycin phosphotransferase gene. The expression of MN14 antibody is controlled by the MMTV promoter (Pharmacia). The MN14 heavy chain gene and light chain gene are attached together by an IRES/bovine α-LA signal peptide sequence (SEQ ID NO: 3). The MMTV promoter drives production of a mRNA containing the heavy chain gene and the light chain gene attached by the IRES/bovine α-LA signal peptide sequence. Ribosomes attach to the mRNA at the CAP site and at the IRES/bovine α-LA signal peptide sequence. This allows both heavy and light chain protein to be produced from a single mRNA. In addition, there are two genetic elements contained within the mRNA to aid in export of the mRNA from the nucleus to the cytoplasm and aid in poly-adenylation of the mRNA. The PPE sequence is contained between the RNA CAP site and the start of the MN14 protein coding region, the WPRE is contained between the end of MN14 protein coding and the poly-adenylation site. The mRNA expression from the LTR as well as from the MMTV promoter is terminated and poly-adenylated in the 3′ LTR. [0233]
  • ATG sequences within the PPE element (SEQ ID NO:2) were mutated to prevent potential unwanted translation initiation. Two copies of this mutated sequence were used in a head to tail array. This sequence is placed just downstream of the promoter and upstream of the Kozak sequence and signal peptide-coding region. The WPRE is isolated from woodchuck hepatitis virus and also aids in the export of mRNA from the nucleus and creating stability in the mRNA. If this sequence is included in the 3′ untranslated region of the RNA, level of protein expression from this RNA increases up to 10-fold. [0234]
  • D. α-LA MN14 [0235]
  • The α-LA MN14 (SEQ ID NO:7) construct comprises the following elements, arranged in 5′ to 3′ order: 5′ MoMuLV LTR, MoMuLV extended viral packaging signal, neomycin phosphotransferase gene, bovine/human alpha-lactalbumin hybrid promoter, double mutated PPE element, MN14 heavy chain signal peptide, MN14 antibody heavy chain, IRES from encephalomyocarditis virus/bovine αLA signal peptide, MN14 antibody light chain, WPRE sequence; and 3′ MoMuLV LTR. [0236]
  • This construct uses the 5′ MoMuLV LTR to control production of the neomycin phosphotransferase gene. The expression of MN14 antibody is controlled by the hybrid α-LA promoter (SEQ ID NO:1). The MN14 heavy chain gene and light chain gene are attached 30 together by an IRES sequence/bovine α-LA signal peptide (SEQ ID NO:3). The α-LA promoter drives production of a mRNA containing the heavy chain gene and the light chain gene attached by the IRES. Ribosomes attach to the mRNA at the CAP site and at the IRES sequence. This allows both heavy and light chain protein to be produced from a single mRNA. [0237]
  • In addition, there are two genetic elements contained within the mRNA to aid in export of the mRNA from the nucleus to the cytoplasm and aid in poly-adenylation of the mRNA. The mutated PPE sequence (SEQ ID NO:2) is contained between the RNA CAP site and the start of the MN14 protein coding region. ATG sequences within the PPE element (SEQ ID NO:2) were mutated to prevent potential unwanted translation initiation. Two copies of this mutated sequence were used in a head to tail array. This sequence is placed just downstream of the promoter and upstream of the Kozak sequence and signal peptide-coding region. The WPRE was isolated from woodchuck hepatitis virus and also aids in the export of mRNA from the nucleus and creating stability in the mRNA. If this sequence is included in the 3′ untranslated region of the RNA, level of protein expression from this RNA increases up to 10-fold. The WPRE is contained between the end of MN14 protein coding and the poly-adenylation site. The mRNA expression from the LTR as well as from the bovine/human alpha-lactalbumin hybrid promoter is terminated and poly adenylated in the 3′ LTR. [0238]
  • The bovine/human alpha-lactalbumin hybrid promoter (SEQ ID NO:1) is a modular promoter/enhancer element derived from human and bovine alpha-lactalbumin promoter sequences. The human portion of the promoter is from +15 relative to transcription start point (tsp) to −600 relative to the tsp. The bovine portion is then attached to the end of the human portion and corresponds to −550 to −2000 relative to the tsp. The hybrid was developed to remove poly-adenylation signals that were present in the bovine promoter and hinder retroviral RNA production. It was also developed to contain genetic control elements that are present in the human gene, but not the bovine. [0239]
  • For construction of the bovine/human α-lactalbumin promoter, human genomic DNA was isolated and purified. A portion of the human α-lactalbumin promoter was PCR amplified using the following two primers: [0240]
  • Primer 1 (SEQ ID NO: 24): [0241]
  • 5′[0242] AAAGCATATGTTCTGGGCCTTGTTACATGGCTGGATTGGTT 3′
  • Primer 2 (SEQ ID NO: 25): [0243]
  • 5 ′[0244] TGAATTCGGCGCCCCCAAGAACCTGAAATGGAAGCATCACTCAGTTTCATATAT 3′
  • This two primers created a NdeI site on the 5′ end of the PCR fragment and a EcoRI site on the 3′ end of the PCR fragment. [0245]
  • The human PCR fragment created using the above primers was double digested with the restriction enzymes NdeI and EcoRI. The plasmid pKBaP-1 was also double digested with NdeI and EcoRI. The plasmid pKBaP-1 contains the bovine α-[0246] lactalbumin 5′ flanking region attached to a multiple cloning site. This plasmid allows attachment of various genes to the bovine α-lactalbumin promoter.
  • Subsequently, the human fragment was ligated/substituted for the bovine fragment of the promoter that was removed from the pKBaP-1 plasmid during the double digestion. The resulting plasmid was confirmed by DNA sequencing to be a hybrid of the Bovine and Human a-lactalbumin promoter/regulatory regions. [0247]
  • Attachment of the MN14 light chain gene to the IRES α-lactalbumin signal peptide was accomplished as follows. The MN14 light chain was PCR amplified from the vector pCRMN14LC using the following primers. [0248]
  • Primer 1 (SEQ ID NO: 26): 5′[0249] CTACAGGTGTCCACGTCGACATCCAGCTGACCCAG 3′
  • Primer 2 (SEQ ID NO: 27): 5′ [0250] CTGCAGAATAGATCTCTAACACTCTCCCCTGTTG 3′
  • These primers add a HincII site right at the start of the coding region for mature MN14 light chain. Digestion of the PCR product with HincII gives a blunt end fragment starting with the initial GAC encoding mature MN14 on the 5′ end. [0251] Primer 2 adds a BglII site to the 3′ end of the gene right after the stop codon. The resulting PCR product was digested with HincII and BglII and cloned directly into the IRES-Signal Peptide plasmid that was digested with NaeI and BglII.
  • Next, the vector pCRMN14HC was digested with XhoI and NruI to remove about a 500 bp fragment. PCR was then used to amplify the same portion of the MN14 heavy chain construct that was removed by the XhoI-NruI digestion. This amplification also mutated the 5′ end of the gene to add a better Kozak sequence to the clone. The Kozak sequence was modified to resemble the typical IgG Kozak sequence. The PCR primers are shown below. [0252]
  • Primer 1 (SEQ ID NO: 28): [0253]
  • 5′[0254] CAGTGTGATCTCGAGAATTCAGGACCTCACCATGGGATGGAGCTGTATCAT 3′
  • Primer 2 (SEQ ID NO: 29): [0255]
  • 5′[0256] GTGTCTTCGGGTCTCAGGCTGT 3′
  • The PCR product was digested with XhoI and NruI and inserted back into the previously digested plasmid backbone. [0257]
  • Next, the “good” Kozak MN14 heavy chain gene construct was digested with EcoRI and the heavy chain gene containing fragment was isolated. The IRES α-Lactalbumin Signal Peptide MN14 light chain gene construct was also digested with EcoRI. The heavy chain gene was then cloned into the EcoRI site of IRES light chain construct. This resulted in the heavy chain gene being placed at the 5′ end of the IRES sequence. [0258]
  • A multiple cloning site was then added to the LNCX retroviral backbone plasmid. The LNCX plasmid was digested with HindIII and ClaI. Two oligonucleotide primers were produced and annealed together to create an double stranded DNA multiple cloning site. The following primers were annealed together. [0259]
  • Primer 1 (SEQ ID NO: 30): [0260]
  • 5′ [0261] AGCTTCTCGAGTTAACAGATCTAGGCCTCCTAGGTCGACAT 3′
  • Primer 2 (SEQ ID NO: 31): [0262]
  • 5′ [0263] CGATGTCGACCTAGGAGGCCTAGATCTGTTAACTCGAGA 3′
  • After annealing the multiple cloning site was ligated into LNCX to create LNC-MCS. [0264]
  • The double chain gene fragment was then inserted into a retroviral backbone gene construct. The double chain gene construct created in [0265] step 3 was digested with SailI and BglII and the double chain containing fragment was isolated. The retroviral expression plasmid LNC-MCS was digested with XhoI and BglII. The double chain fragment was then cloned into the LNC-MCS retroviral expression backbone.
  • Next, a RNA splicing problem in the construct was repaired. The construct was digested with NsiI. The resulting fragment was then partially digested with EcoRI. The fragments resulting from the partial digest that were approximately 9300 base pairs in size, were gel purified. A linker was created to mutate the splice donor site at the 3′ end of the MN14 heavy chain gene. The linker was again created by annealing two oligonucleotide primers together to form the double stranded DNA linker. The two primers used to create the linker are shown below. [0266]
  • Primer 1 (SEQ ID NO: 32): [0267]
  • 5′[0268] CGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCCGGGAAATGAAAGCCG 3′
  • Primer 2 (SEQ ID NO: 33): [0269]
  • 5′[0270] AATTCGGCTTTCATTTCCCGGGAGACAGGGAGAGGCTCTTCTGCGTGTAGTGGTTGTGCAGAGCCTCGTGCA 3′
  • After annealing the linker was substituted for the original NsiI/EcoRI fragment that was removed during the partial digestion. [0271]
  • Next, the mutated double chain fragment was inserted into the α-Lactalbumin expression retroviral backbone LN α-LA-Mertz-MCS. The gene construct produced above was digested with BamHI and BglII and the mutated double chain gene containing fragment was isolated. The LN α-LA-Mertz-MCS retroviral backbone plasmid was digested with BglII. The BamHI/BglII fragment was then inserted into the retroviral backbone plasmid. [0272]
  • A WPRE element was then inserted into the gene construct. The plasmid BluescriptII SK+WPRE-B11 was digested with BamHI and HincII to remove the WPRE element and the element was isolated. The vector created above was digested with BglII and HpaI. The WPRE fragment was ligated into the BglII and HpaI sites to create the final gene construct. [0273]
  • E. α-LA Bot [0274]
  • The α-LA Bot (SEQ ID NO:8, botulinum toxin antibody) construct comprises the following elements, arranged in 5′ to 3′ order: bovine/human alpha-lactalbumin hybrid promoter, mutated PPE element, cc49 signal peptide, botulinum toxin antibody light chain, IRES from encephalomyocarditis virus/bovine α-LA signal peptide, botulinum toxin antibody heavy chain, WPRE sequence, and 3′ MoMuLV LTR. In addition, the α-LA botulinum toxin antibody vector further comprises a 5′ MoMuLV LTR, a MoMuLV extended viral packaging signal, and a neomycin phosphotransferase gene (these additional elements are provided in SEQ ID NO: 7). [0275]
  • This construct uses the 5′ MoMuLV LTR to control production of the neomycin phosphotransferase gene. The expression of botulinum toxin antibody is controlled by the hybrid a-LA promoter. The botulinum toxin antibody light chain gene and heavy chain gene are attached together by an IRES/bovine α-LA signal peptide sequence. The bovine/human alpha-lactalbumin hybrid promoter drives production of a mRNA containing the light chain gene and the heavy chain gene attached by the IRES. Ribosomes attach to the mRNA at the CAP site and at the IRES sequence. This allows both light and heavy chain protein to be produced from a single mRNA. [0276]
  • In addition, there are two genetic elements contained within the mRNA to aid in export of the mRNA from the nucleus to the cytoplasm and aid in poly-adenylation of the mRNA. The mutated PPE sequence (SEQ ID NO:2) is contained between the RNA CAP site and the start of the MN14 protein coding region. ATG sequences within the PPE element (SEQ ID NO:2) were mutated to prevent potential unwanted translation initiation. Two copies of this mutated sequence were used in a head to tail array. This sequence was placed just downstream of the promoter and upstream of the Kozak sequence and signal peptide-coding region. The WPRE was isolated from woodchuck hepatitis virus and also aids in the export of mRNA from the nucleus and creating stability in the niRNA. If this sequence is included in the 3′ untranslated region of the RNA, level of protein expression from this RNA increases up to 10-fold. The WPRE is contained between the end of MN14 protein coding and the poly-adenylation site. The mRNA expression from the LTR as well as from the bovine/human alpha-lactalbumin hybrid promoter is terminated and poly adenylated in the 3′ LTR. [0277]
  • The bovine/human α-lactalbumin hybrid promoter (SEQ ID NO:1) is a modular promoter/enhancer element derived from human and bovine α-lactalbumin promoter sequences. The human portion of the promoter is from +15 relative to transcription start point to −600 relative to the tsp. The bovine portion is then attached to the end of the human portion and corresponds to −550 to −2000 relative to the tsp. The hybrid was developed to remove poly-adenylation signals that were present in the bovine promoter and hinder retroviral RNA production. It was also developed to contain genetic control elements that are present in the human gene, but not the bovine. Likewise, the construct contains control elements present in the bovine but not in the human. [0278]
  • F. LSRNL [0279]
  • The LSRNL (SEQ ID NO:9) construct comprises the following elements, arranged in 5′ to 3′ order: 5′ MoMuLV LTR, MoMuLV viral packaging signal; hepatitis B surface antigen; RSV promoter; neomycin phosphotransferase gene; and 3′ MoMuLV LTR. [0280]
  • This construct uses the 5′ MoMuLV LTR to control production of the Hepatitis B surface antigen gene. The expression of the neomycin phosphotransferase gene is controlled by the RSV promoter. The mRNA expression from the LTR as well as from the RSV promoter is terminated and poly adenylated in the 3′ LTR. [0281]
  • G. α-LA cc49IL2 [0282]
  • The α-LA cc49IL2 (SEQ ID NO:10; the cc49 antibody is described in U.S. Pat. Nos. 5,512,443; 5,993,813; and 5,892,019; each of which is herein incorporated by reference) construct comprises the following elements, arranged in 5′ to 3′ order: 5′ bovine/human α-lactalbumin hybrid promoter; cc49-IL2 coding region; and 3′ MoMuLV LTR. This gene construct expresses a fusion protein of the single chain antibody cc49 attached to Interleukin-2. Expression of the fusion protein is controlled by the bovine/human α-lactalbumin hybrid promoter. [0283]
  • The bovine/human α-lactalbumin hybrid promoter (SEQ ID NO:1) is a modular promoter/enhancer element derived from human and bovine alpha-lactalbumin promoter sequences. The human portion of the promoter is from +15 relative to transcription start point to −600 relative to the tsp. The bovine portion is then attached to the end of the human portion and corresponds to −550 to −2000 relative to the tsp. The hybrid was developed to remove poly-adenylation signals that were present in the bovine promoter and hinder retroviral RNA production. It was also developed to contain genetic control elements that are present in the human gene, but not the bovine. Likewise, the construct contains control elements present in the bovine but not in the human. The 3′ viral LTR provide the poly-adenylation sequence for the mRNA. [0284]
  • H. α-LA YP [0285]
  • The α-LA YP (SEQ ID NO: 11) construct comprises the following elements, arranged in 5′ to 3′ order: 5′ bovine/human alpha-lactalbumin hybrid promoter; double mutated PPE sequence; bovine αLA signal peptide; [0286] Yersenia pestis antibody heavy chain Fab coding region; EMCV IRES/bovine α-LA signal peptide; Yersenia pestis antibody light chain Fab coding region; WPRE sequence; 3′ MoMuLV LTR.
  • This gene construct will cause the expression of [0287] Yersenia pestis mouse Fab antibody. The expression of the gene construct is controlled by the bovine/human α-lactalbumin hybrid promoter. The PPE sequence and the WPRE sequence aid in moving the mRNA from the nucleus to the cytoplasm. The IRES sequence allows both the heavy and the light chain genes to be translated from the same mRNA. The 3′ viral LTR provides the poly-adenylation sequence for the mRNA.
  • In addition, there are two genetic elements contained within the mRNA to aid in export of the mRNA from the nucleus to the cytoplasm and aid in poly-adenylation of the mRNA. The mutated PPE sequence (SEQ ID NO:2) is contained between the RNA CAP site and the start of the MN14 protein coding region. ATG sequences within the PPE element (SEQ ID NO:2) were mutated ([0288] bases 4, 112, 131, and 238 of SEQ ID NO: 2 were changed from a G to a T) to prevent potential unwanted translation initiation. Two copies of this mutated sequence were used in a head to tail array. This sequence was placed just downstream of the promoter and upstream of the Kozak sequence and signal peptide-coding region. The WPRE was isolated from woodchuck hepatitis virus and also aids in the export of mRNA from the nucleus and creating stability in the mRNA. If this sequence is included in the 3′ untranslated region of the RNA, level of protein expression from this RNA increases up to 10-fold. The WPRE is contained between the end of MN14 protein coding and the poly-adenylation site. The mRNA expression from the LTR as well as from the bovine/human alpha-lactalbumin hybrid promoter is terminated and poly adenylated in the 3′ LTR.
  • The bovine/human alpha-lactalbumin hybrid promoter (SEQ ID NO:1) is a modular promoter/enhancer element derived from human and bovine alpha-lactalbumin promoter sequences. The human portion of the promoter is from +15 relative to transcription start point to −600 relative to the tsp. The bovine portion is then attached to the end of the human portion and corresponds to −550 to −2000 relative to the tsp. The hybrid was developed to remove poly-adenylation signals that were present in the bovine promoter and hinder retroviral RNA production. It was also developed to contain genetic control elements that are present in the human gene, but not the bovine. Likewise, the construct contains control elements present in the bovine but not in the human. [0289]
  • EXAMPLE 2 Generation of Cell Lines Stably Expressing the MoMLV Gag and Pol Proteins
  • Examples 2-5 describe the production of pseudotyped retroviral vectors. These methods are generally applicable to the production of the vectors described above. The expression of the fusogenic VSV G protein on the surface of cells results in syncytium formation and cell death. Therefore, in order to produce retroviral particles containing the VSV G protein as the membrane-associated protein a two-step approach was taken. First, stable cell lines expressing the gag and pol proteins from MoMLV at high levels were generated (e.g., 293GP[0290] SD cells). The stable cell line which expresses the gag and pol proteins produces noninfectious viral particles lacking a membrane-associated protein (e.g., an envelope protein). The stable cell line was then co-transfected, using the calcium phosphate precipitation, with VSV-G and gene of interest plasmid DNAs. The pseudotyped vector generated was used to infect 293GPSD cells to produce stably transformed cell lines. Stable cell lines can be transiently transfected with a plasmid capable of directing the high level expression of the VSV G protein (see below). The transiently transfected cells produce VSV G-pseudotyped retroviral vectors which can be collected from the cells over a period of 3 to 4 days before the producing cells die as a result of syncytium formation.
  • The first step in the production of VSV G-pseudotyped retroviral vectors, the generation of stable cell lines expressing the MoMLV gag and pol proteins is described below. The human adenovirus Ad-5-transformed embryonal kidney cell line 293 (ATCC CRL 1573) was cotransfected with the pCMVgag-pol and the gene encoding for phleomycin. pCMV gag-pol contains the MoMLV gag and pol genes under the control of the CMV promoter (pCMV gag-pol is available from the ATCC). [0291]
  • The plasmid DNA was introduced into the 293 cells using calcium phosphate co-precipitation (Graham and Van der Eb, Virol. 52:456 [1973]). Approximately 5×10[0292] 5 293 cells were plated into a 100 mm tissue culture plate the day before the DNA co-precipitate was added. Stable transformants were selected by growth in DMEM-high glucose medium containing 10% FCS and 10 μg/ml phleomycin (selective medium). Colonies which grew in the selective medium were screened for extracellular reverse transcriptase activity (Goff et al., J. Virol. 38:239 [1981]) and intracellular p30gag expression. The presence of p30gag expression was determined by Western blotting using a goat-anti p30 antibody (NCI antiserum 77S000087). A clone which exhibited stable expression of the retroviral genes was selected. This clone was named 293GPSD (293 gag-pol-San Diego). The 293GPSD cell line, a derivative of the human Ad-5-transformed embryonal kidney cell line 293, was grown in DMEM-high glucose medium containing 10% FCS.
  • EXAMPLE 3 Preparation of Pseudotyped Retroviral Vectors Bearing the G Glycoprotein of VSV
  • In order to produce VSV G protein pseudotyped retrovirus the following steps were taken. The 293GP[0293] SD cell line was co-transfected with VSV-G plasmid and DNA plasmid of interest. This co-transfection generates the infectious particles used to infect 293GPSD cells to generate the packaging cell lines. This Example describes the production of pseudotyped LNBOTDC virus. This general method may be used to produce any of the vectors described in Example 1.
  • a) Cell Lines and Plasmids [0294]
  • The packaging cell line, 293GP[0295] SD was grown in alpha-MEM-high glucose medium containing 10% FCS The titer of the pseudo-typed virus may be determined using either 208F cells (Quade, Virol. 98:461 [1979]) or NIH/3T3 cells (ATCC CRL 1658); 208F and NIH/3T3 cells are grown in DMEM-high glucose medium containing 10% CS.
  • The plasmid LNBOTDC contains the gene encoding BOTD under the transcriptional control of cytomegalovirus intermediate-early promoter followed by the gene encoding neomycin phosphotransferase (Neo) under the transcriptional control of the LTR promoter. The plasmid pHCMV-G contains the VSV G gene under the transcriptional control of the human cytomegalovirus intermediate-early promoter (Yee et al., Meth. Cell Biol. 43:99 [1994]). [0296]
  • b) Production of stable packaging cell lines, pseudotyped vector and Titering of Pseudotyped LNBOTDC Vector [0297]
  • LNBOTDC DNA (SEQ ID NO: 13) was co-transfected with pHCMV-G DNA into the packaging line 293GP[0298] SD to produce LNBOTDC virus. The resulting LNBOTDC virus was then used to infect 293GPSD cells to transform the cells. The procedure for producing pseudotyped LNBOTDC virus was carried out as described (Yee et al., Meth. Cell Biol. 43:99 [1994].
  • This is a retroviral gene construct that upon creation of infectious replication defective retroviral vector will cause the insertion of the sequence described above into the cells of interest. Upon insertion the CMV regulatory sequences control the expression of the botulinum toxin antibody heavy and light chain genes. The IRES sequence allows both the heavy and the light chain genes to be translated from the same mRNA. The 3′ viral LTR provides the poly-adenylation sequence for the mRNA. [0299]
  • Both heavy and light chain protein for botulinum toxin antibody are produced from this signal mRNA. The two proteins associated to form active botulinum toxin antibody. The heavy and light chain proteins also appear to be formed in an equal molar ratio to each other. [0300]
  • Briefly, on [0301] day 1, approximately 5×104 293GPSD cells were placed in a 75 cm2 tissue culture flask. On the following day (day 2), the 293GPSD cells were transfected with 25 μg of pLNBOTDC plasmid DNA and 25 μg of VSV-G plasmid DNA using the standard calcium phosphate co-precipitation procedure (Graham and Van der Eb, Virol. 52:456 [1973]). A range of 10 to 40 μg of plasmid DNA may be used. Because 293GPSD cells may take more than 24 hours to attach firmly to tissue culture plates, the 293GPSD cells may be placed in 75 cm2 flasks 48 hours prior to transfection. The transfected 293GPSD cells provide pseudotyped LNBOTDC virus.
  • On [0302] day 3, approximately 1×105 293GPSD cells were placed in a 75 cm2 tissue culture flask 24 hours prior to the harvest of the pseudotyped virus from the transfected 2933GPSD cells. On day 4, culture medium was harvested from the transfected 293GPSD cells 48 hours after the application of the pLNBOTDC and VSV-G DNA. The culture medium was filtered through a 0.45 μm filter and polybrene was added to a final concentration of 8 μg/ml. The culture medium containing LNBOTDC virus was used to infect the 293GPSD cells as follows. The culture medium was removed from the 293GPSD cells and was replaced with the LNBOTDC virus containing culture medium. Polybrene was added to the medium following addition to cells. The virus containing medium was allowed to remain on the 293GPSD cells for 24 hours. Following the 16 hour infection period (on day 5), the medium was removed from the 293GPSD cells and was replaced with fresh medium containing 400 μg/ml G418 (GIBCO/BRL). The medium was changed approximately every 3 days until G418-resistant colonies appeared approximately two weeks later.
  • The G418-resistant 293 colonies were plated as single cells in 96 wells. Sixty to one hundred G418-resistant colonies were screened for the expression of the BOTDC antibody in order to identify high producing clones. The top 10 clones in 96-well plates were transferred 6-well plates and allowed to grow to confluency. [0303]
  • The top 10 clones were then expanded to screen for high titer production. Based on protein expression and titer production, 5 clonal cell lines were selected. One line was designated the master cell bank and the other 4 as backup cell lines. Pseudotyped vector was generated as follows. Approximately 1×10[0304] 6 293GPSD/LNBOTDC cells were placed into a 75 cm2 tissue culture flask. Twenty-four hours later, the cells were transfected with 25 μg of pHCMV-G plasmid DNA using calcium phosphate co-precipitation. Six to eight hours after the calcium-DNA precipitate was applied to the cells, the DNA solution was replaced with fresh culture medium (lacking G41 8). Longer transfection times (overnight) were found to result in the detachment of the majority of the 293GPSD/LNBOTDC cells from the plate and are therefore avoided. The transfected 293GPSD/LNBOTDC cells produce pseudotyped LNBOTDC virus.
  • The pseudotyped LNBOTDC virus generated from the transfected 293GP[0305] SD/LNBOTDC cells can be collected at least once a day between 24 and 96 hr after transfection. The highest virus titer was generated approximately 48 to 72 hr after initial pHCMV-G transfection. While syncytium formation became visible about 48 hr after transfection in the majority of the transfected cells, the cells continued to generate pseudotyped virus for at least an additional 48 hr as long as the cells remained attached to the tissue culture plate. The collected culture medium containing the VSV G-pseudotyped LNBOTDC virus was pooled, filtered through a 0.45 μm filter and stored at −80° C. or concentrated immediately and then stored at −80° C.
  • The titer of the VSV G-pseudotyped LNBOTDC virus was then determined as follows. Approximately 5×10[0306] 4 rat 208F fibroblasts cells were plated into 6 well plates. Twenty-fours hours after plating, the cells were infected with serial dilutions of the LNBOTDC virus-containing culture medium in the presence of 8 μg/ml polybrene. Twenty four hours after infection with virus, the medium was replaced with fresh medium containing 400 μg/ml G418 and selection was continued for 14 days until G418-resistant colonies became visible. Viral titers were typically about 0.5 to 5.0×106 colony forming units (cfu)/ml. The titer of the virus stock could be concentrated to a titer of greater than 109 cfu/ml as described below.
  • EXAMPLE 4 Concentration of Pseudotyped Retroviral Vectors
  • The VSV G-pseudotyped LNBOTDC viruses were concentrated to a high titer by one cycle of ultracentrifugation. However, two cycles can be performed for further concentration. The frozen culture medium collected as described in Example 2 which contained pseudotyped LNBOTDC virus was thawed in a 37° C. water bath and was then transferred to Oakridge centrifuge tubes (50 ml Oakridge tubes with sealing caps, Nalge Nunc International) previously sterilized by autoclaving. The virus was sedimented in a JA20 rotor (Beckman) at 48,000×g (20,000 rpm) at 4° C. for 120 min. The culture medium was then removed from the tubes in a biosafety hood and the media remaining in the tubes was aspirated to remove the supernatent. The virus pellet was resuspended to 0.5 to 1% of the original volume of culture medium DMEM. The resuspended virus pellet was incubated overnight at 4° C. without swirling. The virus pellet could be dispersed with gentle pipetting after the overnight incubation without significant loss of infectious virus. The titer of the virus stock was routinely increased 100- to 300-fold after one round of ultracentrifugation. The efficiency of recovery of infectious virus varied between 30 and 100%. [0307]
  • The virus stock was then subjected to low speed centrifugation in a microfuge for 5 min at 4° C. to remove any visible cell debris or aggregated virions that were not resuspended under the above conditions. It was noted that if the virus stock is not to be used for injection into oocytes or embryos, this centrifugation step may be omitted. [0308]
  • The virus stock can be subjected to another round of ultracentrifugation to further concentrate the virus stock. The resuspended virus from the first round of centrifugation is pooled and pelleted by a second round of ultracentrifugation which is performed as described above. Viral titers are increased approximately 2000-fold after the second round of ultracentrifugation (titers of the pseudotyped LNBOTDC virus are typically greater than or equal to 1×10[0309] 9 cfu/ml after the second round of ultracentrifugation).
  • The titers of the pre- and post-centrifugation fluids were determined by infection of 208F cells (NIH 3T3 or bovine mammary epithelial cells can also be employed) followed by selection of G418-resistant colonies as described above in Example 2. [0310]
  • EXAMPLE 5 Preparation of Pseudotyped Retrovirus for Infection of Host Cells
  • The concentrated pseudotyped retroviruses were resuspended in 0.1×HBS (2.5 mM HEPES, pH 7.12, 14 mM NaCl, 75 μM Na[0311] 2HPO4-H2O) and 18 pl aliquots were placed in 0.5 ml vials (Eppendorf) and stored at −80° C. until used. The titer of the concentrated vector was determined by diluting 1μl of the concentrated virus 10−7- or 10−8-fold with 0.1×HBS. The diluted virus solution was then used to infect 208F and bovine mammary epithelial cells and viral titers were determined as described in Example 2.
  • EXAMPLE 6 Expression of MN14 by Host Cells
  • This Example describes the production of antibody MN14 from cells transfected with a high number of integrating vectors. Pseudotyped vector were made from the packaging cell lines for the following vectors: CMV MN14, α-LA MN14, and MMTV MN14. Rat fibroblasts (208F cells), MDBK cells (bovine kidney cells), and bovine mammary epithelial cells were transfected at a multiplicity of infection of 1000. One thousand cells were plated in a T25 flask and 10[0312] 6 colony forming units (CFU's) of vector in 3 ml media was incubated with the cells. The duration of the infection was 24 hr, followed by a media change. Following transfection, the cells were allowed to grow and become confluent.
  • The cell lines were grown to confluency in T25 flasks and 5 ml of media was changed daily. The media was assayed daily for the presence of MN14. All of the MN14 produced is active (an ELISA to detect human IgG gave the exact same values as the CEA binding ELISA) and Western blotting has shown that the heavy and light chains are produced at a ratio that appears to be a 1:1 ratio. In addition, a non-denaturing Western blot indicated that what appeared to be 100% of the antibody complexes were correctly formed (See FIG. 1: [0313] Lane 1, 85 ng control Mn14; Lane 2, bovine mammary cell line, α-LA promoter; Lane 3, bovine mammary cell line, CMV promoter; Lane 4, bovine kidney cell line, α-LA promoter; Lane 5, bovine kidney cell line, CMV promoter; Lane 6, 208 cell line, α-LA promoter; Lane 7, 208 cell line, CMV promoter)).
  • FIG. 2 is a graph showing the production of MN14 over time for four cell lines. The Y axis shows MN14 production in ng/ml of media. The X-axis shows the day of media collection for the experiment. Four sets of data are shown on the graph. The comparisons are between the CMV and α-LA promoter and between the 208 cells and the bovine mammary cells. The bovine mammary cell line exhibited the highest expression, followed by the 208F cells and MDBK cells. With respect to the constructs, the CMV driven construct demonstrated the highest level of expression, followed by the α-LA driven gene construct and the MMTV construct. At 2 weeks, the level of daily production of the CMV construct was 4.5 μg/ml of media (22.5 mg/day in a T25 flask). The level of expression subsequently increased slowly to 40 μg/day as the cells became very densely confluent over the subsequent week. 2.7 L of media from an α-lac-MN14 packaging cell line was processed by affinity chromatography to produce a purified stock of MN14. [0314]
  • FIG. 3 is a western blot of a 15% SDS-PAGE gel run under denaturing conditions in order to separate the heavy and light chains of the MN14 antibody. [0315] Lane 1 shows MN14 from bovine mammary cell line, hybrid α-LA promoter; lane 2 shows MN14 from bovine mammary cell line, CMV promoter; lane 3 shows MN14 from bovine kidney cell line, hybrid αLA promoter; lane 4 shows MN14 from bovine kidney cell line, CMV promoter; lane 5 shows MN14 from rat fibroblast cell line, hybrid α-LA promoter; lane 6 shows MN14 from rat fibroblast, CMV promoter. In agreement with FIG. 1 above, the results show that the heavy and light chains are produced in a ratio of approximately 1:1.
  • EXAMPLE 7 Quantitation of Protein Produced Per Cell
  • This Example describes the quantitation of the amount of protein produced per cell in cell cultures produced according to the invention. Various cells (208F cells, MDBK cells, and bovine mammary cells) were plated in 25 cm[0316] 2 culture dishes at 1000 cells/dish. Three different vectors were used to infect the three cells types (CMV-MN14, MMTV-MN14, and α-LA-MN14) at an MOI of 1000 (titers: 2.8×106, 4.9×106, and 4.3×106, respectively). Media was collected approximately every 24 hours from all cells. Following one month of media collection, the 208F and MDBK cells were discarded due to poor health and low MN14 expression. The cells were passaged to T25 flasks and collection of media from the bovine mammary cells was continued for approximately 2 months with continued expression of MN14. After two months in T25 flasks, the cells with CMV promoters were producing 22.5 pg/cell/day and the cells with α LA promoters were producing 2.5 pg MN14/cell/day.
  • After 2 months in T25 flasks, roller bottles (850 cm[0317] 2) were seeded to scale-up production and to determine if MN14 expression was stable following multiple passages. Two roller bottles were seeded with bovine mammary cells expressing MN14 from a CMV promoter and two roller bottles were seeded with bovine mammary cells expressing MN14 from the α-LA promoter. The cultures reached confluency after approximately two weeks and continue to express MN14. Roller bottle expression is shown in Table 1 below.
    TABLE 1
    Production of MN14 in Roller Bottles
    MN14
    MN14 Production/
    Production/ Week - Total
    Cell Line Promoter Week (μg/ml) (μg/ml)
    Bovine CMV 2.6 1 - 520 
    mammary
    Bovine CMV 10.6  2 - 2120 
    mammary
    Bovine CMV 8.7  3 - 1740 
    mammary
    Bovine CMV 7.8  4 - 1560 
    mammary
    Bovine α-LA 0.272 1 - 54.4
    mammary
    Bovine α-LA 2.8 2 - 560 
    mammary
    Bovine α-LA 2.2 3 - 440 
    mammary
    Bovine α-LA 2.3 4 - 460 
    mammary
  • EXAMPLE 8 Expression of LL2 Antibody
  • This Example demonstrates the expression of antibody LL2 by bovine mammary cells and 293 human kidney fibroblast cells. Bovine mammary cells were infected with vector CMV LL2 (7.85×10[0318] 7 CFU/ml) at MOI's of 1000 and 10,000 and plated in 25cm2 culture dishes. None of the cells survived transfection at the MOI of 10,000. At 20% confluency, 250 ng/ml of LL2 was present in the media. Active LL2 antibody was produced by both cell types. Non-denaturing and denaturing western analysis demonstrated that all the antibody produced is active and correctly assembled in approximately a 1:1 ratio of heavy:light chain.
  • EXAMPLE 9 Expression of Bot Antibody by Bovine Mammary Cells
  • This Example demonstrates the expression of botulinum toxin antibody in bovine mammary cells. Bovine mammary cells were infected with vector α-LA Bot (2.2×10[0319] 2 CFU/ml) and plated in 25 cm2 culture dishes. At 100% confluency, 6 ng/ml of botulinum toxin antibody was present in the media.
  • EXAMPLE 10 Expression of Hepatitis B Surface Antigen by Bovine Mammary Cells
  • This Example demonstrates the expression of Hepatitis B Surface Antigen antibody in bovine mammary cells. Bovine mammary cells were infected with vector LSRNL (350 CFU/ml) and plated in 25 cm[0320] 2 culture dishes. At 100% confluency, 20 ng/ml of Hepatitis B Surface Antigen was present in the media.
  • EXAMPLE 11 Expression of cc49IL2 Antigen Binding Protein
  • This Example demonstrates the expression of cc49IL2 in bovine mammary cells and human kidney fibroblast cells. Bovine mammary cells were infected with vector LSRNL (3.1×10[0321] 5 CFU/ml) at a MOI of 1000 and plated in 25 cm2 culture dishes. At 100% confluency, 10 μg/ml of cc49IL2 was present in the media. Human kidney fibroblast (293) cells were infected with the α-LA cc49IL2 vector. Active cc49-IL2 fusion protein was produced by the cells.
  • EXAMPLE 12 Production of YP Antibody
  • This Example demonstrates the production of [0322] Yersinea pestis antibody by bovine mammary epithelial cells and human kidney fibroblast cells (293 cells). Cells lines were infected with the α-LA YP vector. Both of the cell lines produced YP antibody. All of the antibody is active and the heavy and light chains are produced in a ratio approximating 1:1.
  • EXAMPLE 13 Expression of Multiple Proteins by Bovine Mammary Cells
  • This Example demonstrates the expression of multiple proteins in bovine mammary cells. Mammary cells producing MN14 (infected with CMV-MN14 vector) were infected with cc49IL2 vector (3.1×10[0323] 5 CFU/ml) at an MOI of 1000, and 1000 cells were plated in 25 cm2 culture plates. At 100% confluency, the cells expressed MN14 at 2.5 μg/ml and cc49IL2 at 5 μg/ml.
  • EXAMPLE 14 Expression of Multiple Proteins by Bovine Mammary Cells
  • This Example demonstrates the expression of multiple proteins in bovine mammary cells. Mammary cells producing MN14 (infected with CMV-MN14 vector) were infected with LSNRL vector (100 CFU/ml) at an MOI of 1000, and 1000 cells were plated in 25 cm[0324] 2 culture plates. At 100% confluency, the cells expressed MN14 at 2.5 μg/ml and hepatitis surface antigen at 150 ng/ml.
  • EXAMPLE 15 Expression of Multiple Proteins by Bovine Mammary Cells
  • This Example demonstrates the expression of multiple proteins in bovine mammary cells. Mammary cells producing hepatitis B surface antigen (infected with LSRNL vector) were infected with cc49IL2 vector at an MOI of 1000, and 1000 cells were plated in 25 cm[0325] 2 culture plates. At 100% confluency, the cells expressed MN14 at 2.4 and hepatitis B surface antigen at 13.
  • EXAMPLE 16 Expression of Hepatitis B Surface Antigen and Bot Antibody in Bovine Mammary Cells
  • This Example demonstrates the culture of transfected cells in roller bottle cultures. 208F cells and bovine mammary cells were plated in 25 cm[0326] 2 culture dishes at 1000 cells/25 cm2. LSRNL or α-LA Bot vectors were used to infect each cell line at a MOI of 1000. Following one month of culture and media collection, the 208F cells were discarded due to poor growth and plating. Likewise, the bovine mammary cells infected with α-LA Bot were discarded due to low protein expression. The bovine mammary cells infected with LSRNL were passaged to seed roller bottles (850 cm2). Approximately 20 ng/ml hepatitis type B surface antigen was produced in the roller bottle cultures.
  • EXAMPLE 17 Expression and Assay of G-protein Coupled Receptors
  • This example describes the expression of a G-Protein Coupled Receptor protein (GPCR) from a retroviral vector. This example also describes the expression of a signal protein from an IRES as a marker for expression of a difficult to assay protein or a protein that has no assay such as a GPCR. The gene construct (SEQ ID NO: 34; FIG. 17) comprises a G-protein-coupled receptor followed by the IRES-signal peptide-antibody light chain cloned into the MCS of pLBCX retroviral backbone. Briefly, a PvuII/PvuII fragment (3057 bp) containing the GPCR-IRES-antibody light chain was cloned into the StuI site of pLBCX. pLBCX contains the EM7 (T7) promoter, Blasticidin gene and SV40 polyA in place of the Neomycin resistance gene from pLNCX. [0327]
  • The gene construct was used to produce a replication defective retroviral packaging cell line and this cell line was used to produce replication defective retroviral vector. The vector produced from this cell line was then used to infect 293GP cells (human embryonic kidney cells). After infection, the cells were placed under Blasticidin selection and single cell Blasticidin resistant clones were isolated. The clones were screened for expression of antibody light chain. The top 12 light chain expressing clones were selected. These 12 light chain expressing clones were then screened for expression of the GPCR using a ligand binding assay. All twelve of the samples also expressed the receptor protein. The clonal cell lines and there expression are shown in Table 2. [0328]
    TABLE 2
    Cell Clone Antibody Light GPCR
    Number Chain Expression Expression
     4 + +
     8 + +
    13 + +
    19 + +
    20 + +
    22 + +
    24 + +
    27 + +
    30 + +
    45 + +
    46 + +
    50 + +
  • All publications and patents mentioned in the above specification are herein incorporated by reference. Various modifications and variations of the described method and system of the invention will be apparent to those skilled in the art without departing from the scope and spirit of the invention. Although the invention has been described in connection with specific preferred embodiments, it should be understood that the invention as claimed should not be unduly limited to such specific embodiments. Indeed, various modifications of the described modes for carrying out the invention which are obvious to those skilled in molecular biology, protein fermentation, biochemistry, or related fields are intended to be within the scope of the following claims. [0329]
  • 1 36 1 2101 DNA Artificial Sequence Synthetic 1 gatcagtcct gggtggtcat tgaaaggact gatgctgaag ttgaagctcc aatactttgg 60 ccacctgatg cgaagaactg actcatgtga taagaccctg atactgggaa agattgaagg 120 caggaggaga agggatgaca gaggatggaa gagttggatg gaatcaccaa ctcgatggac 180 atgagtttga gcaagcttcc aggagttggt aatgggcagg gaagcctggc gtgctgcagt 240 ccatggggtt gcaaagagtt ggacactact gagtgactga actgaactga tagtgtaatc 300 catggtacag aatataggat aaaaaagagg aagagtttgc cctgattctg aagagttgta 360 ggatataaaa gtttagaata cctttagttt ggaagtctta aattatttac ttaggatggg 420 tacccactgc aatataagaa atcaggcttt agagactgat gtagagagaa tgagccctgg 480 cataccagaa gctaacagct attggttata gctgttataa ccaatatata accaatatat 540 tggttatata gcatgaagct tgatgccagc aatttgaagg aaccatttag aactagtatc 600 ctaaactcta catgttccag gacactgatc ttaaagctca ggttcagaat cttgttttat 660 aggctctagg tgtatattgt ggggcttccc tggtggctca gatggtaaag tgtctgcctg 720 caatgtgggt gatctgggtt cgatccctgg cttgggaaga tcccctggag aaggaaatgg 780 caacccactc tagtactctt acctggaaaa ttccatggac agaggagcct tgtaagctac 840 agtccatggg attgcaaaga gttgaacaca actgagcaac taagcacagc acagtacagt 900 atacacctgt gaggtgaagt gaagtgaagg ttcaatgcag ggtctcctgc attgcagaaa 960 gattctttac catctgagcc accagggaag cccaagaata ctggagtggg tagcctattc 1020 cttctccagg ggatcttccc atcccaggaa ttgaactgga gtctcctgca tttcaggtgg 1080 attcttcacc agctgaacta ccaggtggat actactccaa tattaaagtg cttaaagtcc 1140 agttttccca cctttcccaa aaaggttggg tcactctttt ttaaccttct gtggcctact 1200 ctgaggctgt ctacaagctt atatatttat gaacacattt attgcaagtt gttagtttta 1260 gatttacaat gtggtatctg gctatttagt ggtattggtg gttggggatg gggaggctga 1320 tagcatctca gagggcagct agatactgtc atacacactt ttcaagttct ccatttttgt 1380 gaaatagaaa gtctctggat ctaagttata tgtgattctc agtctctgtg gtcatattct 1440 attctactcc tgaccactca acaaggaacc aagatatcaa gggacacttg ttttgtttca 1500 tgcctgggtt gagtgggcca tgacatatgt tctgggcctt gttacatggc tggattggtt 1560 ggacaagtgc cagctctgat cctgggactg tggcatgtga tgacatacac cccctctcca 1620 cattctgcat gtctctaggg gggaaggggg aagctcggta tagaaccttt attgtatttt 1680 ctgattgcct cacttcttat attgccccca tgcccttctt tgttcctcaa gtaaccagag 1740 acagtgcttc ccagaaccaa ccctacaaga aacaaagggc taaacaaagc caaatgggaa 1800 gcaggatcat ggtttgaact ctttctggcc agagaacaat acctgctatg gactagatac 1860 tgggagaggg aaaggaaaag tagggtgaat tatggaagga agctggcagg ctcagcgttt 1920 ctgtcttggc atgaccagtc tctcttcatt ctcttcctag atgtagggct tggtaccaga 1980 gcccctgagg ctttctgcat gaatataaat atatgaaact gagtgatgct tccatttcag 2040 gttcttgggg gcgccgaatt cgagctcggt acccggggat ctcgaggggg ggcccggtac 2100 c 2101 2 245 DNA Artificial Sequence Synthetic 2 gattacttac tggcaggtgc tgggggcttc cgagacaatc gcgaacatct acaccacaca 60 acaccgcctc gaccagggtg agatatcggc cggggacgcg gcggtggtaa ttacaagcga 120 ggatccgatt acttactggc aggtgctggg ggcttccgag acaatcgcga acatctacac 180 cacacaacac cgcctcgacc agggtgagat atcggccggg gacgcggcgg tggtaattac 240 aagcg 245 3 680 DNA Artificial Sequence Synthetic 3 ggaattcgcc cctctccctc ccccccccct aacgttactg gccgaagccg cttggaataa 60 ggccggtgtg cgtttgtcta tatgttattt tccaccatat tgccgtcttt tggcaatgtg 120 agggcccgga aacctggccc tgtcttcttg acgagcattc ctaggggtct ttcccctctc 180 gccaaaggaa tgcaaggtct gttgaatgtc gtgaaggaag cagttcctct ggaagcttct 240 tgaagacaaa caacgtctgt agcgaccctt tgcaggcagc ggaacccccc acctggcgac 300 aggtgcctct gcggccaaaa gccacgtgta taagatacac ctgcaaaggc ggcacaaccc 360 cagtgccacg ttgtgagttg gatagttgtg gaaagagtca aatggctctc ctcaagcgta 420 ttcaacaagg ggctgaagga tgcccagaag gtaccccatt gtatgggatc tgatctgggg 480 cctcggtgca catgctttac atgtgtttag tcgaggttaa aaaaacgtct aggccccccg 540 aaccacgggg acgtggtttt cctttgaaaa acacgatgat aatatggcct cctttgtctc 600 tctgctcctg gtaggcatcc tattccatgc cacccaggcc ggcgccatgg gatatctaga 660 tctcgagctc gcgaaagctt 680 4 4207 DNA Artificial Sequence Synthetic 4 cggatccggc cattagccat attattcatt ggttatatag cataaatcaa tattggctat 60 tggccattgc atacgttgta tccatatcat aatatgtaca tttatattgg ctcatgtcca 120 acattaccgc catgttgaca ttgattattg actagttatt aatagtaatc aattacgggg 180 tcattagttc atagcccata tatggagttc cgcgttacat aacttacggt aaatggcccg 240 cctggctgac cgcccaacga cccccgccca ttgacgtcaa taatgacgta tgttcccata 300 gtaacgccaa tagggacttt ccattgacgt caatgggtgg agtatttacg gtaaactgcc 360 cacttggcag tacatcaagt gtatcatatg ccaagtacgc cccctattga cgtcaatgac 420 ggtaaatggc ccgcctggca ttatgcccag tacatgacct tatgggactt tcctacttgg 480 cagtacatct acgtattagt catcgctatt accatggtga tgcggttttg gcagtacatc 540 aatgggcgtg gatagcggtt tgactcacgg ggatttccaa gtctccaccc cattgacgtc 600 aatgggagtt tgttttggca ccaaaatcaa cgggactttc caaaatgtcg taacaactcc 660 gccccattga cgcaaatggg cggtaggcat gtacggtggg aggtctatat aagcagagct 720 cgtttagtga accgtcagat cgcctggaga cgccatccac gctgttttga cctccataga 780 agacaccggg accgatccag cctccgcggc cccaagcttc tcgacggatc cccgggaatt 840 caggacctca ccatgggatg gagctgtatc atcctcttct tggtagcaac agctacaggt 900 gtccactccg aggtccaact ggtggagagc ggtggaggtg ttgtgcaacc tggccggtcc 960 ctgcgcctgt cctgctccgc atctggcttc gatttcacca catattggat gagttgggtg 1020 agacaggcac ctggaaaagg tcttgagtgg attggagaaa ttcatccaga tagcagtacg 1080 attaactatg cgccgtctct aaaggataga tttacaatat cgcgagacaa cgccaagaac 1140 acattgttcc tgcaaatgga cagcctgaga cccgaagaca ccggggtcta tttttgtgca 1200 agcctttact tcggcttccc ctggtttgct tattggggcc aagggacccc ggtcaccgtc 1260 tcctcagcct ccaccaaggg cccatcggtc ttccccctgg caccctcctc caagagcacc 1320 tctgggggca cagcggccct gggctgcctg gtcaaggact acttccccga accggtgacg 1380 gtgtcgtgga actcaggcgc cctgaccagc ggcgtgcaca ccttcccggc tgtcctacag 1440 tcctcaggac tctactccct cagcagcgtg gtgaccgtgc cctccagcag cttgggcacc 1500 cagacctaca tctgcaacgt gaatcacaag cccagcaaca ccaaggtgga caagagagtt 1560 gagcccaaat cttgtgacaa aactcacaca tgcccaccgt gcccagcacc tgaactcctg 1620 gggggaccgt cagtcttcct cttcccccca aaacccaagg acaccctcat gatctcccgg 1680 acccctgagg tcacatgcgt ggtggtggac gtgagccacg aagaccctga ggtcaagttc 1740 aactggtacg tggacggcgt ggaggtgcat aatgccaaga caaagccgcg ggaggagcag 1800 tacaacagca cgtaccgtgt ggtcagcgtc ctcaccgtcc tgcaccagga ctggctgaat 1860 ggcaaggagt acaagtgcaa ggtctccaac aaagccctcc cagcccccat cgagaaaacc 1920 atctccaaag ccaaagggca gccccgagaa ccacaggtgt acaccctgcc cccatcccgg 1980 gaggagatga ccaagaacca ggtcagcctg acctgcctgg tcaaaggctt ctatcccagc 2040 gacatcgccg tggagtggga gagcaatggg cagccggaga acaactacaa gaccacgcct 2100 cccgtgctgg actccgacgg ctccttcttc ctctatagca agctcaccgt ggacaagagc 2160 aggtggcagc aggggaacgt cttctcatgc tccgtgatgc acgaggctct gcacaaccac 2220 tacacgcaga agagcctctc cctgtctccc gggaaatgaa agccgaattc gcccctctcc 2280 ctcccccccc cctaacgtta ctggccgaag ccgcttggaa taaggccggt gtgcgtttgt 2340 ctatatgtta ttttccacca tattgccgtc ttttggcaat gtgagggccc ggaaacctgg 2400 ccctgtcttc ttgacgagca ttcctagggg tctttcccct ctcgccaaag gaatgcaagg 2460 tctgttgaat gtcgtgaagg aagcagttcc tctggaagct tcttgaagac aaacaacgtc 2520 tgtagcgacc ctttgcaggc agcggaaccc cccacctggc gacaggtgcc tctgcggcca 2580 aaagccacgt gtataagata cacctgcaaa ggcggcacaa ccccagtgcc acgttgtgag 2640 ttggatagtt gtggaaagag tcaaatggct ctcctcaagc gtattcaaca aggggctgaa 2700 ggatgcccag aaggtacccc attgtatggg atctgatctg gggcctcggt gcacatgctt 2760 tacatgtgtt tagtcgaggt taaaaaaacg tctaggcccc ccgaaccacg gggacgtggt 2820 tttcctttga aaaacacgat gataatatgg cctcctttgt ctctctgctc ctggtaggca 2880 tcctattcca tgccacccag gccgacatcc agctgaccca gagcccaagc agcctgagcg 2940 ccagcgtggg tgacagagtg accatcacct gtaaggccag tcaggatgtg ggtacttctg 3000 tagcctggta ccagcagaag ccaggtaagg ctccaaagct gctgatctac tggacatcca 3060 cccggcacac tggtgtgcca agcagattca gcggtagcgg tagcggtacc gacttcacct 3120 tcaccatcag cagcctccag ccagaggaca tcgccaccta ctactgccag caatatagcc 3180 tctatcggtc gttcggccaa gggaccaagg tggaaatcaa acgaactgtg gctgcaccat 3240 ctgtcttcat cttcccgcca tctgatgagc agttgaaatc tggaactgcc tctgttgtgt 3300 gcctgctgaa taacttctat cccagagagg ccaaagtaca gtggaaggtg gataacgccc 3360 tccaatcggg taactcccag gagagtgtca cagagcagga cagcaaggac agcacctaca 3420 gcctcagcag caccctgacg ctgagcaaag cagactacga gaaacacaaa gtctacgcct 3480 gcgaagtcac ccatcagggc ctgagctcgc ccgtcacaaa gagcttcaac aggggagagt 3540 gttagagatc taggcctcct aggtcgacat cgataaaata aaagatttta tttagtctcc 3600 agaaaaaggg gggaatgaaa gaccccacct gtaggtttgg caagctagct taagtaacgc 3660 cattttgcaa ggcatggaaa aatacataac tgagaataga gaagttcaga tcaaggtcag 3720 gaacagatgg aacagctgaa tatgggccaa acaggatatc tgtggtaagc agttcctgcc 3780 ccggctcagg gccaagaaca gatggaacag ctgaatatgg gccaaacagg atatctgtgg 3840 taagcagttc ctgccccggc tcagggccaa gaacagatgg tccccagatg cggtccagcc 3900 ctcagcagtt tctagagaac catcagatgt ttccagggtg ccccaaggac ctgaaatgac 3960 cctgtgcctt atttgaacta accaatcagt tcgcttctcg cttctgttcg cgcgcttctg 4020 ctccccgagc tcaataaaag agcccacaac ccctcactcg gggcgccagt cctccgattg 4080 actgagtcgc ccgggtaccc gtgtatccaa taaaccctct tgcagttgca tccgacttgt 4140 ggtctcgctg ttccttggga gggtctcctc tgagtgattg actacccgtc agcgggggtc 4200 tttcatt 4207 5 4210 DNA Artificial Sequence Synthetic 5 ggatccggcc attagccata ttattcattg gttatatagc ataaatcaat attggctatt 60 ggccattgca tacgttgtat ccatatcata atatgtacat ttatattggc tcatgtccaa 120 cattaccgcc atgttgacat tgattattga ctagttatta atagtaatca attacggggt 180 cattagttca tagcccatat atggagttcc gcgttacata acttacggta aatggcccgc 240 ctggctgacc gcccaacgac ccccgcccat tgacgtcaat aatgacgtat gttcccatag 300 taacgccaat agggactttc cattgacgtc aatgggtgga gtatttacgg taaactgccc 360 acttggcagt acatcaagtg tatcatatgc caagtacgcc ccctattgac gtcaatgacg 420 gtaaatggcc cgcctggcat tatgcccagt acatgacctt atgggacttt cctacttggc 480 agtacatcta cgtattagtc atcgctatta ccatggtgat gcggttttgg cagtacatca 540 atgggcgtgg atagcggttt gactcacggg gatttccaag tctccacccc attgacgtca 600 atgggagttt gttttggcac caaaatcaac gggactttcc aaaatgtcgt aacaactccg 660 ccccattgac gcaaatgggc ggtaggcatg tacggtggga ggtctatata agcagagctc 720 gtttagtgaa ccgtcagatc gcctggagac gccatccacg ctgttttgac ctccatagaa 780 gacaccggga ccgatccagc ctccgcggcc ccaagcttct cgacggatcc ccgggaattc 840 aggacctcac catgggatgg agctgtatca tcctcttctt ggtagcaaca gctacaggtg 900 tccactccca ggtccagctg gtccaatcag gggctgaagt caagaaacct gggtcatcag 960 tgaaggtctc ctgcaaggct tctggctaca cctttactag ctactggctg cactgggtca 1020 ggcaggcacc tggacagggt ctggaatgga ttggatacat taatcctagg aatgattata 1080 ctgagtacaa tcagaacttc aaggacaagg ccacaataac tgcagacgaa tccaccaata 1140 cagcctacat ggagctgagc agcctgaggt ctgaggacac ggcattttat ttttgtgcaa 1200 gaagggatat tactacgttc tactggggcc aaggcaccac ggtcaccgtc tcctcagcct 1260 ccaccaaggg cccatcggtc ttccccctgg caccctcctc caagagcacc tctgggggca 1320 cagcggccct gggctgcctg gtcaaggact acttccccga accggtgacg gtgtcgtgga 1380 actcaggcgc cctgaccagc ggcgtgcaca ccttcccggc tgtcctacag tcctcaggac 1440 tctactccct cagcagcgtg gtgaccgtgc cctccagcag cttgggcacc cagacctaca 1500 tctgcaacgt gaatcacaag cccagcaaca ccaaggtgga caagagagtt gagcccaaat 1560 cttgtgacaa aactcacaca tgcccaccgt gcccagcacc tgaactcctg gggggaccgt 1620 cagtcttcct cttcccccca aaacccaagg acaccctcat gatctcccgg acccctgagg 1680 tcacatgcgt ggtggtggac gtgagccacg aagaccctga ggtcaagttc aactggtacg 1740 tggacggcgt ggaggtgcat aatgccaaga caaagccgcg ggaggagcag tacaacagca 1800 cgtaccgtgt ggtcagcgtc ctcaccgtcc tgcaccagga ctggctgaat ggcaaggagt 1860 acaagtgcaa ggtctccaac aaagccctcc cagcccccat cgagaaaacc atctccaaag 1920 ccaaagggca gccccgagaa ccacaggtgt acaccctgcc cccatcccgg gaggagatga 1980 ccaagaacca ggtcagcctg acctgcctgg tcaaaggctt ctatcccagc gacatcgccg 2040 tggagtggga gagcaatggg cagccggaga acaactacaa gaccacgcct cccgtgctgg 2100 actccgacgg ctccttcttc ctctatagca agctcaccgt ggacaagagc aggtggcagc 2160 aggggaacgt cttctcatgc tccgtgatgc acgaggctct gcacaaccac tacacgcaga 2220 agagcctctc cctgtctccc gggaaatgaa agccgaattc gcccctctcc ctcccccccc 2280 cctaacgtta ctggccgaag ccgcttggaa taaggccggt gtgcgtttgt ctatatgtta 2340 ttttccacca tattgccgtc ttttggcaat gtgagggccc ggaaacctgg ccctgtcttc 2400 ttgacgagca ttcctagggg tctttcccct ctcgccaaag gaatgcaagg tctgttgaat 2460 gtcgtgaagg aagcagttcc tctggaagct tcttgaagac aaacaacgtc tgtagcgacc 2520 ctttgcaggc agcggaaccc cccacctggc gacaggtgcc tctgcggcca aaagccacgt 2580 gtataagata cacctgcaaa ggcggcacaa ccccagtgcc acgttgtgag ttggatagtt 2640 gtggaaagag tcaaatggct ctcctcaagc gtattcaaca aggggctgaa ggatgcccag 2700 aaggtacccc attgtatggg atctgatctg gggcctcggt gcacatgctt tacatgtgtt 2760 tagtcgaggt taaaaaaacg tctaggcccc ccgaaccacg gggacgtggt tttcctttga 2820 aaaacacgat gataatatgg cctcctttgt ctctctgctc ctggtaggca tcctattcca 2880 tgccacccag gccgacatcc agctgaccca gtctccatca tctctgagcg catctgttgg 2940 agatagggtc actatgagct gtaagtccag tcaaagtgtt ttatacagtg caaatcacaa 3000 gaactacttg gcctggtacc agcagaaacc agggaaagca cctaaactgc tgatctactg 3060 ggcatccact agggaatctg gtgtcccttc gcgattctct ggcagcggat ctgggacaga 3120 ttttactttc accatcagct ctcttcaacc agaagacatt gcaacatatt attgtcacca 3180 atacctctcc tcgtggacgt tcggtggagg gaccaaggtg cagatcaaac gaactgtggc 3240 tgcaccatct gtcttcatct tcccgccatc tgatgagcag ttgaaatctg gaactgcctc 3300 tgttgtgtgc ctgctgaata acttctatcc cagagaggcc aaagtacagt ggaaggtgga 3360 taacgccctc caatcgggta actcccagga gagtgtcaca gagcaggaca gcaaggacag 3420 cacctacagc ctcagcagca ccctgacgct gagcaaagca gactacgaga aacacaaagt 3480 ctacgcctgc gaagtcaccc atcagggcct gagctcgccc gtcacaaaga gcttcaacag 3540 gggagagtgt tagagatcta ggcctcctag gtcgacatcg ataaaataaa agattttatt 3600 tagtctccag aaaaaggggg gaatgaaaga ccccacctgt aggtttggca agctagctta 3660 agtaacgcca ttttgcaagg catggaaaaa tacataactg agaatagaga agttcagatc 3720 aaggtcagga acagatggaa cagctgaata tgggccaaac aggatatctg tggtaagcag 3780 ttcctgcccc ggctcagggc caagaacaga tggaacagct gaatatgggc caaacaggat 3840 atctgtggta agcagttcct gccccggctc agggccaaga acagatggtc cccagatgcg 3900 gtccagccct cagcagtttc tagagaacca tcagatgttt ccagggtgcc ccaaggacct 3960 gaaatgaccc tgtgccttat ttgaactaac caatcagttc gcttctcgct tctgttcgcg 4020 cgcttctgct ccccgagctc aataaaagag cccacaaccc ctcactcggg gcgccagtcc 4080 tccgattgac tgagtcgccc gggtacccgt gtatccaata aaccctcttg cagttgcatc 4140 cgacttgtgg tctcgctgtt ccttgggagg gtctcctctg agtgattgac tacccgtcag 4200 gtctttcatt 4210 6 5732 DNA Artificial Sequence Synthetic 6 cgagcttggc agaaatggtt gaactcccga gagtgtccta cacctagggg agaagcagcc 60 aaggggttgt ttcccaccaa ggacgacccg tctgcgcaca aacggatgag cccatcagac 120 aaagacatat tcattctctg ctgcaaactt ggcatagctc tgctttgcct ggggctattg 180 ggggaagttg cggttcgtgc tcgcagggct ctcacccttg actctttcaa taataactct 240 tctgtgcaag attacaatct aaacaattcg gagaactcga ccttcctcct gaggcaagga 300 ccacagccaa cttcctctta caagccgcat cgattttgtc cttcagaaat agaaataaga 360 atgcttgcta aaaattatat ttttaccaat aagaccaatc caataggtag attattagtt 420 actatgttaa gaaatgaatc attatctttt agtactattt ttactcaaat tcagaagtta 480 gaaatgggaa tagaaaatag aaagagacgc tcaacctcaa ttgaagaaca ggtgcaagga 540 ctattgacca caggcctaga agtaaaaaag ggaaaaaaga gtgtttttgt caaaatagga 600 gacaggtggt ggcaaccagg gacttatagg ggaccttaca tctacagacc aacagatgcc 660 cccttaccat atacaggaag atatgactta aattgggata ggtgggttac agtcaatggc 720 tataaagtgt tatatagatc cctccccttt cgtgaaagac tcgccagagc tagacctcct 780 tggtgtatgt tgtctcaaga aaagaaagac gacatgaaac aacaggtaca tgattatatt 840 tatctaggaa caggaatgca cttttgggga aagattttcc ataccaagga ggggacagtg 900 gctggactaa tagaacatta ttctgcaaaa acttatggca tgagttatta tgattagcct 960 tgatttgccc aaccttgcgg ttcccaaggc ttaagtaagt ttttggttac aaactgttct 1020 taaaacaagg atgtgagaca agtggtttcc tgacttggtt tggtatcaaa ggttctgatc 1080 tgagctctga gtgttctatt ttcctatgtt cttttggaat ttatccaaat cttatgtaaa 1140 tgcttatgta aaccaagata taaaagagtg ctgatttttt gagtaaactt gcaacagtcc 1200 taacattcac ctcttgtgtg tttgtgtctg ttcgccatcc cgtctccgct cgtcacttat 1260 ccttcacttt ccagagggtc cccccgcaga ccccggcgac cctcaggtcg gccgactgcg 1320 gcagctggcg cccgaacagg gaccctcgga taagtgaccc ttgtctttat ttctactatt 1380 ttgtgttcgt cttgttttgt ctctatcttg tctggctatc atcacaagag cggaacggac 1440 tcacctcagg gaaccaagct agcccggggt cgacggatcc gattacttac tggcaggtgc 1500 tgggggcttc cgagacaatc gcgaacatct acaccacaca acaccgcctc gaccagggtg 1560 agatatcggc cggggacgcg gcggtggtaa ttacaagcga gatccgatta cttactggca 1620 ggtgctgggg gcttccgaga caatcgcgaa catctacacc acacaacacc gcctcgacca 1680 gggtgagata tcggccgggg acgcggcggt ggtaattaca agcgagatcc ccgggaattc 1740 aggacctcac catgggatgg agctgtatca tcctcttctt ggtagcaaca gctacaggtg 1800 tccactccga ggtccaactg gtggagagcg gtggaggtgt tgtgcaacct ggccggtccc 1860 tgcgcctgtc ctgctccgca tctggcttcg atttcaccac atattggatg agttgggtga 1920 gacaggcacc tggaaaaggt cttgagtgga ttggagaaat tcatccagat agcagtacga 1980 ttaactatgc gccgtctcta aaggatagat ttacaatatc gcgagacaac gccaagaaca 2040 cattgttcct gcaaatggac agcctgagac ccgaagacac cggggtctat ttttgtgcaa 2100 gcctttactt cggcttcccc tggtttgctt attggggcca agggaccccg gtcaccgtct 2160 cctcagcctc caccaagggc ccatcggtct tccccctggc accctcctcc aagagcacct 2220 ctgggggcac agcggccctg ggctgcctgg tcaaggacta cttccccgaa ccggtgacgg 2280 tgtcgtggaa ctcaggcgcc ctgaccagcg gcgtgcacac cttcccggct gtcctacagt 2340 cctcaggact ctactccctc agcagcgtgg tgaccgtgcc ctccagcagc ttgggcaccc 2400 agacctacat ctgcaacgtg aatcacaagc ccagcaacac caaggtggac aagagagttg 2460 agcccaaatc ttgtgacaaa actcacacat gcccaccgtg cccagcacct gaactcctgg 2520 ggggaccgtc agtcttcctc ttccccccaa aacccaagga caccctcatg atctcccgga 2580 cccctgaggt cacatgcgtg gtggtggacg tgagccacga agaccctgag gtcaagttca 2640 actggtacgt ggacggcgtg gaggtgcata atgccaagac aaagccgcgg gaggagcagt 2700 acaacagcac gtaccgtgtg gtcagcgtcc tcaccgtcct gcaccaggac tggctgaatg 2760 gcaaggagta caagtgcaag gtctccaaca aagccctccc agcccccatc gagaaaacca 2820 tctccaaagc caaagggcag ccccgagaac cacaggtgta caccctgccc ccatcccggg 2880 aggagatgac caagaaccag gtcagcctga cctgcctggt caaaggcttc tatcccagcg 2940 acatcgccgt ggagtgggag agcaatgggc agccggagaa caactacaag accacgcctc 3000 ccgtgctgga ctccgacggc tccttcttcc tctatagcaa gctcaccgtg gacaagagca 3060 ggtggcagca ggggaacgtc ttctcatgct ccgtgatgca cgaggctctg cacaaccact 3120 acacgcagaa gagcctctcc ctgtctcccg ggaaatgaaa gccgaattcg cccctctccc 3180 tccccccccc ctaacgttac tggccgaagc cgcttggaat aaggccggtg tgcgtttgtc 3240 tatatgttat tttccaccat attgccgtct tttggcaatg tgagggcccg gaaacctggc 3300 cctgtcttct tgacgagcat tcctaggggt ctttcccctc tcgccaaagg aatgcaaggt 3360 ctgttgaatg tcgtgaagga agcagttcct ctggaagctt cttgaagaca aacaacgtct 3420 gtagcgaccc tttgcaggca gcggaacccc ccacctggcg acaggtgcct ctgcggccaa 3480 aagccacgtg tataagatac acctgcaaag gcggcacaac cccagtgcca cgttgtgagt 3540 tggatagttg tggaaagagt caaatggctc tcctcaagcg tattcaacaa ggggctgaag 3600 gatgcccaga aggtacccca ttgtatggga tctgatctgg ggcctcggtg cacatgcttt 3660 acatgtgttt agtcgaggtt aaaaaaacgt ctaggccccc cgaaccacgg ggacgtggtt 3720 ttcctttgaa aaacacgatg ataatatggc ctcctttgtc tctctgctcc tggtaggcat 3780 cctattccat gccacccagg ccgacatcca gctgacccag agcccaagca gcctgagcgc 3840 cagcgtgggt gacagagtga ccatcacctg taaggccagt caggatgtgg gtacttctgt 3900 agcctggtac cagcagaagc caggtaaggc tccaaagctg ctgatctact ggacatccac 3960 ccggcacact ggtgtgccaa gcagattcag cggtagcggt agcggtaccg acttcacctt 4020 caccatcagc agcctccagc cagaggacat cgccacctac tactgccagc aatatagcct 4080 ctatcggtcg ttcggccaag ggaccaaggt ggaaatcaaa cgaactgtgg ctgcaccatc 4140 tgtcttcatc ttcccgccat ctgatgagca gttgaaatct ggaactgcct ctgttgtgtg 4200 cctgctgaat aacttctatc ccagagaggc caaagtacag tggaaggtgg ataacgccct 4260 ccaatcgggt aactcccagg agagtgtcac agagcaggac agcaaggaca gcacctacag 4320 cctcagcagc accctgacgc tgagcaaagc agactacgag aaacacaaag tctacgcctg 4380 cgaagtcacc catcagggcc tgagctcgcc cgtcacaaag agcttcaaca ggggagagtg 4440 ttagagatcc cccgggctgc aggaattcga tatcaagctt atcgataatc aacctctgga 4500 ttacaaaatt tgtgaaagat tgactggtat tcttaactat gttgctcctt ttacgctatg 4560 tggatacgct gctttaatgc ctttgtatca tgctattgct tcccgtatgg ctttcatttt 4620 ctcctccttg tataaatcct ggttgctgtc tctttatgag gagttgtggc ccgttgtcag 4680 gcaacgtggc gtggtgtgca ctgtgtttgc tgacgcaacc cccactggtt ggggcattgc 4740 caccacctgt cagctccttt ccgggacttt cgctttcccc ctccctattg ccacggcgga 4800 actcatcgcc gcctgccttg cccgctgctg gacaggggct cggctgttgg gcactgacaa 4860 ttccgtggtg ttgtcgggga aatcatcgtc ctttccttgg ctgctcgcct gtgttgccac 4920 ctggattctg cgcgggacgt ccttctgcta cgtcccttcg gccctcaatc cagcggacct 4980 tccttcccgc ggcctgctgc cggctctgcg gcctcttccg cgtcttcgcc ttcgccctca 5040 gacgagtcgg atctcccttt gggccgcctc cccgcctgat cgataccgtc aacatcgata 5100 aaataaaaga ttttatttag tctccagaaa aaggggggaa tgaaagaccc cacctgtagg 5160 tttggcaagc tagcttaagt aacgccattt tgcaaggcat ggaaaaatac ataactgaga 5220 atagagaagt tcagatcaag gtcaggaaca gatggaacag ctgaatatgg gccaaacagg 5280 atatctgtgg taagcagttc ctgccccggc tcagggccaa gaacagatgg aacagctgaa 5340 tatgggccaa acaggatatc tgtggtaagc agttcctgcc ccggctcagg gccaagaaca 5400 gatggtcccc agatgcggtc cagccctcag cagtttctag agaaccatca gatgtttcca 5460 gggtgcccca aggacctgaa atgaccctgt gccttatttg aactaaccaa tcagttcgct 5520 tctcgcttct gttcgcgcgc ttctgctccc cgagctcaat aaaagagccc acaacccctc 5580 actcggggcg ccagtcctcc gattgactga gtcgcccggg tacccgtgta tccaataaac 5640 cctcttgcag ttgcatccga cttgtggtct cgctgttcct tgggagggtc tcctctgagt 5700 gattgactac ccgtcagcgg gggtctttca tt 5732 7 9183 DNA Artificial Sequence Synthetic 7 aaagacccca cccgtaggtg gcaagctagc ttaagtaacg ccactttgca aggcatggaa 60 aaatacataa ctgagaatag aaaagttcag atcaaggtca ggaacaaaga aacagctgaa 120 taccaaacag gatatctgtg gtaagcggtt cctgccccgg ctcagggcca agaacagatg 180 agacagctga gtgatgggcc aaacaggata tctgtggtaa gcagttcctg ccccggctcg 240 gggccaagaa cagatggtcc ccagatgcgg tccagccctc agcagtttct agtgaatcat 300 cagatgtttc cagggtgccc caaggacctg aaaatgaccc tgtaccttat ttgaactaac 360 caatcagttc gcttctcgct tctgttcgcg cgcttccgct ctccgagctc aataaaagag 420 cccacaaccc ctcactcggc gcgccagtct tccgatagac tgcgtcgccc gggtacccgt 480 attcccaata aagcctcttg ctgtttgcat ccgaatcgtg gtctcgctgt tccttgggag 540 ggtctcctct gagtgattga ctacccacga cgggggtctt tcatttgggg gctcgtccgg 600 gatttggaga cccctgccca gggaccaccg acccaccacc gggaggtaag ctggccagca 660 acttatctgt gtctgtccga ttgtctagtg tctatgtttg atgttatgcg cctgcgtctg 720 tactagttag ctaactagct ctgtatctgg cggacccgtg gtggaactga cgagttctga 780 acacccggcc gcaaccctgg gagacgtccc agggactttg ggggccgttt ttgtggcccg 840 acctgaggaa gggagtcgat gtggaatccg accccgtcag gatatgtggt tctggtagga 900 gacgagaacc taaaacagtt cccgcctccg tctgaatttt tgctttcggt ttggaaccga 960 agccgcgcgt cttgtctgct gcagcgctgc agcatcgttc tgtgttgtct ctgtctgact 1020 gtgtttctgt atttgtctga aaattagggc cagactgtta ccactccctt aagtttgacc 1080 ttaggtcact ggaaagatgt cgagcggatc gctcacaacc agtcggtaga tgtcaagaag 1140 agacgttggg ttaccttctg ctctgcagaa tggccaacct ttaacgtcgg atggccgcga 1200 gacggcacct ttaaccgaga cctcatcacc caggttaaga tcaaggtctt ttcacctggc 1260 ccgcatggac acccagacca ggtcccctac atcgtgacct gggaagcctt ggcttttgac 1320 ccccctccct gggtcaagcc ctttgtacac cctaagcctc cgcctcctct tcctccatcc 1380 gccccgtctc tcccccttga acctcctcgt tcgaccccgc ctcgatcctc cctttatcca 1440 gccctcactc cttctctagg cgccggaatt ccgatctgat caagagacag gatgaggatc 1500 gtttcgcatg attgaacaag atggattgca cgcaggttct ccggccgctt gggtggagag 1560 gctattcggc tatgactggg cacaacagac aatcggctgc tctgatgccg ccgtgttccg 1620 gctgtcagcg caggggcgcc cggttctttt tgtcaagacc gacctgtccg gtgccctgaa 1680 tgaactgcag gacgaggcag cgcggctatc gtggctggcc acgacgggcg ttccttgcgc 1740 agctgtgctc gacgttgtca ctgaagcggg aagggactgg ctgctattgg gcgaagtgcc 1800 ggggcaggat ctcctgtcat ctcaccttgc tcctgccgag aaagtatcca tcatggctga 1860 tgcaatgcgg cggctgcata cgcttgatcc ggctacctgc ccattcgacc accaagcgaa 1920 acatcgcatc gagcgagcac gtactcggat ggaagccggt cttgtcgatc aggatgatct 1980 ggacgaagag catcaggggc tcgcgccagc cgaactgttc gccaggctca aggcgcgcat 2040 gcccgacggc gaggatctcg tcgtgaccca tggcgatgcc tgcttgccga atatcatggt 2100 ggaaaatggc cgcttttctg gattcatcga ctgtggccgg ctgggtgtgg cggaccgcta 2160 tcaggacata gcgttggcta cccgtgatat tgctgaagag cttggcggcg aatgggctga 2220 ccgcttcctc gtgctttacg gtatcgccgc tcccgattcg cagcgcatcg ccttctatcg 2280 ccttcttgac gagttcttct gagcgggact ctggggttcg aaatgaccga ccaagcgacg 2340 cccaacctgc catcacgaga tttcgattcc accgccgcct tctatgaaag gttgggcttc 2400 ggaatcgttt tccgggacgc cggctggatg atcctccagc gcggggatct catgctggag 2460 ttcttcgccc accccgggct cgatcccctc gcgagttggt tcagctgctg cctgaggctg 2520 gacgacctcg cggagttcta ccggcagtgc aaatccgtcg gcatccagga aaccagcagc 2580 ggctatccgc gcatccatgc ccccgaactg caggagtggg gaggcacgat ggccgctttg 2640 gtcgaggcgg atcctagaac tagcgaaaat gcaagagcaa agacgaaaac atgccacaca 2700 tgaggaatac cgattctctc attaacatat tcaggccagt tatctgggct taaaagcaga 2760 agtccaaccc agataacgat catatacatg gttctctcca gaggttcatt actgaacact 2820 cgtccgagaa taacgagtgg atcagtcctg ggtggtcatt gaaaggactg atgctgaagt 2880 tgaagctcca atactttggc cacctgatgc gaagaactga ctcatgtgat aagaccctga 2940 tactgggaaa gattgaaggc aggaggagaa gggatgacag aggatggaag agttggatgg 3000 aatcaccaac tcgatggaca tgagtttgag caagcttcca ggagttggta atgggcaggg 3060 aagcctggcg tgctgcagtc catggggttg caaagagttg gacactactg agtgactgaa 3120 ctgaactgat agtgtaatcc atggtacaga atataggata aaaaagagga agagtttgcc 3180 ctgattctga agagttgtag gatataaaag tttagaatac ctttagtttg gaagtcttaa 3240 attatttact taggatgggt acccactgca atataagaaa tcaggcttta gagactgatg 3300 tagagagaat gagccctggc ataccagaag ctaacagcta ttggttatag ctgttataac 3360 caatatataa ccaatatatt ggttatatag catgaagctt gatgccagca atttgaagga 3420 accatttaga actagtatcc taaactctac atgttccagg acactgatct taaagctcag 3480 gttcagaatc ttgttttata ggctctaggt gtatattgtg gggcttccct ggtggctcag 3540 atggtaaagt gtctgcctgc aatgtgggtg atctgggttc gatccctggc ttgggaagat 3600 cccctggaga aggaaatggc aacccactct agtactctta cctggaaaat tccatggaca 3660 gaggagcctt gtaagctaca gtccatggga ttgcaaagag ttgaacacaa ctgagcaact 3720 aagcacagca cagtacagta tacacctgtg aggtgaagtg aagtgaaggt tcaatgcagg 3780 gtctcctgca ttgcagaaag attctttacc atctgagcca ccagggaagc ccaagaatac 3840 tggagtgggt agcctattcc ttctccaggg gatcttccca tcccaggaat tgaactggag 3900 tctcctgcat ttcaggtgga ttcttcacca gctgaactac caggtggata ctactccaat 3960 attaaagtgc ttaaagtcca gttttcccac ctttcccaaa aaggttgggt cactcttttt 4020 taaccttctg tggcctactc tgaggctgtc tacaagctta tatatttatg aacacattta 4080 ttgcaagttg ttagttttag atttacaatg tggtatctgg ctatttagtg gtattggtgg 4140 ttggggatgg ggaggctgat agcatctcag agggcagcta gatactgtca tacacacttt 4200 tcaagttctc catttttgtg aaatagaaag tctctggatc taagttatat gtgattctca 4260 gtctctgtgg tcatattcta ttctactcct gaccactcaa caaggaacca agatatcaag 4320 ggacacttgt tttgtttcat gcctgggttg agtgggccat gacatatgtt ctgggccttg 4380 ttacatggct ggattggttg gacaagtgcc agctctgatc ctgggactgt ggcatgtgat 4440 gacatacacc ccctctccac attctgcatg tctctagggg ggaaggggga agctcggtat 4500 agaaccttta ttgtattttc tgattgcctc acttcttata ttgcccccat gcccttcttt 4560 gttcctcaag taaccagaga cagtgcttcc cagaaccaac cctacaagaa acaaagggct 4620 aaacaaagcc aaatgggaag caggatcatg gtttgaactc tttctggcca gagaacaata 4680 cctgctatgg actagatact gggagaggga aaggaaaagt agggtgaatt atggaaggaa 4740 gctggcaggc tcagcgtttc tgtcttggca tgaccagtct ctcttcattc tcttcctaga 4800 tgtagggctt ggtaccagag cccctgaggc tttctgcatg aatataaata tatgaaactg 4860 agtgatgctt ccatttcagg ttcttggggg cgccgaattc gagctcggta cccggggatc 4920 tcgacggatc cgattactta ctggcaggtg ctgggggctt ccgagacaat cgcgaacatc 4980 tacaccacac aacaccgcct cgaccagggt gagatatcgg ccggggacgc ggcggtggta 5040 attacaagcg agatccgatt acttactggc aggtgctggg ggcttccgag acaatcgcga 5100 acatctacac cacacaacac cgcctcgacc agggtgagat atcggccggg gacgcggcgg 5160 tggtaattac aagcgagatc cccgggaatt caggacctca ccatgggatg gagctgtatc 5220 atcctcttct tggtagcaac agctacaggt gtccactccg aggtccaact ggtggagagc 5280 ggtggaggtg ttgtgcaacc tggccggtcc ctgcgcctgt cctgctccgc atctggcttc 5340 gatttcacca catattggat gagttgggtg agacaggcac ctggaaaagg tcttgagtgg 5400 attggagaaa ttcatccaga tagcagtacg attaactatg cgccgtctct aaaggataga 5460 tttacaatat cgcgagacaa cgccaagaac acattgttcc tgcaaatgga cagcctgaga 5520 cccgaagaca ccggggtcta tttttgtgca agcctttact tcggcttccc ctggtttgct 5580 tattggggcc aagggacccc ggtcaccgtc tcctcagcct ccaccaaggg cccatcggtc 5640 ttccccctgg caccctcctc caagagcacc tctgggggca cagcggccct gggctgcctg 5700 gtcaaggact acttccccga accggtgacg gtgtcgtgga actcaggcgc cctgaccagc 5760 ggcgtgcaca ccttcccggc tgtcctacag tcctcaggac tctactccct cagcagcgtg 5820 gtgaccgtgc cctccagcag cttgggcacc cagacctaca tctgcaacgt gaatcacaag 5880 cccagcaaca ccaaggtgga caagagagtt gagcccaaat cttgtgacaa aactcacaca 5940 tgcccaccgt gcccagcacc tgaactcctg gggggaccgt cagtcttcct cttcccccca 6000 aaacccaagg acaccctcat gatctcccgg acccctgagg tcacatgcgt ggtggtggac 6060 gtgagccacg aagaccctga ggtcaagttc aactggtacg tggacggcgt ggaggtgcat 6120 aatgccaaga caaagccgcg ggaggagcag tacaacagca cgtaccgtgt ggtcagcgtc 6180 ctcaccgtcc tgcaccagga ctggctgaat ggcaaggagt acaagtgcaa ggtctccaac 6240 aaagccctcc cagcccccat cgagaaaacc atctccaaag ccaaagggca gccccgagaa 6300 ccacaggtgt acaccctgcc cccatcccgg gaggagatga ccaagaacca ggtcagcctg 6360 acctgcctgg tcaaaggctt ctatcccagc gacatcgccg tggagtggga gagcaatggg 6420 cagccggaga acaactacaa gaccacgcct cccgtgctgg actccgacgg ctccttcttc 6480 ctctatagca agctcaccgt ggacaagagc aggtggcagc aggggaacgt cttctcatgc 6540 tccgtgatgc acgaggctct gcacaaccac tacacgcaga agagcctctc cctgtctccc 6600 gggaaatgaa agccgaattc gcccctctcc ctcccccccc cctaacgtta ctggccgaag 6660 ccgcttggaa taaggccggt gtgcgtttgt ctatatgtta ttttccacca tattgccgtc 6720 ttttggcaat gtgagggccc ggaaacctgg ccctgtcttc ttgacgagca ttcctagggg 6780 tctttcccct ctcgccaaag gaatgcaagg tctgttgaat gtcgtgaagg aagcagttcc 6840 tctggaagct tcttgaagac aaacaacgtc tgtagcgacc ctttgcaggc agcggaaccc 6900 cccacctggc gacaggtgcc tctgcggcca aaagccacgt gtataagata cacctgcaaa 6960 ggcggcacaa ccccagtgcc acgttgtgag ttggatagtt gtggaaagag tcaaatggct 7020 ctcctcaagc gtattcaaca aggggctgaa ggatgcccag aaggtacccc attgtatggg 7080 atctgatctg gggcctcggt gcacatgctt tacatgtgtt tagtcgaggt taaaaaaacg 7140 tctaggcccc ccgaaccacg gggacgtggt tttcctttga aaaacacgat gataatatgg 7200 cctcctttgt ctctctgctc ctggtaggca tcctattcca tgccacccag gccgacatcc 7260 agctgaccca gagcccaagc agcctgagcg ccagcgtggg tgacagagtg accatcacct 7320 gtaaggccag tcaggatgtg ggtacttctg tagcctggta ccagcagaag ccaggtaagg 7380 ctccaaagct gctgatctac tggacatcca cccggcacac tggtgtgcca agcagattca 7440 gcggtagcgg tagcggtacc gacttcacct tcaccatcag cagcctccag ccagaggaca 7500 tcgccaccta ctactgccag caatatagcc tctatcggtc gttcggccaa gggaccaagg 7560 tggaaatcaa acgaactgtg gctgcaccat ctgtcttcat cttcccgcca tctgatgagc 7620 agttgaaatc tggaactgcc tctgttgtgt gcctgctgaa taacttctat cccagagagg 7680 ccaaagtaca gtggaaggtg gataacgccc tccaatcggg taactcccag gagagtgtca 7740 cagagcagga cagcaaggac agcacctaca gcctcagcag caccctgacg ctgagcaaag 7800 cagactacga gaaacacaaa gtctacgcct gcgaagtcac ccatcagggc ctgagctcgc 7860 ccgtcacaaa gagcttcaac aggggagagt gttagagatc ccccgggctg caggaattcg 7920 atatcaagct tatcgataat caacctctgg attacaaaat ttgtgaaaga ttgactggta 7980 ttcttaacta tgttgctcct tttacgctat gtggatacgc tgctttaatg cctttgtatc 8040 atgctattgc ttcccgtatg gctttcattt tctcctcctt gtataaatcc tggttgctgt 8100 ctctttatga ggagttgtgg cccgttgtca ggcaacgtgg cgtggtgtgc actgtgtttg 8160 ctgacgcaac ccccactggt tggggcattg ccaccacctg tcagctcctt tccgggactt 8220 tcgctttccc cctccctatt gccacggcgg aactcatcgc cgcctgcctt gcccgctgct 8280 ggacaggggc tcggctgttg ggcactgaca attccgtggt gttgtcgggg aaatcatcgt 8340 cctttccttg gctgctcgcc tgtgttgcca cctggattct gcgcgggacg tccttctgct 8400 acgtcccttc ggccctcaat ccagcggacc ttccttcccg cggcctgctg ccggctctgc 8460 ggcctcttcc gcgtcttcgc cttcgccctc agacgagtcg gatctccctt tgggccgcct 8520 ccccgcctga tcgataccgt caacatcgat aaaataaaag attttattta gtctccagaa 8580 aaagggggga atgaaagacc ccacctgtag gtttggcaag ctagcttaag taacgccatt 8640 ttgcaaggca tggaaaaata cataactgag aatagagaag ttcagatcaa ggtcaggaac 8700 agatggaaca gctgaatatg ggccaaacag gatatctgtg gtaagcagtt cctgccccgg 8760 ctcagggcca agaacagatg gaacagctga atatgggcca aacaggatat ctgtggtaag 8820 cagttcctgc cccggctcag ggccaagaac agatggtccc cagatgcggt ccagccctca 8880 gcagtttcta gagaaccatc agatgtttcc agggtgcccc aaggacctga aatgaccctg 8940 tgccttattt gaactaacca atcagttcgc ttctcgcttc tgttcgcgcg cttctgctcc 9000 ccgagctcaa taaaagagcc cacaacccct cactcggggc gccagtcctc cgattgactg 9060 agtcgcccgg gtacccgtgt atccaataaa ccctcttgca gttgcatccg acttgtggtc 9120 tcgctgttcc ttgggagggt ctcctctgag tgattgacta cccgtcagcg ggggtctttc 9180 att 9183 8 5711 DNA Artificial Sequence Synthetic 8 gatcagtcct gggtggtcat tgaaaggact gatgctgaag ttgaagctcc aatactttgg 60 ccacctgatg cgaagaactg actcatgtga taagaccctg atactgggaa agattgaagg 120 caggaggaga agggatgaca gaggatggaa gagttggatg gaatcaccaa ctcgatggac 180 atgagtttga gcaagcttcc aggagttggt aatgggcagg gaagcctggc gtgctgcagt 240 ccatggggtt gcaaagagtt ggacactact gagtgactga actgaactga tagtgtaatc 300 catggtacag aatataggat aaaaaagagg aagagtttgc cctgattctg aagagttgta 360 ggatataaaa gtttagaata cctttagttt ggaagtctta aattatttac ttaggatggg 420 tacccactgc aatataagaa atcaggcttt agagactgat gtagagagaa tgagccctgg 480 cataccagaa gctaacagct attggttata gctgttataa ccaatatata accaatatat 540 tggttatata gcatgaagct tgatgccagc aatttgaagg aaccatttag aactagtatc 600 ctaaactcta catgttccag gacactgatc ttaaagctca ggttcagaat cttgttttat 660 aggctctagg tgtatattgt ggggcttccc tggtggctca gatggtaaag tgtctgcctg 720 caatgtgggt gatctgggtt cgatccctgg cttgggaaga tcccctggag aaggaaatgg 780 caacccactc tagtactctt acctggaaaa ttccatggac agaggagcct tgtaagctac 840 agtccatggg attgcaaaga gttgaacaca actgagcaac taagcacagc acagtacagt 900 atacacctgt gaggtgaagt gaagtgaagg ttcaatgcag ggtctcctgc attgcagaaa 960 gattctttac catctgagcc accagggaag cccaagaata ctggagtggg tagcctattc 1020 cttctccagg ggatcttccc atcccaggaa ttgaactgga gtctcctgca tttcaggtgg 1080 attcttcacc agctgaacta ccaggtggat actactccaa tattaaagtg cttaaagtcc 1140 agttttccca cctttcccaa aaaggttggg tcactctttt ttaaccttct gtggcctact 1200 ctgaggctgt ctacaagctt atatatttat gaacacattt attgcaagtt gttagtttta 1260 gatttacaat gtggtatctg gctatttagt ggtattggtg gttggggatg gggaggctga 1320 tagcatctca gagggcagct agatactgtc atacacactt ttcaagttct ccatttttgt 1380 gaaatagaaa gtctctggat ctaagttata tgtgattctc agtctctgtg gtcatattct 1440 attctactcc tgaccactca acaaggaacc aagatatcaa gggacacttg ttttgtttca 1500 tgcctgggtt gagtgggcca tgacatatgt tctgggcctt gttacatggc tggattggtt 1560 ggacaagtgc cagctctgat cctgggactg tggcatgtga tgacatacac cccctctcca 1620 cattctgcat gtctctaggg gggaaggggg aagctcggta tagaaccttt attgtatttt 1680 ctgattgcct cacttcttat attgccccca tgcccttctt tgttcctcaa gtaaccagag 1740 acagtgcttc ccagaaccaa ccctacaaga aacaaagggc taaacaaagc caaatgggaa 1800 gcaggatcat ggtttgaact ctttctggcc agagaacaat acctgctatg gactagatac 1860 tgggagaggg aaaggaaaag tagggtgaat tatggaagga agctggcagg ctcagcgttt 1920 ctgtcttggc atgaccagtc tctcttcatt ctcttcctag atgtagggct tggtaccaga 1980 gcccctgagg ctttctgcat gaatataaat atatgaaact gagtgatgct tccatttcag 2040 gttcttgggg gcgccgaatt cgagctcggt acccggggat ctcgacggat ccgattactt 2100 actggcaggt gctgggggct tccgagacaa tcgcgaacat ctacaccaca caacaccgcc 2160 tcgaccaggg tgagatatcg gccggggacg cggcggtggt aattacaagc gagatccgat 2220 tacttactgg caggtgctgg gggcttccga gacaatcgcg aacatctaca ccacacaaca 2280 ccgcctcgac cagggtgaga tatcggccgg ggacgcggcg gtggtaatta caagcgagat 2340 ctcgagaagc ttgttgggaa ttcaggccat cgatcccgcc gccaccatgg aatggagctg 2400 ggtctttctc ttcttcctgt cagtaactac aggtgtccac tccgacatcc agatgaccca 2460 gtctccagcc tccctatctg catctgtggg agaaactgtc actatcacat gtcgagcaag 2520 tgggaatatt cacaattatt tagcatggta tcagcagaaa cagggaaaat ctcctcagct 2580 cctggtctat aatgcaaaaa ccttagcaga tggtgtgcca tcaaggttca gtggcagtgg 2640 atcaggaaca caatattctc tcaagatcaa cagcctgcag cctgaagatt ttgggagtta 2700 ttactgtcaa catttttgga gtactccgtg gacgttcggt ggaggcacca agctggaaat 2760 caaacgggct gatgctgcac caactgtatc catcttccca ccatccagtg agcagttaac 2820 atctggaggt gcctcagtcg tgtgcttctt gaacaacttc taccccaaag acatcaatgt 2880 caagtggaag attgatggca gtgaacgaca aaatggcgtc ctgaacagtt ggactgatca 2940 ggacagcaaa gacagcacct acagcatgag cagcaccctc acattgacca aggacgagta 3000 tgaacgacat aacagctata cctgtgaggc cactcacaag acatcaactt cacccattgt 3060 caagagcttc aacaggaatg agtgttgaaa gcatcgattt cccctgaatt cgcccctctc 3120 cctccccccc ccctaacgtt actggccgaa gccgcttgga ataaggccgg tgtgcgtttg 3180 tctatatgtt attttccacc atattgccgt cttttggcaa tgtgagggcc cggaaacctg 3240 gccctgtctt cttgacgagc attcctaggg gtctttcccc tctcgccaaa ggaatgcaag 3300 gtctgttgaa tgtcgtgaag gaagcagttc ctctggaagc ttcttgaaga caaacaacgt 3360 ctgtagcgac cctttgcagg cagcggaacc ccccacctgg cgacaggtgc ctctgcggcc 3420 aaaagccacg tgtataagat acacctgcaa aggcggcaca accccagtgc cacgttgtga 3480 gttggatagt tgtggaaaga gtcaaatggc tctcctcaag cgtattcaac aaggggctga 3540 aggatgccca gaaggtaccc cattgtatgg gatctgatct ggggcctcgg tgcacatgct 3600 ttacatgtgt ttagtcgagg ttaaaaaaac gtctaggccc cccgaaccac ggggacgtgg 3660 ttttcctttg aaaaacacga tgataatatg gcctcctttg tctctctgct cctggtaggc 3720 atcctattcc atgccaccca ggccgaggtt cagcttcagc agtctggggc agagcttgtg 3780 aagccagggg cctcagtcaa gttgtcctgc acagcttctg gcttcaacat taaagacacc 3840 tttatgcact gggtgaagca gaggcctgaa cagggcctgg agtggattgg aaggattgat 3900 cctgcgaatg ggaatactga atatgacccg aagttccagg gcaaggccac tataacagca 3960 gacacatcct ccaacacagt caacctgcag ctcagcagcc tgacatctga ggacactgcc 4020 gtctattact gtgctagtgg aggggaactg gggtttcctt actggggcca agggactctg 4080 gtcactgtct ctgcagccaa aacgacaccc ccatctgtct atccactggc ccctggatct 4140 gctgcccaaa ctaactccat ggtgaccctg ggatgcctgg tcaagggcta tttccctgag 4200 ccagtgacag tgacctggaa ctctggatcc ctgtccagcg gtgtgcacac cttcccagct 4260 gtcctgcagt ttgacctcta cactctgagc agctcagtga ctgtcccctc cagcacctgg 4320 cccagcgaga ccgtcacctg caacgttgcc cacccggcca gcagcaccaa ggtggacaag 4380 aaaattgtgc ccagggattg tactagtgga ggtggaggta gccaccatca ccatcaccat 4440 taatctagag ttaagcggcc gtcgagatct cgacatcgat aatcaacctc tggattacaa 4500 aatttgtgaa agattgactg gtattcttaa ctatgttgct ccttttacgc tatgtggata 4560 cgctgcttta atgcctttgt atcatgctat tgcttcccgt atggctttca ttttctcctc 4620 cttgtataaa tcctggttgc tgtctcttta tgaggagttg tggcccgttg tcaggcaacg 4680 tggcgtggtg tgcactgtgt ttgctgacgc aacccccact ggttggggca ttgccaccac 4740 ctgtcagctc ctttccggga ctttcgcttt ccccctccct attgccacgg cggaactcat 4800 cgccgcctgc cttgcccgct gctggacagg ggctcggctg ttgggcactg acaattccgt 4860 ggtgttgtcg gggaaatcat cgtcctttcc ttggctgctc gcctgtgttg ccacctggat 4920 tctgcgcggg acgtccttct gctacgtccc ttcggccctc aatccagcgg accttccttc 4980 ccgcggcctg ctgccggctc tgcggcctct tccgcgtctt cgccttcgcc ctcagacgag 5040 tcggatctcc ctttgggccg cctccccgcc tgatcgataa aataaaagat tttatttagt 5100 ctccagaaaa aggggggaat gaaagacccc acctgtaggt ttggcaagct agcttaagta 5160 acgccatttt gcaaggcatg gaaaaataca taactgagaa tagagaagtt cagatcaagg 5220 tcaggaacag atggaacagc tgaatatggg ccaaacagga tatctgtggt aagcagttcc 5280 tgccccggct cagggccaag aacagatgga acagctgaat atgggccaaa caggatatct 5340 gtggtaagca gttcctgccc cggctcaggg ccaagaacag atggtcccca gatgcggtcc 5400 agccctcagc agtttctaga gaaccatcag atgtttccag ggtgccccaa ggacctgaaa 5460 tgaccctgtg ccttatttga actaaccaat cagttcgctt ctcgcttctg ttcgcgcgct 5520 tctgctcccc gagctcaata aaagagccca caacccctca ctcggggcgc cagtcctccg 5580 attgactgag tcgcccgggt acccgtgtat ccaataaacc ctcttgcagt tgcatccgac 5640 ttgtggtctc gctgttcctt gggagggtct cctctgagtg attgactacc cgtcagcggg 5700 ggtctttcat t 5711 9 5130 DNA Artificial Sequence Synthetic 9 tttgaaagac cccacccgta ggtggcaagc tagcttaagt aacgccactt tgcaaggcat 60 ggaaaaatac ataactgaga atagaaaagt tcagatcaag gtcaggaaca aagaaacagc 120 tgaataccaa acaggatatc tgtggtaagc ggttcctgcc ccggctcagg gccaagaaca 180 gatgagacag ctgagtgatg ggccaaacag gatatctgtg gtaagcagtt cctgccccgg 240 ctcggggcca agaacagatg gtccccagat gcggtccagc cctcagcagt ttctagtgaa 300 tcatcagatg tttccagggt gccccaagga cctgaaaatg accctgtacc ttatttgaac 360 taaccaatca gttcgcttct cgcttctgtt cgcgcgcttc cgctctccga gctcaataaa 420 agagcccaca acccctcact cggcgcgcca gtcttccgat agactgcgtc gcccgggtac 480 ccgtattccc aataaagcct cttgctgttt gcatccgaat cgtggtctcg ctgttccttg 540 ggagggtctc ctctgagtga ttgactaccc acgacggggg tctttcattt gggggctcgt 600 ccgggatttg gagacccctg cccagggacc accgacccac caccgggagg taagctggcc 660 agcaacttat ctgtgtctgt ccgattgtct agtgtctatg tttgatgtta tgcgcctgcg 720 tctgtactag ttagctaact agctctgtat ctggcggacc cgtggtggaa ctgacgagtt 780 ctgaacaccc ggccgcaacc ctgggagacg tcccagggac tttgggggcc gtttttgtgg 840 cccgacctga ggaagggagt cgatgtggaa tccgaccccg tcaggatatg tggttctggt 900 aggagacgag aacctaaaac agttcccgcc tccgtctgaa tttttgcttt cggtttggaa 960 ccgaagccgc gcgtcttgtc tgctgcagcc aagcttgggc tgcaggtcga ggactgggga 1020 ccctgcaccg aacatggaga acacaacatc aggattccta ggacccctgc tcgtgttaca 1080 ggcggggttt ttcttgttga caagaatcct cacaatacca cagagtctag actcgtggtg 1140 gacttctctc aattttctag ggggagcacc cacgtgtcct ggccaaaatt cgcagtcccc 1200 aacctccaat cactcaccaa cctcttgtcc tccaatttgt cctggctatc gctggatgtg 1260 tctgcggcgt tttatcatat tcctcttcat cctgctgcta tgcctcatct tcttgttggt 1320 tcttctggac taccaaggta tgttgcccgt ttgtcctcta cttccaggaa catcaactac 1380 cagcacggga ccatgcaaga cctgcacgat tcctgctcaa ggaacctcta tgtttccctc 1440 ttgttgctgt acaaaacctt cggacggaaa ctgcacttgt attcccatcc catcatcctg 1500 ggctttcgca agattcctat gggagtgggc ctcagtccgt ttctcctggc tcagtttact 1560 agtgccattt gttcagtggt tcgtagggct ttcccccact gtttggcttt cagttatatg 1620 gatgatgtgg tattgggggc caagtctgta caacatcttg agtccctttt tacctctatt 1680 accaattttc ttttgtcttt gggtatacat ttaaacccta ataaaaccaa acgttggggc 1740 tactccctta acttcatggg atatgtaatt ggatgttggg gtactttacc gcaagaacat 1800 attgtactaa aaatcaagca atgttttcga aaactgcctg taaatagacc tattgattgg 1860 aaagtatgtc agagacttgt gggtcttttg ggctttgctg ccccttttac acaatgtggc 1920 tatcctgcct taatgccttt atatgcatgt atacaatcta agcaggcttt cactttctcg 1980 ccaacttaca aggcctttct gtgtaaacaa tatctgaacc tttaccccgt tgcccggcaa 2040 cggtcaggtc tctgccaagt gtttgctgac gcaaccccca ctggatgggg cttggctatc 2100 ggccatagcc gcatgcgcgg acctttgtgg ctcctctgcc gatccatact gcggaactcc 2160 tagcagcttg ttttgctcgc aggcggtctg gagcgaaact tatcggcacc gacaactctg 2220 ttgtcctctc tcggaaatac acctcctttc catggctgct agggtgtgct gccaactgga 2280 tcccctcagg atatagtagt ttcgcttttg catagggagg gggaaatgta gtcttatgca 2340 atacacttgt agtcttgcaa catggtaacg atgagttagc aacatgcctt acaaggagag 2400 aaaaagcacc gtgcatgccg attggtggaa gtaaggtggt acgatcgtgc cttattagga 2460 aggcaacaga caggtctgac atggattgga cgaaccactg aattccgcat tgcagagata 2520 attgtattta agtgcctagc tcgatacagc aaacgccatt tttgaccatt caccacattg 2580 gtgtgcacct tccaaagctt cacgctgccg caagcactca gggcgcaagg gctgctaaag 2640 gaagcggaac acgtagaaag ccagtccgca gaaacggtgc tgaccccgga tgaatgtcag 2700 ctactgggct atctggacaa gggaaaacgc aagcgcaaag agaaagcagg tagcttgcag 2760 tgggcttaca tggcgatagc tagactgggc ggttttatgg acagcaagcg aaccggaatt 2820 gccagctggg gcgccctctg gtaaggttgg gaagccctgc aaagtaaact ggatggcttt 2880 cttgccgcca aggatctgat ggcgcagggg atcaagatct gatcaagaga caggatgagg 2940 atcgtttcgc atgattgaac aagatggatt gcacgcaggt tctccggccg cttgggtgga 3000 gaggctattc ggctatgact gggcacaaca gacaatcggc tgctctgatg ccgccgtgtt 3060 ccggctgtca gcgcaggggc gcccggttct ttttgtcaag accgacctgt ccggtgccct 3120 gaatgaactg caggacgagg cagcgcggct atcgtggctg gccacgacgg gcgttccttg 3180 cgcagctgtg ctcgacgttg tcactgaagc gggaagggac tggctgctat tgggcgaagt 3240 gccggggcag gatctcctgt catctcacct tgctcctgcc gagaaagtat ccatcatggc 3300 tgatgcaatg cggcggctgc atacgcttga tccggctacc tgcccattcg accaccaagc 3360 gaaacatcgc atcgagcgag cacgtactcg gatggaagcc ggtcttgtcg atcaggatga 3420 tctggacgaa gagcatcagg ggctcgcgcc agccgaactg ttcgccaggc tcaaggcgcg 3480 catgcccgac ggcgaggatc tcgtcgtgac ccatggcgat gcctgcttgc cgaatatcat 3540 ggtggaaaat ggccgctttt ctggattcat cgactgtggc cggctgggtg tggcggaccg 3600 ctatcaggac atagcgttgg ctacccgtga tattgctgaa gagcttggcg gcgaatgggc 3660 tgaccgcttc ctcgtgcttt acggtatcgc cgctcccgat tcgcagcgca tcgccttcta 3720 tcgccttctt gacgagttct tctgagcggg actctggggt tcgaaatgac cgaccaagcg 3780 acgcccaacc tgccatcacg agatttcgat tccaccgccg ccttctatga aaggttgggc 3840 ttcggaatcg ttttccggga cgccggctgg atgatcctcc agcgcgggga tctcatgctg 3900 gagttcttcg cccaccccaa ccctggccct attattgggt ggactaacca tggggggaat 3960 tgccgctgga ataggaacag ggactactgc tctaatggcc actcagcaat tccagcagct 4020 ccaagccgca gtacaggatg atctcaggga ggttgaaaaa tcaatctcta acctagaaaa 4080 gtctctcact tccctgtctg aagttgtcct acagaatcga aggggcctag acttgttatt 4140 tctaaaagaa ggagggctgt gtgctgctct aaaagaagaa tgttgcttct atgcggacca 4200 cacaggacta gtgagagaca gcatggccaa attgagagag aggcttaatc agagacagaa 4260 actgtttgag tcaactcaag gatggtttga gggactgttt aacagatccc cttggtttac 4320 caccttgata tctaccatta tgggacccct cattgtactc ctaatgattt tgctcttcgg 4380 accctgcatt cttaatcgat tagtccaatt tgttaaagac aggatatcag tggtccaggc 4440 tctagttttg actcaacaat atcaccagct gaagcctata gagtacgagc catagataaa 4500 ataaaagatt ttatttagtc tccagaaaaa ggggggaatg aaagacccca cctgtaggtt 4560 tggcaagcta gcttaagtaa cgccattttg caaggcatgg aaaaatacat aactgagaat 4620 agagaagttc agatcaaggt caggaacaga tggaacagct gaatatgggc caaacaggat 4680 atctgtggta agcagttcct gccccggctc agggccaaga acagatggaa cagctgaata 4740 tgggccaaac aggatatctg tggtaagcag ttcctgcccc ggctcagggc caagaacaga 4800 tggtccccag atgcggtcca gccctcagca gtttctagag aaccatcaga tgtttccagg 4860 gtgccccaag gacctgaaat gaccctgtgc cttatttgaa ctaaccaatc agttcgcttc 4920 tcgcttctgt tcgcgcgctt ctgctccccg agctcaataa aagagcccac aacccctcac 4980 tcggggcgcc agtcctccga ttgactgagt cgcccgggta cccgtgtatc caataaaccc 5040 tcttgcagtt gcatccgact tgtggtctcg ctgttccttg ggagggtctc ctctgagtga 5100 ttgactaccc gtcagcgggg gtctttcatt 5130 10 4661 DNA Artificial Sequence Synthetic 10 gatcagtcct gggtggtcat tgaaaggact gatgctgaag ttgaagctcc aatactttgg 60 ccacctgatg cgaagaactg actcatgtga taagaccctg atactgggaa agattgaagg 120 caggaggaga agggatgaca gaggatggaa gagttggatg gaatcaccaa ctcgatggac 180 atgagtttga gcaagcttcc aggagttggt aatgggcagg gaagcctggc gtgctgcagt 240 ccatggggtt gcaaagagtt ggacactact gagtgactga actgaactga tagtgtaatc 300 catggtacag aatataggat aaaaaagagg aagagtttgc cctgattctg aagagttgta 360 ggatataaaa gtttagaata cctttagttt ggaagtctta aattatttac ttaggatggg 420 tacccactgc aatataagaa atcaggcttt agagactgat gtagagagaa tgagccctgg 480 cataccagaa gctaacagct attggttata gctgttataa ccaatatata accaatatat 540 tggttatata gcatgaagct tgatgccagc aatttgaagg aaccatttag aactagtatc 600 ctaaactcta catgttccag gacactgatc ttaaagctca ggttcagaat cttgttttat 660 aggctctagg tgtatattgt ggggcttccc tggtggctca gatggtaaag tgtctgcctg 720 caatgtgggt gatctgggtt cgatccctgg cttgggaaga tcccctggag aaggaaatgg 780 caacccactc tagtactctt acctggaaaa ttccatggac agaggagcct tgtaagctac 840 agtccatggg attgcaaaga gttgaacaca actgagcaac taagcacagc acagtacagt 900 atacacctgt gaggtgaagt gaagtgaagg ttcaatgcag ggtctcctgc attgcagaaa 960 gattctttac catctgagcc accagggaag cccaagaata ctggagtggg tagcctattc 1020 cttctccagg ggatcttccc atcccaggaa ttgaactgga gtctcctgca tttcaggtgg 1080 attcttcacc agctgaacta ccaggtggat actactccaa tattaaagtg cttaaagtcc 1140 agttttccca cctttcccaa aaaggttggg tcactctttt ttaaccttct gtggcctact 1200 ctgaggctgt ctacaagctt atatatttat gaacacattt attgcaagtt gttagtttta 1260 gatttacaat gtggtatctg gctatttagt ggtattggtg gttggggatg gggaggctga 1320 tagcatctca gagggcagct agatactgtc atacacactt ttcaagttct ccatttttgt 1380 gaaatagaaa gtctctggat ctaagttata tgtgattctc agtctctgtg gtcatattct 1440 attctactcc tgaccactca acaaggaacc aagatatcaa gggacacttg ttttgtttca 1500 tgcctgggtt gagtgggcca tgacatatgt tctgggcctt gttacatggc tggattggtt 1560 ggacaagtgc cagctctgat cctgggactg tggcatgtga tgacatacac cccctctcca 1620 cattctgcat gtctctaggg gggaaggggg aagctcggta tagaaccttt attgtatttt 1680 ctgattgcct cacttcttat attgccccca tgcccttctt tgttcctcaa gtaaccagag 1740 acagtgcttc ccagaaccaa ccctacaaga aacaaagggc taaacaaagc caaatgggaa 1800 gcaggatcat ggtttgaact ctttctggcc agagaacaat acctgctatg gactagatac 1860 tgggagaggg aaaggaaaag tagggtgaat tatggaagga agctggcagg ctcagcgttt 1920 ctgtcttggc atgaccagtc tctcttcatt ctcttcctag atgtagggct tggtaccaga 1980 gcccctgagg ctttctgcat gaatataaat atatgaaact gagtgatgct tccatttcag 2040 gttcttgggg gcgccgaatt cgagctcggt acccggggat ctcgagaagc tttaaccatg 2100 gaatggagct gggtctttct cttcttcctg tcagtaacta caggtgtcca ctcccaggtt 2160 cagttgcagc agtctgacgc tgagttggtg aaacctgggg cttcagtgaa gatttcctgc 2220 aaggcttctg gctacacctt cactgaccat gcaattcact gggtgaaaca gaaccctgaa 2280 cagggcctgg aatggattgg atatttttct cccggaaatg atgattttaa atacaatgag 2340 aggttcaagg gcaaggccac actgactgca gacaaatcct ccagcactgc ctacgtgcag 2400 ctcaacagcc tgacatctga ggattctgca gtgtatttct gtacaagatc cctgaatatg 2460 gcctactggg gtcaaggaac ctcagtcacc gtctcctcag gaggcggagg cagcggaggc 2520 ggtggctcgg gaggcggagg ctcggacatt gtgatgtcac agtctccatc ctccctacct 2580 gtgtcagttg gcgagaaggt tactttgagc tgcaagtcca gtcagagcct tttatatagt 2640 ggtaatcaaa agaactactt ggcctggtac cagcagaaac cagggcagtc tcctaaactg 2700 ctgatttact gggcatccgc tagggaatct ggggtccctg atcgcttcac aggcagtgga 2760 tctgggacag atttcactct ctccatcagc agtgtgaaga ctgaagacct ggcagtttat 2820 tactgtcagc agtattatag ctatcccctc acgttcggtg ctgggaccaa gctggtgctg 2880 aaacgggccg ccgagcccaa atctcctgac aaaactcaca catgcccacc gtgcccagca 2940 cctgaactcc tggggggacc gtcagtcttc ctcttccccc caaaacccaa ggacaccctc 3000 atgatctccc ggacccctga ggtcacatgc gtggtggtgg acgtgagcca cgaagaccct 3060 gaggtcaagt tcaactggta cgtggacggc gtggaggtgc ataatgccaa gacaaagccg 3120 cgggaggagc agtacaacag cacgtaccgt gtggtcagcg tcctcaccgt cctgcaccag 3180 gactggctga atggcaagga gtacaagtgc aaggtctcca acaaagccct cccagccccc 3240 atcgagaaaa ccatctccaa agccaaaggg cagccccgag aaccacaggt gtacaccctg 3300 cccccatccc gggatgagct gaccaagaac caggtcagcc tgacctgcct ggtcaaaggc 3360 ttctatccca gcgacatcgc cgtggagtgg gagagcaatg ggcagccgga gaacaactac 3420 aagaccacgc ctcccgtgct ggactccgac ggctccttct tcctctacag caagctcacc 3480 gtggacaaga gcaggtggca gcaggggaac gtcttctcat gctccgtgat gcatgaggct 3540 ctgcacaacc actacacgca gaagagcctc tccctgtctc cgggtaaagg aggcggatca 3600 ggaggtggcg cacctacttc aagttctaca aagaaaacac agctacaact ggagcattta 3660 ctgctggatt tacagatgat tttgaatgga attaataatt acaagaatcc caaactcacc 3720 aggatgctca catttaagtt ttacatgccc aagaaggcca cagaactgaa acatcttcag 3780 tgtctagaag aagaactcaa acctctggag gaagtgctaa atttagctca aagcaaaaac 3840 tttcacttaa gacccaggga cttaatcagc aatatcaacg taatagttct ggaactaaag 3900 ggatctgaaa caacattcat gtgtgaatat gctgatgaga cagcaaccat tgtagaattt 3960 ctgaacagat ggattacctt ttgtcaaagc atcatctcaa cactaacttg aagcttgtta 4020 acatcgataa aataaaagat tttatttagt ctccagaaaa aggggggaat gaaagacccc 4080 acctgtaggt ttggcaagct agcttaagta acgccatttt gcaaggcatg gaaaaataca 4140 taactgagaa tagagaagtt cagatcaagg tcaggaacag atggaacagc tgaatatggg 4200 ccaaacagga tatctgtggt aagcagttcc tgccccggct cagggccaag aacagatgga 4260 acagctgaat atgggccaaa caggatatct gtggtaagca gttcctgccc cggctcaggg 4320 ccaagaacag atggtcccca gatgcggtcc agccctcagc agtttctaga gaaccatcag 4380 atgtttccag ggtgccccaa ggacctgaaa tgaccctgtg ccttatttga actaaccaat 4440 cagttcgctt ctcgcttctg ttcgcgcgct tctgctcccc gagctcaata aaagagccca 4500 caacccctca ctcggggcgc cagtcctccg attgactgag tcgcccgggt acccgtgtat 4560 ccaataaacc ctcttgcagt tgcatccgac ttgtggtctc gctgttcctt gggagggtct 4620 cctctgagtg attgactacc cgtcagcggg ggtctttcat t 4661 11 5691 DNA Artificial Sequence Synthetic 11 gatcagtcct gggtggtcat tgaaaggact gatgctgaag ttgaagctcc aatactttgg 60 ccacctgatg cgaagaactg actcatgtga taagaccctg atactgggaa agattgaagg 120 caggaggaga agggatgaca gaggatggaa gagttggatg gaatcaccaa ctcgatggac 180 atgagtttga gcaagcttcc aggagttggt aatgggcagg gaagcctggc gtgctgcagt 240 ccatggggtt gcaaagagtt ggacactact gagtgactga actgaactga tagtgtaatc 300 catggtacag aatataggat aaaaaagagg aagagtttgc cctgattctg aagagttgta 360 ggatataaaa gtttagaata cctttagttt ggaagtctta aattatttac ttaggatggg 420 tacccactgc aatataagaa atcaggcttt agagactgat gtagagagaa tgagccctgg 480 cataccagaa gctaacagct attggttata gctgttataa ccaatatata accaatatat 540 tggttatata gcatgaagct tgatgccagc aatttgaagg aaccatttag aactagtatc 600 ctaaactcta catgttccag gacactgatc ttaaagctca ggttcagaat cttgttttat 660 aggctctagg tgtatattgt ggggcttccc tggtggctca gatggtaaag tgtctgcctg 720 caatgtgggt gatctgggtt cgatccctgg cttgggaaga tcccctggag aaggaaatgg 780 caacccactc tagtactctt acctggaaaa ttccatggac agaggagcct tgtaagctac 840 agtccatggg attgcaaaga gttgaacaca actgagcaac taagcacagc acagtacagt 900 atacacctgt gaggtgaagt gaagtgaagg ttcaatgcag ggtctcctgc attgcagaaa 960 gattctttac catctgagcc accagggaag cccaagaata ctggagtggg tagcctattc 1020 cttctccagg ggatcttccc atcccaggaa ttgaactgga gtctcctgca tttcaggtgg 1080 attcttcacc agctgaacta ccaggtggat actactccaa tattaaagtg cttaaagtcc 1140 agttttccca cctttcccaa aaaggttggg tcactctttt ttaaccttct gtggcctact 1200 ctgaggctgt ctacaagctt atatatttat gaacacattt attgcaagtt gttagtttta 1260 gatttacaat gtggtatctg gctatttagt ggtattggtg gttggggatg gggaggctga 1320 tagcatctca gagggcagct agatactgtc atacacactt ttcaagttct ccatttttgt 1380 gaaatagaaa gtctctggat ctaagttata tgtgattctc agtctctgtg gtcatattct 1440 attctactcc tgaccactca acaaggaacc aagatatcaa gggacacttg ttttgtttca 1500 tgcctgggtt gagtgggcca tgacatatgt tctgggcctt gttacatggc tggattggtt 1560 ggacaagtgc cagctctgat cctgggactg tggcatgtga tgacatacac cccctctcca 1620 cattctgcat gtctctaggg gggaaggggg aagctcggta tagaaccttt attgtatttt 1680 ctgattgcct cacttcttat attgccccca tgcccttctt tgttcctcaa gtaaccagag 1740 acagtgcttc ccagaaccaa ccctacaaga aacaaagggc taaacaaagc caaatgggaa 1800 gcaggatcat ggtttgaact ctttctggcc agagaacaat acctgctatg gactagatac 1860 tgggagaggg aaaggaaaag tagggtgaat tatggaagga agctggcagg ctcagcgttt 1920 ctgtcttggc atgaccagtc tctcttcatt ctcttcctag atgtagggct tggtaccaga 1980 gcccctgagg ctttctgcat gaatataaat atatgaaact gagtgatgct tccatttcag 2040 gttcttgggg gcgccgaatt cgagctcggt acccggggat ctcgacggat ccgattactt 2100 actggcaggt gctgggggct tccgagacaa tcgcgaacat ctacaccaca caacaccgcc 2160 tcgaccaggg tgagatatcg gccggggacg cggcggtggt aattacaagc gagatccgat 2220 tacttactgg caggtgctgg gggcttccga gacaatcgcg aacatctaca ccacacaaca 2280 ccgcctcgac cagggtgaga tatcggccgg ggacgcggcg gtggtaatta caagcgagat 2340 ctcgagttaa cagatctagg cctcctaggt cgacggatcc ccgggaattc ggcgccgcca 2400 ccatgatgtc ctttgtctct ctgctcctgg taggcatcct attccatgcc acccaggccc 2460 aggtccaact gcagcagtct gggcctgagc tggtgaagcc tgggacttca gtgaggatat 2520 cctgcaaggc ttctggctac accttcacaa gctactattt acactgggtg aagcagaggc 2580 ctggacaggg acttgagtgg attgcatgga tttatcctgg aaatgttatt actacgtaca 2640 atgagaagtt caagggcaag gccacactga ctgcagacaa atcctccagc acagcctaca 2700 tgcacctcaa cagcctgacc tctgaggact ctgcggtcta tttctgtgca aggggtgacc 2760 atgatcttga ctactggggc caaggcacca ctctcacagt ctcctcagcc aaaacgacac 2820 ccccatctgt ctatccactg gcccctggat ctgctgccca aactaactcc atggtgaccc 2880 tgggatgcct ggtcaagggc tatttccctg agccagtgac agtgacctgg aactctggat 2940 ccctgtccag cggtgtgcac accttcccag ctgtcctgca gtctgacctc tacactctga 3000 gcagctcagt gactgtcccc tccagcacct ggcccagcga gaccgtcacc tgcaacgttg 3060 cccacccggc cagcagcacc aaggtggaca agaaaattgt gcccagggat tgtactagtg 3120 gaggtggagg tagctaaggg agatctcgac ggatccccgg gaattcgccc ctctccctcc 3180 ccccccccta acgttactgg ccgaagccgc ttggaataag gccggtgtgc gtttgtctat 3240 atgttatttt ccaccatatt gccgtctttt ggcaatgtga gggcccggaa acctggccct 3300 gtcttcttga cgagcattcc taggggtctt tcccctctcg ccaaaggaat gcaaggtctg 3360 ttgaatgtcg tgaaggaagc agttcctctg gaagcttctt gaagacaaac aacgtctgta 3420 gcgacccttt gcaggcagcg gaacccccca cctggcgaca ggtgcctctg cggccaaaag 3480 ccacgtgtat aagatacacc tgcaaaggcg gcacaacccc agtgccacgt tgtgagttgg 3540 atagttgtgg aaagagtcaa atggctctcc tcaagcgtat tcaacaaggg gctgaaggat 3600 gcccagaagg taccccattg tatgggatct gatctggggc ctcggtgcac atgctttaca 3660 tgtgtttagt cgaggttaaa aaaacgtcta ggccccccga accacgggga cgtggttttc 3720 ctttgaaaaa cacgatgata atatggcctc ctttgtctct ctgctcctgg taggcatcct 3780 attccatgcc acccaggccg acattgtgct gacacaatct ccagcaatca tgtctgcatc 3840 tccaggggag aaggtcacca tgacctgcag tgccacctca agtgtaagtt acatacactg 3900 gtaccagcag aagtcaggca cctcccccaa aagatggatt tatgacacat ccaaactggc 3960 ttctggagtc cctgctcgct tcagtggcag tgggtctggg acctctcact ctctcacact 4020 cagcagcatg gaggctgaag atgctgccac ttattactgc cagcagtggg gtagttacct 4080 cacgttcggt gcggggacca agctggagct gaaacgggct gatgctgcac caactgtatc 4140 catcttccca ccatccagtg agcagttaac atctggaggt gcctcagtcg tgtgcttctt 4200 gaacaacttc taccccaaag acatcaatgt caagtggaag attgatggca gtgaacgaca 4260 aaatggcgtc ctgaacagtt ggactgatca ggacagcaaa gacagcacct acagcatgag 4320 cagcaccctc acgttgacca aggacgagta tgaacgacat aacagctata cctgtgaggc 4380 cactcacaag acatcaactt cacccattgt caagagcttc aacaggaatg agtgttaata 4440 ggggagatct cgacatcgat aatcaacctc tggattacaa aatttgtgaa agattgactg 4500 gtattcttaa ctatgttgct ccttttacgc tatgtggata cgctgcttta atgcctttgt 4560 atcatgctat tgcttcccgt atggctttca ttttctcctc cttgtataaa tcctggttgc 4620 tgtctcttta tgaggagttg tggcccgttg tcaggcaacg tggcgtggtg tgcactgtgt 4680 ttgctgacgc aacccccact ggttggggca ttgccaccac ctgtcagctc ctttccggga 4740 ctttcgcttt ccccctccct attgccacgg cggaactcat cgccgcctgc cttgcccgct 4800 gctggacagg ggctcggctg ttgggcactg acaattccgt ggtgttgtcg gggaaatcat 4860 cgtcctttcc ttggctgctc gcctgtgttg ccacctggat tctgcgcggg acgtccttct 4920 gctacgtccc ttcggccctc aatccagcgg accttccttc ccgcggcctg ctgccggctc 4980 tgcggcctct tccgcgtctt cgccttcgcc ctcagacgag tcggatctcc ctttgggccg 5040 cctccccgcc tgatcgataa aataaaagat tttatttagt ctccagaaaa aggggggaat 5100 gaaagacccc acctgtaggt ttggcaagct agcttaagta acgccatttt gcaaggcatg 5160 gaaaaataca taactgagaa tagagaagtt cagatcaagg tcaggaacag atggaacagc 5220 tgaatatggg ccaaacagga tatctgtggt aagcagttcc tgccccggct cagggccaag 5280 aacagatgga acagctgaat atgggccaaa caggatatct gtggtaagca gttcctgccc 5340 cggctcaggg ccaagaacag atggtcccca gatgcggtcc agccctcagc agtttctaga 5400 gaaccatcag atgtttccag ggtgccccaa ggacctgaaa tgaccctgtg ccttatttga 5460 actaaccaat cagttcgctt ctcgcttctg ttcgcgcgct tctgctcccc gagctcaata 5520 aaagagccca caacccctca ctcggggcgc cagtcctccg attgactgag tcgcccgggt 5580 acccgtgtat ccaataaacc ctcttgcagt tgcatccgac ttgtggtctc gctgttcctt 5640 gggagggtct cctctgagtg attgactacc cgtcagcggg ggtctttcat t 5691 12 668 DNA Artificial Sequence Synthetic 12 ggaattcgcc cctctccctc ccccccccct aacgttactg gccgaagccg cttggaataa 60 ggccggtgtg cgtttgtcta tatgttattt tccaccatat tgccgtcttt tggcaatgtg 120 agggcccgga aacctggccc tgtcttcttg acgagcattc ctaggggtct ttcccctctc 180 gccaaaggaa tgcaaggtct gttgaatgtc gtgaaggaag cagttcctct ggaagcttct 240 tgaagacaaa caacgtctgt agcgaccctt tgcaggcagc ggaacccccc acctggcgac 300 aggtgcctct gcggccaaaa gccacgtgta taagatacac ctgcaaaggc ggcacaaccc 360 cagtgccacg ttgtgagttg gatagttgtg gaaagagtca aatggctctc ctcaagcgta 420 ttcaacaagg ggctgaagga tgcccagaag gtaccccatt gtatgggatc tgatctgggg 480 cctcggtgca catgctttac atgtgtttag tcgaggttaa aaaaacgtct aggccccccg 540 aaccacgggg acgtggtttt cctttgaaaa acacgatgat aatatggcct tgctcatcct 600 tacctgtctt gtggctgttg ctcttgccgg cgccatggga tatctagatc tcgagctcgc 660 gaaagctt 668 13 6255 DNA Artificial Sequence Synthetic 13 tttgaaagac cccacccgta ggtggcaagc tagcttaagt aacgccactt tgcaaggcat 60 ggaaaaatac ataactgaga atagaaaagt tcagatcaag gtcaggaaca aagaaacagc 120 tgaataccaa acaggatatc tgtggtaagc ggttcctgcc ccggctcagg gccaagaaca 180 gatgagacag ctgagtgatg ggccaaacag gatatctgtg gtaagcagtt cctgccccgg 240 ctcggggcca agaacagatg gtccccagat gcggtccagc cctcagcagt ttctagtgaa 300 tcatcagatg tttccagggt gccccaagga cctgaaaatg accctgtacc ttatttgaac 360 taaccaatca gttcgcttct cgcttctgtt cgcgcgcttc cgctctccga gctcaataaa 420 agagcccaca acccctcact cggcgcgcca gtcttccgat agactgcgtc gcccgggtac 480 ccgtattccc aataaagcct cttgctgttt gcatccgaat cgtggtctcg ctgttccttg 540 ggagggtctc ctctgagtga ttgactaccc acgacggggg tctttcattt gggggctcgt 600 ccgggatttg gagacccctg cccagggacc accgacccac caccgggagg taagctggcc 660 agcaacttat ctgtgtctgt ccgattgtct agtgtctatg tttgatgtta tgcgcctgcg 720 tctgtactag ttagctaact agctctgtat ctggcggacc cgtggtggaa ctgacgagtt 780 ctgaacaccc ggccgcaacc ctgggagacg tcccagggac tttgggggcc gtttttgtgg 840 cccgacctga ggaagggagt cgatgtggaa tccgaccccg tcaggatatg tggttctggt 900 aggagacgag aacctaaaac agttcccgcc tccgtctgaa tttttgcttt cggtttggaa 960 ccgaagccgc gcgtcttgtc tgctgcagcg ctgcagcatc gttctgtgtt gtctctgtct 1020 gactgtgttt ctgtatttgt ctgaaaatta gggccagact gttaccactc ccttaagttt 1080 gaccttaggt cactggaaag atgtcgagcg gatcgctcac aaccagtcgg tagatgtcaa 1140 gaagagacgt tgggttacct tctgctctgc agaatggcca acctttaacg tcggatggcc 1200 gcgagacggc acctttaacc gagacctcat cacccaggtt aagatcaagg tcttttcacc 1260 tggcccgcat ggacacccag accaggtccc ctacatcgtg acctgggaag ccttggcttt 1320 tgacccccct ccctgggtca agccctttgt acaccctaag cctccgcctc ctcttcctcc 1380 atccgccccg tctctccccc ttgaacctcc tcgttcgacc ccgcctcgat cctcccttta 1440 tccagccctc actccttctc taggcgccgg aattccgatc tgatcaagag acaggatgag 1500 gatcgtttcg catgattgaa caagatggat tgcacgcagg ttctccggcc gcttgggtgg 1560 agaggctatt cggctatgac tgggcacaac agacaatcgg ctgctctgat gccgccgtgt 1620 tccggctgtc agcgcagggg cgcccggttc tttttgtcaa gaccgacctg tccggtgccc 1680 tgaatgaact gcaggacgag gcagcgcggc tatcgtggct ggccacgacg ggcgttcctt 1740 gcgcagctgt gctcgacgtt gtcactgaag cgggaaggga ctggctgcta ttgggcgaag 1800 tgccggggca ggatctcctg tcatctcacc ttgctcctgc cgagaaagta tccatcatgg 1860 ctgatgcaat gcggcggctg catacgcttg atccggctac ctgcccattc gaccaccaag 1920 cgaaacatcg catcgagcga gcacgtactc ggatggaagc cggtcttgtc gatcaggatg 1980 atctggacga agagcatcag gggctcgcgc cagccgaact gttcgccagg ctcaaggcgc 2040 gcatgcccga cggcgaggat ctcgtcgtga cccatggcga tgcctgcttg ccgaatatca 2100 tggtggaaaa tggccgcttt tctggattca tcgactgtgg ccggctgggt gtggcggacc 2160 gctatcagga catagcgttg gctacccgtg atattgctga agagcttggc ggcgaatggg 2220 ctgaccgctt cctcgtgctt tacggtatcg ccgctcccga ttcgcagcgc atcgccttct 2280 atcgccttct tgacgagttc ttctgagcgg gactctgggg ttcgaaatga ccgaccaagc 2340 gacgcccaac ctgccatcac gagatttcga ttccaccgcc gccttctatg aaaggttggg 2400 cttcggaatc gttttccggg acgccggctg gatgatcctc cagcgcgggg atctcatgct 2460 ggagttcttc gcccaccccg ggctcgatcc cctcgcgagt tggttcagct gctgcctgag 2520 gctggacgac ctcgcggagt tctaccggca gtgcaaatcc gtcggcatcc aggaaaccag 2580 cagcggctat ccgcgcatcc atgcccccga actgcaggag tggggaggca cgatggccgc 2640 tttggtcgag gcggatccgg ccattagcca tattattcat tggttatata gcataaatca 2700 atattggcta ttggccattg catacgttgt atccatatca taatatgtac atttatattg 2760 gctcatgtcc aacattaccg ccatgttgac attgattatt gactagttat taatagtaat 2820 caattacggg gtcattagtt catagcccat atatggagtt ccgcgttaca taacttacgg 2880 taaatggccc gcctggctga ccgcccaacg acccccgccc attgacgtca ataatgacgt 2940 atgttcccat agtaacgcca atagggactt tccattgacg tcaatgggtg gagtatttac 3000 ggtaaactgc ccacttggca gtacatcaag tgtatcatat gccaagtacg ccccctattg 3060 acgtcaatga cggtaaatgg cccgcctggc attatgccca gtacatgacc ttatgggact 3120 ttcctacttg gcagtacatc tacgtattag tcatcgctat taccatggtg atgcggtttt 3180 ggcagtacat caatgggcgt ggatagcggt ttgactcacg gggatttcca agtctccacc 3240 ccattgacgt caatgggagt ttgttttggc accaaaatca acgggacttt ccaaaatgtc 3300 gtaacaactc cgccccattg acgcaaatgg gcggtaggca tgtacggtgg gaggtctata 3360 taagcagagc tcgtttagtg aaccgtcaga tcgcctggag acgccatcca cgctgttttg 3420 acctccatag aagacaccgg gaccgatcca gcctccgcgg ccccaagctt ctcgacggat 3480 ccccgggaat tcaggccatc gatcccgccg ccaccatgga atggagctgg gtctttctct 3540 tcttcctgtc agtaactaca ggtgtccact ccgacatcca gatgacccag tctccagcct 3600 ccctatctgc atctgtggga gaaactgtca ctatcacatg tcgagcaagt gggaatattc 3660 acaattattt agcatggtat cagcagaaac agggaaaatc tcctcagctc ctggtctata 3720 atgcaaaaac cttagcagat ggtgtgccat caaggttcag tggcagtgga tcaggaacac 3780 aatattctct caagatcaac agcctgcagc ctgaagattt tgggagttat tactgtcaac 3840 atttttggag tactccgtgg acgttcggtg gaggcaccaa gctggaaatc aaacgggctg 3900 atgctgcacc aactgtatcc atcttcccac catccagtga gcagttaaca tctggaggtg 3960 cctcagtcgt gtgcttcttg aacaacttct accccaaaga catcaatgtc aagtggaaga 4020 ttgatggcag tgaacgacaa aatggcgtcc tgaacagttg gactgatcag gacagcaaag 4080 acagcaccta cagcatgagc agcaccctca cattgaccaa ggacgagtat gaacgacata 4140 acagctatac ctgtgaggcc actcacaaga catcaacttc acccattgtc aagagcttca 4200 acaggaatga gtgttgaaag catcgatttc ccctgaattc gcccctctcc ctcccccccc 4260 cctaacgtta ctggccgaag ccgcttggaa taaggccggt gtgcgtttgt ctatatgtta 4320 ttttccacca tattgccgtc ttttggcaat gtgagggccc ggaaacctgg ccctgtcttc 4380 ttgacgagca ttcctagggg tctttcccct ctcgccaaag gaatgcaagg tctgttgaat 4440 gtcgtgaagg aagcagttcc tctggaagct tcttgaagac aaacaacgtc tgtagcgacc 4500 ctttgcaggc agcggaaccc cccacctggc gacaggtgcc tctgcggcca aaagccacgt 4560 gtataagata cacctgcaaa ggcggcacaa ccccagtgcc acgttgtgag ttggatagtt 4620 gtggaaagag tcaaatggct ctcctcaagc gtattcaaca aggggctgaa ggatgcccag 4680 aaggtacccc attgtatggg atctgatctg gggcctcggt gcacatgctt tacatgtgtt 4740 tagtcgaggt taaaaaaacg tctaggcccc ccgaaccacg gggacgtggt tttcctttga 4800 aaaacacgat gataatatgg cctcctttgt ctctctgctc ctggtaggca tcctattcca 4860 tgccacccag gccgaggttc agcttcagca gtctggggca gagcttgtga agccaggggc 4920 ctcagtcaag ttgtcctgca cagcttctgg cttcaacatt aaagacacct ttatgcactg 4980 ggtgaagcag aggcctgaac agggcctgga gtggattgga aggattgatc ctgcgaatgg 5040 gaatactgaa tatgacccga agttccaggg caaggccact ataacagcag acacatcctc 5100 caacacagtc aacctgcagc tcagcagcct gacatctgag gacactgccg tctattactg 5160 tgctagtgga ggggaactgg ggtttcctta ctggggccaa gggactctgg tcactgtctc 5220 tgcagccaaa acgacacccc catctgtcta tccactggcc cctggatctg ctgcccaaac 5280 taactccatg gtgaccctgg gatgcctggt caagggctat ttccctgagc cagtgacagt 5340 gacctggaac tctggatccc tgtccagcgg tgtgcacacc ttcccagctg tcctgcagtc 5400 tgacctctac actctgagca gctcagtgac tgtcccctcc agcacctggc ccagcgagac 5460 cgtcacctgc aacgttgccc acccggccag cagcaccaag gtggacaaga aaattgtgcc 5520 cagggattgt actagtggag gtggaggtag ccaccatcac catcaccatt aatctagagt 5580 taagcggccg tcgagatcta ggcctcctag gtcgacatcg ataaaataaa agattttatt 5640 tagtctccag aaaaaggggg gaatgaaaga ccccacctgt aggtttggca agctagctta 5700 agtaacgcca ttttgcaagg catggaaaaa tacataactg agaatagaga agttcagatc 5760 aaggtcagga acagatggaa cagctgaata tgggccaaac aggatatctg tggtaagcag 5820 ttcctgcccc ggctcagggc caagaacaga tggaacagct gaatatgggc caaacaggat 5880 atctgtggta agcagttcct gccccggctc agggccaaga acagatggtc cccagatgcg 5940 gtccagccct cagcagtttc tagagaacca tcagatgttt ccagggtgcc ccaaggacct 6000 gaaatgaccc tgtgccttat ttgaactaac caatcagttc gcttctcgct tctgttcgcg 6060 cgcttctgct ccccgagctc aataaaagag cccacaaccc ctcactcggg gcgccagtcc 6120 tccgattgac tgagtcgccc gggtacccgt gtatccaata aaccctcttg cagttgcatc 6180 cgacttgtgg tctcgctgtt ccttgggagg gtctcctctg agtgattgac tacccgtcag 6240 cgggggtctt tcatt 6255 14 43 DNA Artificial Sequence Synthetic 14 ctttgaaaaa cacgatgata atatggcctc ctttgtctct ctg 43 15 30 DNA Artificial Sequence Synthetic 15 ttcgcgagct cgagatctag atatcccatg 30 16 35 DNA Artificial Sequence Synthetic 16 ctacaggtgt ccacgtcgac atccagctga cccag 35 17 34 DNA Artificial Sequence Synthetic 17 ctgcagaata gatctctaac actctcccct gttg 34 18 51 DNA Artificial Sequence Synthetic 18 cagtgtgatc tcgagaattc aggacctcac catgggatgg agctgtatca t 51 19 23 DNA Artificial Sequence Synthetic 19 aggctgtatt ggtggattcg tct 23 20 41 DNA Artificial Sequence Synthetic 20 agcttctcga gttaacagat ctaggcctcc taggtcgaca t 41 21 39 DNA Artificial Sequence Synthetic 21 cgatgtcgac ctaggaggcc tagatctgtt aactcgaga 39 22 64 DNA Artificial Sequence Synthetic 22 cgaggctctg cacaaccact acacgcagaa gagcctctcc ctgtctcccg ggaaatgaaa 60 gccg 64 23 72 DNA Artificial Sequence Synthetic 23 aattcggctt tcatttcccg ggagacaggg agaggctctt ctgcgtgtag tggttgtgca 60 gagcctcgtg ca 72 24 41 DNA Artificial Sequence Synthetic 24 aaagcatatg ttctgggcct tgttacatgg ctggattggt t 41 25 54 DNA Artificial Sequence Synthetic 25 tgaattcggc gcccccaaga acctgaaatg gaagcatcac tcagtttcat atat 54 26 35 DNA Artificial Sequence Synthetic 26 ctacaggtgt ccacgtcgac atccagctga cccag 35 27 34 DNA Artificial Sequence Synthetic 27 ctgcagaata gatctctaac actctcccct gttg 34 28 51 DNA Artificial Sequence Synthetic 28 cagtgtgatc tcgagaattc aggacctcac catgggatgg agctgtatca t 51 29 22 DNA Artificial Sequence Synthetic 29 gtgtcttcgg gtctcaggct gt 22 30 41 DNA Artificial Sequence Synthetic 30 agcttctcga gttaacagat ctaggcctcc taggtcgaca t 41 31 39 DNA Artificial Sequence Synthetic 31 cgatgtcgac ctaggaggcc tagatctgtt aactcgaga 39 32 64 DNA Artificial Sequence Synthetic 32 cgaggctctg cacaaccact acacgcagaa gagcctctcc ctgtctcccg ggaaatgaaa 60 gccg 64 33 72 DNA Artificial Sequence Synthetic 33 aattcggctt tcatttcccg ggagacaggg agaggctctt ctgcgtgtag tggttgtgca 60 gagcctcgtg ca 72 34 9511 DNA Artificial Sequence Synthetic 34 gaattaattc ataccagatc accgaaaact gtcctccaaa tgtgtccccc tcacactccc 60 aaattcgcgg gcttctgcct cttagaccac tctaccctat tccccacact caccggagcc 120 aaagccgcgg cccttccgtt tctttgcttt tgaaagaccc cacccgtagg tggcaagcta 180 gcttaagtaa cgccactttg caaggcatgg aaaaatacat aactgagaat agaaaagttc 240 agatcaaggt caggaacaaa gaaacagctg aataccaaac aggatatctg tggtaagcgg 300 ttcctgcccc ggctcagggc caagaacaga tgagacagct gagtgatggg ccaaacagga 360 tatctgtggt aagcagttcc tgccccggct cggggccaag aacagatggt ccccagatgc 420 ggtccagccc tcagcagttt ctagtgaatc atcagatgtt tccagggtgc cccaaggacc 480 tgaaaatgac cctgtacctt atttgaacta accaatcagt tcgcttctcg cttctgttcg 540 cgcgcttccg ctctccgagc tcaataaaag agcccacaac ccctcactcg gcgcgccagt 600 cttccgatag actgcgtcgc ccgggtaccc gtattcccaa taaagcctct tgctgtttgc 660 atccgaatcg tggtctcgct gttccttggg agggtctcct ctgagtgatt gactacccac 720 gacgggggtc tttcatttgg gggctcgtcc gggatttgga gacccctgcc cagggaccac 780 cgacccacca ccgggaggta agctggccag caacttatct gtgtctgtcc gattgtctag 840 tgtctatgtt tgatgttatg cgcctgcgtc tgtactagtt agctaactag ctctgtatct 900 ggcggacccg tggtggaact gacgagttct gaacacccgg ccgcaaccct gggagacgtc 960 ccagggactt tgggggccgt ttttgtggcc cgacctgagg aagggagtcg atgtggaatc 1020 cgaccccgtc aggatatgtg gttctggtag gagacgagaa cctaaaacag ttcccgcctc 1080 cgtctgaatt tttgctttcg gtttggaacc gaagccgcgc gtcttgtctg ctgcagcgct 1140 gcagcatcgt tctgtgttgt ctctgtctga ctgtgtttct gtatttgtct gaaaattagg 1200 gccagactgt taccactccc ttaagtttga ccttaggtca ctggaaagat gtcgagcgga 1260 tcgctcacaa ccagtcggta gatgtcaaga agagacgttg ggttaccttc tgctctgcag 1320 aatggccaac ctttaacgtc ggatggccgc gagacggcac ctttaaccga gacctcatca 1380 cccaggttaa gatcaaggtc ttttcacctg gcccgcatgg acacccagac caggtcccct 1440 acatcgtgac ctgggaagcc ttggcttttg acccccctcc ctgggtcaag ccctttgtac 1500 accctaagcc tccgcctcct cttcctccat ccgccccgtc tctccccctt gaacctcctc 1560 gttcgacccc gcctcgatcc tccctttatc cagccctcac tccttctcta ggcgccggaa 1620 ttccgatctg atcaagagac aggatgaggg agcttgtata tccattttcg gatctgatca 1680 gcacgtgttg acaattaatc atcggcatag tatatcggca tagtataata cgacaaggtg 1740 aggaactaaa ccatggccaa gcctttgtct caagaagaat ccaccctcat tgaaagagca 1800 acggctacaa tcaacagcat ccccatctct gaagactaca gcgtcgccag cgcagctctc 1860 tctagcgacg gccgcatctt cactggtgtc aatgtatatc attttactgg gggaccttgt 1920 gcagaactcg tggtgctggg cactgctgct gctgcggcag ctggcaacct gacttgtatc 1980 gtcgcgatcg gaaatgagaa caggggcatc ttgagcccct gcggacggtg tcgacaggtg 2040 cttctcgatc tgcatcctgg gatcaaagcg atagtgaagg acagtgatgg acagccgacg 2100 gcagttggga ttcgtgaatt gctgccctct ggttatgtgt gggagggcta agcacttcgt 2160 ggccgaggag caggactgac acgtgctacg agatttcgat tccaccgccg ccttctatga 2220 aaggttgggc ttcggaatcg ttttccggga cgccggctgg atgatcctcc agcgcgggga 2280 tctcatgctg gagttcttcg cccaccccaa cttgtttatt gcagcttata atggttacaa 2340 ataaagcaat agcatcacaa atttcacaaa taaagcattt ttttcactgc attctagttg 2400 tggtttgtcc aaactcatca atgtatctta tcatgtctgt acgagttggt tcagctgctg 2460 cctgaggctg gacgacctcg cggagttcta ccggcagtgc aaatccgtcg gcatccagga 2520 aaccagcagc ggctatccgc gcatccatgc ccccgaactg caggagtggg gaggcacgat 2580 ggccgctttg gtcgaggcgg atccggccat tagccatatt attcattggt tatatagcat 2640 aaatcaatat tggctattgg ccattgcata cgttgtatcc atatcataat atgtacattt 2700 atattggctc atgtccaaca ttaccgccat gttgacattg attattgact agttattaat 2760 agtaatcaat tacggggtca ttagttcata gcccatatat ggagttccgc gttacataac 2820 ttacggtaaa tggcccgcct ggctgaccgc ccaacgaccc ccgcccattg acgtcaataa 2880 tgacgtatgt tcccatagta acgccaatag ggactttcca ttgacgtcaa tgggtggagt 2940 atttacggta aactgcccac ttggcagtac atcaagtgta tcatatgcca agtacgcccc 3000 ctattgacgt caatgacggt aaatggcccg cctggcatta tgcccagtac atgaccttat 3060 gggactttcc tacttggcag tacatctacg tattagtcat cgctattacc atggtgatgc 3120 ggttttggca gtacatcaat gggcgtggat agcggtttga ctcacgggga tttccaagtc 3180 tccaccccat tgacgtcaat gggagtttgt tttggcacca aaatcaacgg gactttccaa 3240 aatgtcgtaa caactccgcc ccattgacgc aaatgggcgg taggcatgta cggtgggagg 3300 tctatataag cagagctcgt ttagtgaacc gtcagatcgc ctggagacgc catccacgct 3360 gttttgacct ccatagaaga caccgggacc gatccagcct ccgcggcccc aagcttctcg 3420 agttaacaga tctaggctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca 3480 acgcaattaa tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc 3540 cggctcgtat gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg 3600 accatgatta cgccaagctt ggctgcaggt cgacggatcc actagtaacg gccgccagtg 3660 tgctggaatt caccatgggg caacccggga acggcagcgc cttcttgctg gcacccaatg 3720 gaagccatgc gccggaccac gacgtcacgc agcaaaggga cgaggtgtgg gtggtgggca 3780 tgggcatcgt catgtctctc atcgtcctgg ccatcgtgtt tggcaatgtg ctggtcatca 3840 cagccattgc caagttcgag cgtctgcaga cggtcaccaa ctacttcatc acaagcttgg 3900 cctgtgctga tctggtcatg gggctagcag tggtgccctt tggggccgcc catattctca 3960 tgaaaatgtg gacttttggc aacttctggt gcgagttctg gacttccatt gatgtgctgt 4020 gcgtcacggc atcgattgag accctgtgcg tgatcgcagt cgaccgctac tttgccatta 4080 ctagtccttt caagtaccag agcctgctga ccaagaataa ggcccgggtg atcattctga 4140 tggtgtggat tgtgtcaggc cttacctcct tcttgcccat tcagatgcac tggtacaggg 4200 ccacccacca ggaagccatc aactgctatg ccaatgagac ctgctgtgac ttcttcacga 4260 accaagccta tgccattgcc tcttccatcg tgtccttcta cgttcccctg gtgatcatgg 4320 tcttcgtcta ctccagggtc tttcaggagg ccaaaaggca gctccagaag attgacaaat 4380 ctgagggccg cttccatgtc cagaacctta gccaggtgga gcaggatggg cggacggggc 4440 atggactccg cagatcttcc aagttctgct tgaaggagca caaagccctc aagacgttag 4500 gcatcatcat gggcactttc accctctgct ggctgccctt cttcatcgtt aacattgtgc 4560 atgtgatcca ggataacctc atccgtaagg aagtttacat cctcctaaat tggataggct 4620 atgtcaattc tggtttcaat ccccttatct actgccggag cccagatttc aggattgcct 4680 tccaggagct tctgtgcctg cgcaggtctt ctttgaaggc ctatggcaat ggctactcca 4740 gcaacggcaa cacaggggag cagagtggat atcacgtgga acaggagaaa gaaaataaac 4800 tgctgtgtga agacctccca ggcacggaag actttgtggg ccatcaaggt actgtgccta 4860 gcgataacat tgattcacaa gggaggaatt gtagtacaaa tgactcactg ctctcgagaa 4920 tcgaggggcg gcaccaccat catcaccacg tcgaccccgg ggactacaag gatgacgatg 4980 acaagtaagc tttatccatc acactggcgg ccgctcgagc atgcatctag cggccgctcg 5040 aggccggcaa ggccggatcc ccgggaattc gcccctctcc ctcccccccc cctaacgtta 5100 ctggccgaag ccgcttggaa taaggccggt gtgcgtttgt ctatatgtta ttttccacca 5160 tattgccgtc ttttggcaat gtgagggccc ggaaacctgg ccctgtcttc ttgacgagca 5220 ttcctagggg tctttcccct ctcgccaaag gaatgcaagg tctgttgaat gtcgtgaagg 5280 aagcagttcc tctggaagct tcttgaagac aaacaacgtc tgtagcgacc ctttgcaggc 5340 agcggaaccc cccacctggc gacaggtgcc tctgcggcca aaagccacgt gtataagata 5400 cacctgcaaa ggcggcacaa ccccagtgcc acgttgtgag ttggatagtt gtggaaagag 5460 tcaaatggct ctcctcaagc gtattcaaca aggggctgaa ggatgcccag aaggtacccc 5520 attgtatggg atctgatctg gggcctcggt gcacatgctt tacatgtgtt tagtcgaggt 5580 taaaaaaacg tctaggcccc ccgaaccacg gggacgtggt tttcctttga aaaacacgat 5640 gataatatgg cctcctttgt ctctctgctc ctggtaggca tcctattcca tgccacccag 5700 gccgagctca cccagtctcc agactccctg gctgtgtctc tgggcgagag ggccaccatc 5760 aactgcaagt ccagccagag tgttttgtac agctccaaca ataagaacta tttagcttgg 5820 tatcagcaga aaccaggaca gcctcctaag ctgctcattt actgggcatc tacccgggaa 5880 tccggggtcc ctgaccgatt cagtggcagc gggtctggga cagatttcac tctcaccatc 5940 agcagcctgc aggctgaaga tgtggcagtt tattactgtc agcaatatta tagtactcag 6000 acgttcggcc aagggaccaa ggtggaaatc aaacgaactg tggctgcacc atctgtcttc 6060 atcttcccgc catctgatga gcagttgaaa tctggaactg cctctgttgt gtgcctgctg 6120 aataacttct atcccagaga ggccaaagta cagtggaagg tggataacgc cctccaatcg 6180 ggtaactccc aggagagtgt cacagagcag gacagcaagg acagcaccta cagcctcagc 6240 agcaccctga cgctgagcaa agcagactac gagaaacaca aactctacgc ctgcgaagtc 6300 acccatcagg gcctgagatc gcccgtcaca aagagcttca acaaggggag agtgttagtt 6360 ctagataatt aattaggagg agatctcgag ctcgcgaaag cttggcactg gccgtcgttt 6420 tacaacgtcg tgactgggaa aaccctggcg ttacccaact taatcgcctt gcagcacatc 6480 cccctttcgc cagcctccta ggtcgacatc gataaaataa aagattttat ttagtctcca 6540 gaaaaagggg ggaatgaaag accccacctg taggtttggc aagctagctt aagtaacgcc 6600 attttgcaag gcatggaaaa atacataact gagaatagag aagttcagat caaggtcagg 6660 aacagatgga acagctgaat atgggccaaa caggatatct gtggtaagca gttcctgccc 6720 cggctcaggg ccaagaacag atggaacagc tgaatatggg ccaaacagga tatctgtggt 6780 aagcagttcc tgccccggct cagggccaag aacagatggt ccccagatgc ggtccagccc 6840 tcagcagttt ctagagaacc atcagatgtt tccagggtgc cccaaggacc tgaaatgacc 6900 ctgtgcctta tttgaactaa ccaatcagtt cgcttctcgc ttctgttcgc gcgcttctgc 6960 tccccgagct caataaaaga gcccacaacc cctcactcgg ggcgccagtc ctccgattga 7020 ctgagtcgcc cgggtacccg tgtatccaat aaaccctctt gcagttgcat ccgacttgtg 7080 gtctcgctgt tccttgggag ggtctcctct gagtgattga ctacccgtca gcgggggtct 7140 ttcatttggg ggctcgtccg ggatcgggag acccctgccc agggaccacc gacccaccac 7200 cgggaggtaa gctggctgcc tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat 7260 gcagctcccg gagacggtca cagcttgtct gtaagcggat gccgggagca gacaagcccg 7320 tcagggcgcg tcagcgggtg ttggcgggtg tcggggcgca gccatgaccc agtcacgtag 7380 cgatagcgga gtgtatactg gcttaactat gcggcatcag agcagattgt actgagagtg 7440 caccatatgc ggtgtgaaat accgcacaga tgcgtaagga gaaaataccg catcaggcgc 7500 tcttccgctt cctcgctcac tgactcgctg cgctcggtcg ttcggctgcg gcgagcggta 7560 tcagctcact caaaggcggt aatacggtta tccacagaat caggggataa cgcaggaaag 7620 aacatgtgag caaaaggcca gcaaaaggcc aggaaccgta aaaaggccgc gttgctggcg 7680 tttttccata ggctccgccc ccctgacgag catcacaaaa atcgacgctc aagtcagagg 7740 tggcgaaacc cgacaggact ataaagatac caggcgtttc cccctggaag ctccctcgtg 7800 cgctctcctg ttccgaccct gccgcttacc ggatacctgt ccgcctttct cccttcggga 7860 agcgtggcgc tttctcatag ctcacgctgt aggtatctca gttcggtgta ggtcgttcgc 7920 tccaagctgg gctgtgtgca cgaacccccc gttcagcccg accgctgcgc cttatccggt 7980 aactatcgtc ttgagtccaa cccggtaaga cacgacttat cgccactggc agcagccact 8040 ggtaacagga ttagcagagc gaggtatgta ggcggtgcta cagagttctt gaagtggtgg 8100 cctaactacg gctacactag aaggacagta tttggtatct gcgctctgct gaagccagtt 8160 accttcggaa aaagagttgg tagctcttga tccggcaaac aaaccaccgc tggtagcggt 8220 ggtttttttg tttgcaagca gcagattacg cgcagaaaaa aaggatctca agaagatcct 8280 ttgatctttt ctacggggtc tgacgctcag tggaacgaaa actcacgtta agggattttg 8340 gtcatgagat tatcaaaaag gatcttcacc tagatccttt taaattaaaa atgaagtttt 8400 aaatcaatct aaagtatata tgagtaaact tggtctgaca gttaccaatg cttaatcagt 8460 gaggcaccta tctcagcgat ctgtctattt cgttcatcca tagttgcctg actccccgtc 8520 gtgtagataa ctacgatacg ggagggctta ccatctggcc ccagtgctgc aatgataccg 8580 cgagacccac gctcaccggc tccagattta tcagcaataa accagccagc cggaagggcc 8640 gagcgcagaa gtggtcctgc aactttatcc gcctccatcc agtctattaa ttgttgccgg 8700 gaagctagag taagtagttc gccagttaat agtttgcgca acgttgttgc cattgctgca 8760 ggcatcgtgg tgtcacgctc gtcgtttggt atggcttcat tcagctccgg ttcccaacga 8820 tcaaggcgag ttacatgatc ccccatgttg tgcaaaaaag cggttagctc cttcggtcct 8880 ccgatcgttg tcagaagtaa gttggccgca gtgttatcac tcatggttat ggcagcactg 8940 cataattctc ttactgtcat gccatccgta agatgctttt ctgtgactgg tgagtactca 9000 accaagtcat tctgagaata gtgtatgcgg cgaccgagtt gctcttgccc ggcgtcaaca 9060 cgggataata ccgcgccaca tagcagaact ttaaaagtgc tcatcattgg aaaacgttct 9120 tcggggcgaa aactctcaag gatcttaccg ctgttgagat ccagttcgat gtaacccact 9180 cgtgcaccca actgatcttc agcatctttt actttcacca gcgtttctgg gtgagcaaaa 9240 acaggaaggc aaaatgccgc aaaaaaggga ataagggcga cacggaaatg ttgaatactc 9300 atactcttcc tttttcaata ttattgaagc atttatcagg gttattgtct catgagcgga 9360 tacatatttg aatgtattta gaaaaataaa caaatagggg ttccgcgcac atttccccga 9420 aaagtgccac ctgacgtcta agaaaccatt attatcatga cattaaccta taaaaatagg 9480 cgtatcacga ggccctttcg tcttcaagaa t 9511 35 30 DNA Artificial Sequence Synthetic 35 gatccactag taacggccgc cagaattcgc 30 36 43 DNA Artificial Sequence Synthetic 36 cagagagaca aaggaggcca tattatcatc gtgtttttca aag 43

Claims (33)

What is claimed is:
1. A nucleic acid comprising a nucleic acid sequence selected from the group consisting of SEQ ID NO:1 and sequences hybridizable to SEQ ID NO:1 under low stringency conditions, wherein said nucleic acid contains sequences derived from at least two mammalian sources and causes mammary specific gene expression.
2. A vector comprising the nucleic acid sequence of claim 1.
3. The vector of claim 2, wherein said vector is a retroviral vector.
4. A host cell comprising the vector of claim 2.
5. A nucleic acid comprising a nucleic acid sequence selected from the group consisting of SEQ ID NO:2 and sequences hybridizable to SEQ ID NO:2 under low stringency conditions, wherein said hybridizable sequence comprises ATG sequences that have been mutated at at least one of the positions corresponding to nucleic acid residues 4, 112, 131, and 238 of SEQ ID NO:2.
6. A vector comprising the nucleic acid sequence of claim 5.
7. The vector of claim 6, wherein said vector is a retroviral vector.
8. A host cell comprising the vector of claim 6.
9. A retroviral vector comprising at least one Pre-mRNA Processing Enhancer element.
10. The vector of claim 9, wherein said Pre-mRNA Processing Enhancer element is selected from SEQ ID NO:2 and sequences hybridizable to SEQ ID NO:2 under medium stringency conditions.
11. The vector of claim 9, wherein said Pre-mRNA Processing Enhancer element is a WPRE element.
12. A nucleic acid sequence encoding an IRES coding sequence and a signal peptide coding sequence, wherein said IRES and signal peptide coding sequences are adjacent to one another.
13. The nucleic acid of claim 12, wherein said signal peptide is selected from the group consisting of alpha-casein, human growth hormone, and alpha-lactalbumin signal peptides.
14. A vector comprising the nucleic acid sequence of claim 12.
15. The vector of claim 14, wherein said vector is a retroviral vector.
16. A host cell comprising the vector of claim 14.
17. A method for producing a protein of interest comprising:
a) providing
i) a host cell; and
ii) a vector comprising at least one exogenous gene encoding a protein operably linked to a bovine/human hybrid alpha-lactalbumin promoter; and
b) introducing said vector to said host cell under conditions such that expression of said protein encoded by said exogenous gene is expressed.
18. The method of claim 17, wherein said vector further comprises a mutant RNA export element comprising SEQ ID NO:2.
19. The method of claim 17, wherein said vector comprises at least two exogenous genes.
20. The method of claim 19, wherein said at least two exogenous genes are arranged in a polycistronic sequence separated by an internal ribosome entry site/bovine alpha-lactalbumin promoter signal peptide.
21. A method for producing an immunoglobulin comprising:
a) providing
i) a host cell; and
ii) a vector comprising a first exogenous gene and a second exogenous gene, wherein said first exogenous gene encodes a first immunoglobulin chain and wherein said second exogenous gene encodes a second immunoglobulin chain and wherein said first and said second genes are separated by an internal ribosome entry site; and
b) introducing said vector to said host cell under conditions such that said first immunoglobulin chain and said second immunoglobulin chain are expressed.
22. The method of claim 21, wherein one of said first immunoglobulin chain and said second immunoglobulin chain is an immunoglobulin light chain and wherein the other of said first immunoglobulin chain and said second immunoglobulin chain is an immunoglobulin heavy chain.
23. The method of claim 22, wherein said heavy chain is selected from the group consisting of γ, α, μ, δ, or ε heavy chains.
24. The method of claim 22, wherein said light chain is selected from the group consisting of κ and λ light chains.
25. The method of claim 21, wherein said immunoglobulin is a secretory immunoglobulin.
26. The method of claim 21, wherein said vector is a retroviral vector.
27. The method of claim 21, wherein said vector further comprises a bovine alpha-lactalbumin signal peptide.
28. The method of claim 21, wherein said vector further comprises a bovine/human hybrid alpha-lactalbumin promoter.
29. The method of claim 21, wherein said first antibody chain and said second antibody chain are expressed at a ratio of about 0.9:1.1.
30. An antibody produced by the method of claim 21.
31. The method of claim 21, wherein said vector is selected from the group consisting of a retroviral vector and a plasmid vector.
32. The method of claim 21, wherein said vector is a retroviral vector.
33. The method of claim 32, wherein said retroviral vector is a pseudotyped retroviral vector.
US09/897,006 2000-07-03 2001-06-29 Expression vectors Abandoned US20020106729A1 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
AU2001270252A AU2001270252B2 (en) 2000-07-03 2001-06-29 Expression vectors
US09/897,006 US20020106729A1 (en) 2000-07-03 2001-06-29 Expression vectors
US10/947,881 US7378273B2 (en) 2000-07-03 2004-09-23 Expression vectors

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US21585100P 2000-07-03 2000-07-03
US09/897,006 US20020106729A1 (en) 2000-07-03 2001-06-29 Expression vectors

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US10/947,881 Continuation US7378273B2 (en) 2000-07-03 2004-09-23 Expression vectors

Publications (1)

Publication Number Publication Date
US20020106729A1 true US20020106729A1 (en) 2002-08-08

Family

ID=22804662

Family Applications (2)

Application Number Title Priority Date Filing Date
US09/897,006 Abandoned US20020106729A1 (en) 2000-07-03 2001-06-29 Expression vectors
US10/947,881 Expired - Lifetime US7378273B2 (en) 2000-07-03 2004-09-23 Expression vectors

Family Applications After (1)

Application Number Title Priority Date Filing Date
US10/947,881 Expired - Lifetime US7378273B2 (en) 2000-07-03 2004-09-23 Expression vectors

Country Status (7)

Country Link
US (2) US20020106729A1 (en)
EP (1) EP1297168A2 (en)
JP (1) JP4762478B2 (en)
KR (1) KR100545945B1 (en)
AU (2) AU2001270252B2 (en)
CA (1) CA2413156C (en)
WO (1) WO2002002783A2 (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030224408A1 (en) * 2002-03-07 2003-12-04 Dyax Corporation Ligand screening and discovery
US7384738B2 (en) 2002-03-28 2008-06-10 Bremel Robert D Retrovirus-based genomic screening
WO2019079772A1 (en) * 2017-10-20 2019-04-25 Fred Hutchinson Cancer Research Center Systems and methods to produce b cells genetically modified to express selected antibodies
US20200190524A1 (en) * 2014-04-09 2020-06-18 Dna2.0, Inc. Dna vectors, transposons and transposases for eukaryotic genome modification
US11466258B2 (en) * 2013-10-11 2022-10-11 The Schepens Eye Research Institute, Inc. Methods of predicting ancestral virus sequences and uses thereof

Families Citing this family (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100450266B1 (en) * 2002-01-16 2004-09-30 주식회사 엘지생명과학 Bicistronic vector for gene-expression by animal cell, the cell line thereof, and the preparation method for antibody peptide thereof
US20040038304A1 (en) * 2002-03-28 2004-02-26 Gala Design, Inc. Antibody libraries
WO2005017149A1 (en) * 2003-06-03 2005-02-24 Cell Genesys, Inc. Compositions and methods for enhanced expression of recombinant polypeptides from a single vector using a peptide cleavage site
KR100533794B1 (en) * 2003-10-02 2005-12-07 주식회사 프로젠 Method for mass production of human Follicle Stimulating Hormone
WO2006068794A2 (en) * 2004-11-22 2006-06-29 New York University Genetically engineered clostridial genes, proteins encoded by the engineered genes, and uses thereof
MX2007010008A (en) * 2005-02-16 2008-01-18 Lentigen Corp Lentiviral vectors and their use.
US7632509B2 (en) * 2005-07-19 2009-12-15 Biosante Pharmaceuticals, Inc. Methods to express recombinant proteins from lentiviral vectors
WO2010129602A2 (en) * 2009-05-04 2010-11-11 Fred Hutchinson Cancer Research Center Cocal vesiculovirus envelope pseudotyped retroviral vectors
EP2523971B1 (en) 2010-01-13 2015-02-25 APT Therapeutics, Inc. Therapeutic apyrase constructs, apyrase agents, and production methods
DK2640743T3 (en) 2010-11-16 2017-01-23 Excelimmune Inc PROCEDURES FOR THE PREPARATION OF RECOMBINANT PROTEINS
EP3466969A1 (en) 2011-02-07 2019-04-10 Cerenis Therapeutics Holding SA Lipoprotein complexes and manufacturing and use thereof
US8865881B2 (en) * 2011-02-22 2014-10-21 California Institute Of Technology Delivery of proteins using adeno-associated virus (AAV) vectors
US20120220648A1 (en) * 2011-02-24 2012-08-30 National Taiwan University Hospital Method of Treating Aromatic L-Amino Acid Decarboxylase (AADC) Deficiency Using Adeno-Associated Virus (AAV)-AADC Vector
US9763999B2 (en) 2011-11-09 2017-09-19 Cedars-Sinai Medical Center Transcription factor-based generation of pacemaker cells and methods of using same
WO2014070949A1 (en) 2012-11-01 2014-05-08 California Institute Of Technology Reversible gene expression
DK2948174T3 (en) 2013-01-28 2019-12-02 Univ New York Methods of treatment using atoxic neurotoxin derivatives
CN104419685B (en) * 2013-09-03 2017-01-11 中国检验检疫科学研究院 Human alpha-serum albumin monoclonal antibody as well as preparation method and application thereof
US10344285B2 (en) 2014-04-09 2019-07-09 Dna2.0, Inc. DNA vectors, transposons and transposases for eukaryotic genome modification
EP3670660A1 (en) 2014-04-09 2020-06-24 Dna Twopointo Inc. Enhanced nucleic acid constructs for eukaryotic gene expression
SG11201608395PA (en) 2014-04-10 2016-11-29 Seattle Children S Hospital Dba Seattle Children S Res Inst Defined composition gene modified t-cell products
EP3230457B1 (en) 2014-12-09 2021-06-30 New York University Clostridial neurotoxin fusion proteins, propeptide fusions, their expression, and use
US11458167B2 (en) 2015-08-07 2022-10-04 Seattle Children's Hospital Bispecific CAR T-cells for solid tumor targeting
CA3045665A1 (en) 2016-12-12 2018-06-21 Seattle Children's Hospital (dba Seattle Children's Research Institute) Chimeric transcription factor variants with augmented sensitivity to drug ligand induction of transgene expression in mammalian cells
US10898585B2 (en) 2017-04-14 2021-01-26 Ptc Therapeutics .Inc. Gene therapy for AADC deficiency
CN110202840B (en) * 2019-07-04 2024-02-02 中碳实业(珠海)有限公司 Intelligent air towel wetting machine

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5512421A (en) * 1991-02-19 1996-04-30 The Regents Of The University Of California Generation, concentration and efficient transfer of VSV-G pseudotyped retroviral vectors
US5665567A (en) * 1992-08-27 1997-09-09 Beiersdorf Ag Preparation of heterodimeric PDGF-AB using a bicistronic vector system in mammalian cells
US6479284B1 (en) * 1998-03-13 2002-11-12 Dana-Farber Cancer Institute, Inc. Humanized antibody and uses thereof

Family Cites Families (85)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4560655A (en) 1982-12-16 1985-12-24 Immunex Corporation Serum-free cell culture medium and process for making same
US4657866A (en) 1982-12-21 1987-04-14 Sudhir Kumar Serum-free, synthetic, completely chemically defined tissue culture media
CA1220733A (en) 1982-12-27 1987-04-21 Shigeru Takahashi Method for promoting fusion of plant protoplast
US4713339A (en) 1983-01-19 1987-12-15 Genentech, Inc. Polycistronic expression vector construction
US4816567A (en) 1983-04-08 1989-03-28 Genentech, Inc. Recombinant immunoglobin preparations
US4767704A (en) 1983-10-07 1988-08-30 Columbia University In The City Of New York Protein-free culture medium
US5149645A (en) 1984-06-04 1992-09-22 Rijksuniversiteit Leiden Process for introducing foreign DNA into the genome of plants
US5168062A (en) 1985-01-30 1992-12-01 University Of Iowa Research Foundation Transfer vectors and microorganisms containing human cytomegalovirus immediate-early promoter-regulatory DNA sequence
GB8516415D0 (en) 1985-06-28 1985-07-31 Celltech Ltd Culture of animal cells
US5139941A (en) * 1985-10-31 1992-08-18 University Of Florida Research Foundation, Inc. AAV transduction vectors
US5583024A (en) 1985-12-02 1996-12-10 The Regents Of The University Of California Recombinant expression of Coleoptera luciferase
US4927762A (en) 1986-04-01 1990-05-22 Cell Enterprises, Inc. Cell culture medium with antioxidant
US4946778A (en) 1987-09-21 1990-08-07 Genex Corporation Single polypeptide chain binding molecules
ES2039474T3 (en) 1986-12-05 1993-10-01 Ciba-Geigy Ag IMPROVED PROCEDURE FOR THE TRANSFORMATION OF PLANT PROTOPLASTS.
JP3095752B2 (en) 1987-07-15 2000-10-10 アメリカ合衆国 Second generation monoclonal antibodies with binding specificity for TAG-72 and human carcinoma and methods of using the same
US5892019A (en) 1987-07-15 1999-04-06 The United States Of America, As Represented By The Department Of Health And Human Services Production of a single-gene-encoded immunoglobulin
US4937190A (en) 1987-10-15 1990-06-26 Wisconsin Alumni Research Foundation Translation enhancer
US5591624A (en) 1988-03-21 1997-01-07 Chiron Viagene, Inc. Retroviral packaging cell lines
WO1990003430A1 (en) 1988-09-23 1990-04-05 Cetus Corporation Cell culture medium for enhanced cell growth, culture longevity and product expression
US5993813A (en) 1988-10-19 1999-11-30 The Dow Chemical Company Family of high affinity, modified antibodies for cancer treatment
US5215904A (en) 1989-01-27 1993-06-01 Wisconsin Alumni Research Foundation Method for producing a recombinant mammal in vivo
US5225347A (en) 1989-09-25 1993-07-06 Innovir Laboratories, Inc. Therapeutic ribozyme compositions and expression vectors
AU648523B2 (en) 1989-10-25 1994-04-28 Salk Institute For Biological Studies, The Receptor infection assay
US5364783A (en) 1990-05-14 1994-11-15 Massachusetts Institute Of Technology Retrovirus promoter-trap vectors
WO1992001070A1 (en) 1990-07-09 1992-01-23 The United States Of America, As Represented By The Secretary, U.S. Department Of Commerce High efficiency packaging of mutant adeno-associated virus using amber suppressions
US5817491A (en) 1990-09-21 1998-10-06 The Regents Of The University Of California VSV G pseusdotyped retroviral vectors
US5122469A (en) 1990-10-03 1992-06-16 Genentech, Inc. Method for culturing Chinese hamster ovary cells to improve production of recombinant proteins
US6027722A (en) 1990-10-25 2000-02-22 Nature Technology Corporation Vectors for gene transfer
US5173414A (en) 1990-10-30 1992-12-22 Applied Immune Sciences, Inc. Production of recombinant adeno-associated virus vectors
AU655299B2 (en) 1991-06-21 1994-12-15 Ludwig Institute For Cancer Research Ltd A novel receptor-type tyrosine kinase and use thereof
DE69233628T2 (en) * 1991-07-25 2007-04-26 Biogen Idec Inc., San Diego Recombinant antibodies for human therapy
EP0598029B1 (en) 1991-08-07 2002-02-27 ANDERSON, W., French Retroviral vectors containing internal ribosome entry sites
US6319707B1 (en) 1991-08-12 2001-11-20 Fred Hutchinson Cancer Research Center Board Regents Of The University Of Washington Cap-independent multicistronic retroviral vectors
US6368862B1 (en) 1991-08-12 2002-04-09 Fred Hutchinson Cancer Research Center Polymerase I promoter plasmid and vector constructs
AU663101B2 (en) * 1991-08-13 1995-09-28 Wisconsin Milk Marketing Board DNA sequence encoding bovine alpha-lactalbumin and methods of use
JP3534749B2 (en) 1991-08-20 2004-06-07 アメリカ合衆国 Adenovirus-mediated gene transfer into the gastrointestinal tract
US5994074A (en) 1991-11-18 1999-11-30 Cold Spring Harbor Laboratories Human cdc25 genes, encoded products and uses thereof
US5670113A (en) 1991-12-20 1997-09-23 Sibia Neurosciences, Inc. Automated analysis equipment and assay method for detecting cell surface protein and/or cytoplasmic receptor function using same
GB9211539D0 (en) * 1992-06-01 1992-07-15 Ducost Eng Ltd Control of paint spraying machines and the like
MX9304769A (en) 1992-08-05 1994-05-31 Max Planck Gesellschaft PTP-D SUBFAMILY OF THYROSINE PROTEIN PHOSPHATASES.
US5589375A (en) 1992-10-06 1996-12-31 Max-Planck-Gesellschaft Zur Forderung Der Wissenschaften E. V. PTP 1D: a novel protein tyrosine phosphatase
WO1994024870A1 (en) * 1993-01-20 1994-11-10 Biotransplant, Inc. Retroviral vectors capable of expressing multimeric proteins from multiple translational initiation sites
AT401526B (en) 1993-02-10 1996-09-25 Scheirer Winfried REAGENT SOLUTION TO STABILIZE LUMINESCENCE IN LUCIFERASE MEASUREMENT
US5459036A (en) 1993-03-19 1995-10-17 Max-Planck-Gesellschaft Zur Forderung Der Wissenschaften E.V. Extracellular signal-regulated kinase, sequences, and methods of production and use
US5585233A (en) 1993-03-23 1996-12-17 Max-Planck Gesellschaft Zur Forderung Der Wissenschaften E.V. PTP-S31: A novel protein tyrosine phosphatase
US6001581A (en) 1993-06-04 1999-12-14 Sibia Neurosciences, Inc. Cation-based bioassay using human metabotropic glutamate receptors
US6051427A (en) 1993-06-11 2000-04-18 Cell Genesys, Inc. Method for production of high titer virus and high efficiency retroviral mediated transduction of mammalian cells
US5834256A (en) 1993-06-11 1998-11-10 Cell Genesys, Inc. Method for production of high titer virus and high efficiency retroviral mediated transduction of mammalian cells
WO1995001095A1 (en) 1993-06-30 1995-01-12 Board Of Supervisors Of Louisiana State University And Agricultural And Mechanical College Transformed eukaryotic cells, and transposon-based transformation vectors
US5534423A (en) 1993-10-08 1996-07-09 Regents Of The University Of Michigan Methods of increasing rates of infection by directing motion of vectors
FR2716459B1 (en) 1994-02-22 1996-05-10 Univ Paris Curie Host-vector system usable in gene therapy.
US5677170A (en) 1994-03-02 1997-10-14 The Johns Hopkins University In vitro transposition of artificial transposons
US6013517A (en) 1994-05-09 2000-01-11 Chiron Corporation Crossless retroviral vectors
WO1996004925A1 (en) 1994-08-12 1996-02-22 Immunomedics, Inc. Immunoconjugates and humanized antibodies specific for b-cell lymphoma and leukemia cells
US5874540A (en) * 1994-10-05 1999-02-23 Immunomedics, Inc. CDR-grafted type III anti-CEA humanized mouse monoclonal antibodies
DE4439662C2 (en) 1994-11-07 1997-10-02 Max Planck Gesellschaft Method for determining the activity of phosphotyrosine phosphatases and for identifying effectors thereof
US5843742A (en) 1994-12-16 1998-12-01 Avigen Incorporated Adeno-associated derived vector systems for gene delivery and integration into target cells
US5518911A (en) 1995-01-06 1996-05-21 Onyx Pharmaceuticals, Inc. Human PAK65
US5922601A (en) 1995-01-19 1999-07-13 Biotransplant, Inc. High efficiency gene trap selection of regulated genetic loci
US5686120A (en) * 1995-05-22 1997-11-11 Wisconsin Alumni Research Foundation Pre-mRNA processing enhancer and method for intron-independent gene expression
US5721121A (en) 1995-06-06 1998-02-24 Genentech, Inc. Mammalian cell culture process for producing a tumor necrosis factor receptor immunoglobulin chimeric protein
US6013516A (en) 1995-10-06 2000-01-11 The Salk Institute For Biological Studies Vector and method of use for nucleic acid delivery to non-dividing cells
DK0776895T3 (en) 1995-11-20 1999-06-23 Lilly Co Eli Protein kinase C inhibitor
US5976853A (en) 1996-03-21 1999-11-02 New York University Medical Center Growth factor inducible serine/threonine phosphatase FIN13
US5814507A (en) 1996-05-24 1998-09-29 Genentech, Inc. κ/μ-like protein tyrosine phosphatase, PTP λ
US5965443A (en) 1996-09-09 1999-10-12 Wisconsin Alumni Research Foundation System for in vitro transposition
US6025192A (en) 1996-09-20 2000-02-15 Cold Spring Harbor Laboratory Modified retroviral vectors
US6255071B1 (en) 1996-09-20 2001-07-03 Cold Spring Harbor Laboratory Mammalian viral vectors and their uses
US5976796A (en) 1996-10-04 1999-11-02 Loma Linda University Construction and expression of renilla luciferase and green fluorescent protein fusion genes
US6004791A (en) 1996-11-13 1999-12-21 Max-Planck-Gesellschaft Zur Forderung Der Wissenschaften E.V. Protein tyrosine phosphatase PTP20 and related products and methods
DE69739351D1 (en) 1996-12-16 2009-05-20 Eisai R&D Man Co Ltd DNA construct comprising a drug resistance gene and a foreign gene
US5968800A (en) 1997-02-07 1999-10-19 Merck & Co., Inc. Cyclin-dependent protein kinase
US6080912A (en) 1997-03-20 2000-06-27 Wisconsin Alumni Research Foundation Methods for creating transgenic animals
US5876946A (en) 1997-06-03 1999-03-02 Pharmacopeia, Inc. High-throughput assay
US6074859A (en) 1997-07-08 2000-06-13 Kikkoman Corporation Mutant-type bioluminescent protein, and process for producing the mutant-type bioluminescent protein
WO1999005295A1 (en) 1997-07-25 1999-02-04 Thomas Jefferson University Composition and method for targeted integration into cells
KR100228493B1 (en) 1997-08-30 1999-11-01 윤종용 Method for controlling transmission power in asymmetric digital subscriber line system
US6136597A (en) 1997-09-18 2000-10-24 The Salk Institute For Biological Studies RNA export element
US5994136A (en) 1997-12-12 1999-11-30 Cell Genesys, Inc. Method and means for producing high titer, safe, recombinant lentivirus vectors
US6034228A (en) 1998-12-15 2000-03-07 Zeneca Limited Human signal transduction serine/threonine kinase
AU753907B2 (en) 1998-02-24 2002-10-31 Her Majesty In Right Of Canada As Represented By The Minister Of Agriculture And Agri-Food Canada Trans-somatics with gene transfer into mammary epithelial cells
ATE318922T1 (en) 1998-07-01 2006-03-15 Inst Virologie Teilrechtsfaehi TARGETED INTEGRATION IN CHROMOSOMES USING RETROVIRAL VECTORS
US6013455A (en) 1998-10-15 2000-01-11 Incyte Pharmaceuticals, Inc. Protein kinase homologs
GB9825524D0 (en) 1998-11-20 1999-01-13 Oxford Biomedica Ltd Vector
JP2002538830A (en) 1999-03-15 2002-11-19 カイロン コーポレイション Retrovirus-producing cells that utilize high multiplicity of transduction

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5512421A (en) * 1991-02-19 1996-04-30 The Regents Of The University Of California Generation, concentration and efficient transfer of VSV-G pseudotyped retroviral vectors
US5665567A (en) * 1992-08-27 1997-09-09 Beiersdorf Ag Preparation of heterodimeric PDGF-AB using a bicistronic vector system in mammalian cells
US6479284B1 (en) * 1998-03-13 2002-11-12 Dana-Farber Cancer Institute, Inc. Humanized antibody and uses thereof

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030224408A1 (en) * 2002-03-07 2003-12-04 Dyax Corporation Ligand screening and discovery
US7244592B2 (en) * 2002-03-07 2007-07-17 Dyax Corp. Ligand screening and discovery
US20090105083A1 (en) * 2002-03-07 2009-04-23 Dyax Corp., A Delaware Corporation Ligand Screening and Discovery
US8445410B2 (en) 2002-03-07 2013-05-21 Dyax Corp. Ligand screening and discovery
US8846579B2 (en) 2002-03-07 2014-09-30 Dyax Corp. Ligand screening and discovery
US9562902B2 (en) 2002-03-07 2017-02-07 Dyax Corp. Ligand screening and discovery
US7384738B2 (en) 2002-03-28 2008-06-10 Bremel Robert D Retrovirus-based genomic screening
US11466258B2 (en) * 2013-10-11 2022-10-11 The Schepens Eye Research Institute, Inc. Methods of predicting ancestral virus sequences and uses thereof
US20200190524A1 (en) * 2014-04-09 2020-06-18 Dna2.0, Inc. Dna vectors, transposons and transposases for eukaryotic genome modification
WO2019079772A1 (en) * 2017-10-20 2019-04-25 Fred Hutchinson Cancer Research Center Systems and methods to produce b cells genetically modified to express selected antibodies
CN111479916A (en) * 2017-10-20 2020-07-31 弗莱德哈钦森癌症研究中心 Systems and methods for generating B cells genetically modified to express selected antibodies
US11578118B2 (en) 2017-10-20 2023-02-14 Fred Hutchinson Cancer Center Systems and methods to produce B cells genetically modified to express selected antibodies

Also Published As

Publication number Publication date
AU7025201A (en) 2002-01-14
EP1297168A2 (en) 2003-04-02
CA2413156C (en) 2009-08-18
AU2001270252B2 (en) 2007-02-08
US7378273B2 (en) 2008-05-27
CA2413156A1 (en) 2002-01-10
WO2002002783A3 (en) 2003-01-16
JP4762478B2 (en) 2011-08-31
WO2002002783A2 (en) 2002-01-10
KR20030074584A (en) 2003-09-19
KR100545945B1 (en) 2006-01-25
US20050060762A1 (en) 2005-03-17
JP2004524807A (en) 2004-08-19

Similar Documents

Publication Publication Date Title
CA2413156C (en) Expression vectors
US20030092882A1 (en) Host cells containing multiple integrating vectors
US20040235173A1 (en) Production of host cells containing multiple integrating vectors by serial transduction
CA2413151C (en) Host cells containing multiple integrating vectors
US5770428A (en) Chimeric retrovial expression vectors and particles containing a simple retroviral long terminal repeat, BLV or HIV coding regions and cis-acting regulatory sequences, and an RNA translational enhancer with internal ribsome entry site
US6808922B1 (en) Retroviral vectors comprising a functional splice donor site and a functional splice acceptor site
AU2001270252A1 (en) Expression vectors
CA2066053C (en) Recombinant retroviruses delivering vector constructs to target cells
CN106520832B (en) Bicistronic expression vector, expression system, preparation method and application
JP2014110789A (en) Bovine immunodeficiency virus-based vector
US20060160220A1 (en) Retroviral vectors with introns
EP1572912B1 (en) Selection free growth of host cells containing multiple integrating vectors
CN107058387A (en) A kind of two-cistron expression vector of suitable HEK293 cells and preparation method thereof, expression system, application
US20050221429A1 (en) Host cells containing multiple integrating vectors comprising an amplifiable marker
US20040033604A1 (en) Recombinant lentiviral vectors pseudotyped in envelopes containing filovirus binding domains
WO2001083730A2 (en) Recombinant lentiviral vectors pseudotyped in envelopes containing filovirus binding domains
KR20230058081A (en) A nucleic acid encoding a polypeptide comprising a modified FC region of human IgG1 and at least one heterologous antigen.
AU2007214314A1 (en) Host cells containing multiple integrating vectors

Legal Events

Date Code Title Description
AS Assignment

Owner name: GALA DESIGN, LLC, WISCONSIN

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:BLECK, GREGORY T.;REEL/FRAME:012556/0952

Effective date: 20011016

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION