[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

WO2001060854A1 - Novel members of the h+/oligopeptide transporter gene family - Google Patents

Novel members of the h+/oligopeptide transporter gene family Download PDF

Info

Publication number
WO2001060854A1
WO2001060854A1 PCT/US2001/004799 US0104799W WO0160854A1 WO 2001060854 A1 WO2001060854 A1 WO 2001060854A1 US 0104799 W US0104799 W US 0104799W WO 0160854 A1 WO0160854 A1 WO 0160854A1
Authority
WO
WIPO (PCT)
Prior art keywords
hphtl
nucleic acid
hpht2
protein
cell
Prior art date
Application number
PCT/US2001/004799
Other languages
French (fr)
Other versions
WO2001060854A8 (en
Inventor
Wolfgang Sadee
Christopher W. Botka
Original Assignee
The Regents Of The University Of California
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by The Regents Of The University Of California filed Critical The Regents Of The University Of California
Priority to AU2001238290A priority Critical patent/AU2001238290A1/en
Publication of WO2001060854A1 publication Critical patent/WO2001060854A1/en
Publication of WO2001060854A8 publication Critical patent/WO2001060854A8/en

Links

Classifications

    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11CSTATIC STORES
    • G11C13/00Digital stores characterised by the use of storage elements not covered by groups G11C11/00, G11C23/00, or G11C25/00
    • G11C13/0002Digital stores characterised by the use of storage elements not covered by groups G11C11/00, G11C23/00, or G11C25/00 using resistive RAM [RRAM] elements
    • G11C13/0009RRAM elements whose operation depends upon chemical change
    • G11C13/0014RRAM elements whose operation depends upon chemical change comprising cells based on organic memory material
    • G11C13/0019RRAM elements whose operation depends upon chemical change comprising cells based on organic memory material comprising bio-molecules
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B82NANOTECHNOLOGY
    • B82YSPECIFIC USES OR APPLICATIONS OF NANOSTRUCTURES; MEASUREMENT OR ANALYSIS OF NANOSTRUCTURES; MANUFACTURE OR TREATMENT OF NANOSTRUCTURES
    • B82Y10/00Nanotechnology for information processing, storage or transmission, e.g. quantum computing or single electron logic
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/705Receptors; Cell surface antigens; Cell surface determinants
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11CSTATIC STORES
    • G11C13/00Digital stores characterised by the use of storage elements not covered by groups G11C11/00, G11C23/00, or G11C25/00
    • G11C13/0002Digital stores characterised by the use of storage elements not covered by groups G11C11/00, G11C23/00, or G11C25/00 using resistive RAM [RRAM] elements
    • G11C13/0009RRAM elements whose operation depends upon chemical change
    • G11C13/0014RRAM elements whose operation depends upon chemical change comprising cells based on organic memory material

Definitions

  • This invention relates to the field of oligopeptide transporters and drug transport.
  • this invention relates to the discovery of new H+/oligopeptide transporters and their use in drug delivery applications.
  • peptides are transported in and out of cells by several different transport carriers. Functionally, there are transporters responsible for the influx of peptides into the cell and transporters responsible for the efflux of peptides out of the cells. Influx transporters transport small peptides and related compounds into the cytoplasm, and are indirectly linked to an energy source through ion gradients. Efflux transporters consist of several different transporters that function to remove peptides from the cytoplasm. These include the P-glycoprotein that removes a number of oncolytics as well as hydrophobic peptides (Endicott and Ling (1989) Annu. Rev. Biochem. 58:137-171; Sharma et al. (1992) J. Biol. Chem. 267: 5731-5734).
  • the present invention relates to peptide transporters responsible for influx of peptides into cells or organelles.
  • Peptide transporters are often located in the gastrointestinal tract, kidney, placenta, and liver lysosomes (Ganapathy et al. (1991) Indian J. Biochem. Biophys. 28: 317-323; Skopicki et al. (1991) Am. J. Physiol. 261: F670-F678; Ganapathy et al. (1981) J. Biol. Chem. 256: 118-124; Bird and Lloyd (1990) Biochim. Biophys. Acta 1024:267-270).
  • the main intestinal H+/dipeptide transpoter protein, PepTl is thought to play a critical role in oral bioavailablity of peptide-like drugs (Dantzig and Bergin (1990) Biochim. Biophy. Acta, 1027: 211-217; Matsumoto et al. (1994) J. Pharmacol. Exp. Ther., 270: 498-504; Wenzel et al. (1996) J. Pharmacol. Exp. Ther., 211 ⁇ 831-839; Kramer et al. (1990) Biochim. Biophys. Acta, 1027: 25-30; Fei et al. (1994) Nature, 68: 563-566; Thwaites et al. (1993) J. Biol.
  • hPepTl is a member of a well defined small gene family, the proton-dependent oligopeptide transporters (POT, also referred to as PTR), with ancestral roots that can be traced to bacterial, fungal, and plant peptide transporters (Graul and Sadee (1997) Pharm. Res., 14: 388-400; Fei and Leibach (1998) Prog. Nucleic Acid Res. Mol. Biol, 58: 239-261; Steiner et al. (1995) Mol. Microbiol, 16: 825-834).
  • POT proton-dependent oligopeptide transporters
  • This class of secondary active transporters has broad selectivity for di- and tripeptides, whereas ability to transport longer peptides decreases with increasing length.
  • Substrates include important drug classes such as ⁇ - lactam and cephalosporin antibiotics, rennin inhibitors, ACE inhibitors, and 5' nucleoside esters of amino acids, such as valcyclovir (Han et al. (1998) Pharm. Res., 15: 1154-1159). Despite these advances, it is desirable to identify other peptide transporters to improve uptake of various drugs or prodrugs, to improve tissue specificity for particular drugs, and the like.
  • This invention provides new human proton/oligopeptide transporter (POT) genes and uses thereof. Nucleic acid sequences, amino acid sequences, and primers sufficient to amplify transporter nucleic acid and/or probes specific to these nucleic acids and/or splice variants thereof are provided herein. The new transporters are identified as hPHTl and hPHT2. . Thus, in one embodiment, this invention provides an isolated nucleic acid encoding a proton -coupled peptide transporter.
  • the nucleic acid comprises a nucleic acid or the complement of a nucleic acid selected from the group consisting of: a nucleic acid that specifically hybridizes to hPHTl (e.g.
  • SEQ ID NO: 24 or hPHT2 under stringent conditions and that encodes a proton-coupled peptide transporter; a nucleic acid that has 90% or greater sequence identity with hPHTl or hPHT2 and that encodes a proton-coupled peptide transporter; a nucleic acid that encodes an hPHTl peptide transporter protein or an hPHT2 peptide transporter protein; an hPHTl or hPHT2 splice variant; a nucleic acid that is amplified using primers of SEQ ID NO:l and SEQ ID NO:2, and human intestinal DNA as a template; a nucleic acid that is amplified using primers of SEQ ID NO:3 and SEQ ID NO:4, and human intestinal DNA as a template; a nucleic acid that hybridizes under stringent conditions to a nucleic acid amplified using primers of SEQ ID NO: 1 and SEQ ID NO:2, and human intestinal DNA as a template, where the
  • polypeptides encoded by any of these nucleic acids include, but are not limited to, polypeptides comprising a proton-coupled peptide transporter. Also provided are fragments of such polypeptides that comprise one or more epitopes specifically recognized by an antibody that specifically binds to the hPHTl and/or hPHT2 transporters. Also provided are antibodies (complete, fragments, or single chain) that specifically bind the hPHTl and/or hPHT2 transporters.
  • This invention also provides cells that are transfected with one or more of the nucleic acids described herein.
  • Particularly preferred cells express a heterologous peptide transporter (e.g. an hPHTl and/or hPHT2 transporter).
  • the cells preferably include any vertebrate cell and include somatic cells or oocytes.
  • the cells may be transfected with either a DNA or an RNA and, in this context, transfection includes essentially any method of introducing a nucleic acid into a cell (e.g. electroporation, microinjection, lipid complex, etc.).
  • this invention provides a computer readable medium having recorded thereon one or more of the nucleotide sequences described herein.
  • Particularly preferred media also include an identification of the sequences as transporters or as encoding transporters, or as components of transporters or as components of nucleic acids encoding transporters, or an association to a reference or medium identifying the sequences as encoding transporters.
  • Virtually any computer-readable medium is suitable including, but not limited to a floppy disc, a hard disc, a CD disc, a DVD disc, a random access memory (RAM), a read-only memory (ROM), and a flash memory.
  • the medium can be a component of a nucleic acid and/or peptide synthesizer or compatible with (e.g. able to provide sequence information to) a nucleic acid and/or a peptide synthesizer.
  • This invention also provides assays for identifying a compound whose cellular uptake is mediated by an hPHTl or hPHT2 peptide transporter.
  • the assays preferably involve i)contacting a cell expressing a peptide transporter selected from the group consisting of an hPHTl transporter, and an hPHT2 transporter with a test compound; and ii) detecting uptake of the test compound by the cell where elevated uptake of the compound by the cell as compared to a cell expressing the peptide transporter at a lower level indicates that said peptide transporter mediates transport of said test compound.
  • the cell can be a cell expressing an endogenous hPHTl and/or hPHT2 and/or a cell transfected with a vector that encodes the hPHTl or hPHT2 peptide transporter.
  • the cell is preferably a vertebrate somatic cell or a vertebrate oocyte.
  • amphibian oocytes e.g., Xenopus oocytes
  • mammalian somatic cells e.g. heart cells, intestinal cells, etc.
  • the compounds screened may include virtually any compound, however, in preferred embodiments, the compound is a small organic molecule, more preferably a drug or a prodrug.
  • the peptide transporter is hPHTl and/or hPHT2 and the tissue is heart.
  • This invention additionally provides methods of identifying agent(s) that modulate expression or activity of an hPHTl and/or hPHT2 peptide transporter.
  • the methods involve contacting a cell comprising a gene encoding an hPHTl and/or an hPHT2 peptide transporter with a test agent; and detecting the expression level or activity of hPHTl or hPHT2 peptide transporter(s) where a difference in expression level or activity of hPHTl or hPHT2 as compared to the expression level, or activity, of hPHTl or hPHT2 in a cell contacted with a different amount of said agent indicates that said agent modulates expression, or activity, of the hPHTl peptide transporter or the hPHT2 peptide transporter.
  • the detecting comprises detecting an hPHTl or hPHT2 nucleic acid (e.g. DNA, mRNA, cDNA, etc.), and/or an hPHTl or hPHT2 protein, and/or transport activity of an hPHTl or hPHT2 protein.
  • detecting comprises detecting an hPHTl mRNA or an hPHT2 mRNA (e.g. by hybridizing said mRNA to a probe that specifically hybridizes to an hPHT2 or to an hPHTl nucleic acid).
  • hybridization detection methods include, but are not limited to a Northern blot, a Southern blot using DNA derived from the hPHTl or hPHT2 RNA, an array hybridization, an affinity chromatography, and an in situ hybridization.
  • the hybridization probe can be a single probe or a plurality of probes, e.g. a member of a plurality of probes that forms an array of probes.
  • the level of hPHTl mRNA or hPHT2 RNA is measured using a nucleic acid amplification reaction.
  • detecting comprises detecting an hPHTl protein or an hPHT2 protein (e.g.
  • the cell contacted with the different amount of said agent is a negative control that is not contacted with said agent or the cell contacted with said different amount of the agent is a positive control that is contacted with a greater amount of the agent.
  • the cell is preferably a human somatic cell (e.g. a human heart cell, a human intestinal cell, etc.).
  • this invention provides a method of prescreening for an agent that agent that modulates expression or activity of an hPHTl peptide transporter or an hPHT2 peptide transporter.
  • the method involves contacting an hPHTl or hPHT2 nucleic acid (or fragment thereof) or an hPHTl or hPHT2 protein (or fragment thereof) with a test agent; and detecting specific binding of said test agent to said hPHTl or hPHT2 protein or nucleic acid.
  • the method can further involve recording test agents that specifically bind to said hPHTl or hPHT2 nucleic acid or protein in a database of candidate agents that alter peptide transporter activity.
  • the test agent is not an antibody and/or not a protein, and/or not a nucleic acid.
  • Preferred test agents are small organic smolecules.
  • kits comprising a container containing one or more of the nucleic acids and/or proteins and/or cells, and/or antibodies described herein.
  • the kits optionally further comprise instructional materials providing protocols for the assays described herein.
  • isolated refers to material which is substantially or essentially free from components which normally accompany it as found in its native state.
  • an isolated nucleic acid is typically free of the nucleic acid sequences by which it is flanked in nature.
  • An isolated nucleic acid can be reintroduced into a cell and such "heterologous” nucleic acids are regarded herein as isolated.
  • nucleic acids synthesized de novo or produced by cloning are also regarded as “isolated”.
  • polypeptide polypeptide
  • peptide protein
  • protein protein
  • amino acid polymers in which one or more amino acid residue is an artificial chemical analogue of a corresponding naturally occurring amino acid, as well as to naturally occurring amino acid polymers.
  • amino acid polymers in which one or more amino acid residue is an artificial chemical analogue of a corresponding naturally occurring amino acid, as well as to naturally occurring amino acid polymers.
  • the term also includes variants on the traditional peptide linkage joining the amino acids making up the polypeptide.
  • nucleic acid or “oligonucleotide” or grammatical equivalents herein refer to at least two nucleotides covalently linked together.
  • a nucleic acid of the present invention is preferably single-stranded or double stranded and will generally contain phosphodiester bonds, although in some cases, as outlined below, nucleic acid analogs are included that may have alternate backbones, comprising, for example, phosphoramide (Beaucage et al. (1993) Tetrahedron 49(10): 1925) and references therein; Letsinger (1970) J. Org. Chem. 35:3800; Sblul et al. (1977) Eur. J. Biochem.
  • Nucleic acids containing one or more carbocyclic sugars are also included within the definition of nucleic acids (see Jenkins et al. (1995), Chem. Soc. Rev. ppl69-176).
  • nucleic acid analogs are described in Rawls, C & E News June 2, 1997 page 35. These modifications of the ribose-phosphate backbone may be done to facilitate the addition of additional moieties such as labels, or to increase the stability and half-life of such molecules in physiological environments.
  • heterologous as it relates to nucleic acid sequences such as coding sequences and control sequences, denotes sequences that are not normally associated with a region of a recombinant construct, and/or are not normally associated with a particular cell.
  • a heterologous region of a nucleic acid construct is an identifiable segment of nucleic acid within or attached to another nucleic acid molecule that is not found in association with the other molecule in nature.
  • a heterologous region of a construct could include a coding sequence flanked by sequences not found in association with the coding sequence in nature.
  • heterologous coding sequence is a construct where the coding sequence itself is not found in nature (e.g., synthetic sequences having codons different from the native gene).
  • a host cell transformed with a construct which is not normally present in the host cell would be considered heterologous for purposes of this invention.
  • non-naturally occurring in reference to a cell, refers to a cell that has a non-naturally occurring nucleic acid or a non-naturally occurring peptide or is fused to a cell to which it is not fused with in nature.
  • non-naturally occurring nucleic acid refers to a portion of genomic nucleic acid, cDNA, semi-synthetic nucleic acid, or a synthetic origin nucleic acid which, by virtue of its origin or manipulation is not associated with all the nucleic acid with which it is associated in nature, or is linked to a nucleic acid or other chemical agent other than that to which it is linked in nature, or is not present in nature.
  • a non-naturally occurring peptide refers to a portion of a large naturally occurring peptide or protein, or semi-synthetic or synthetic peptide, which by virtue of its origin or manipulation is not associated with all of a peptide with which it is associated in nature, or is linked to peptides, functional groups or chemical agents other than that to which it is linked in nature, or is present in a purity that is not present in nature, or does not occur in nature.
  • proton refers to a hydrogen ion
  • transporter refers to a composition that participates in the movement of a substrate across a cellular membrane.
  • a "proton-coupled peptide transporter” transports peptides across cellular membranes, which transport is linked or coupled to the transport of a proton or hydrogen ion across the same membrane.
  • the transporter is a protein encoded by a nucleic acid comprising one or more of the nucleic acids described herein, more preferably a PHT1 or PHT2 nucleic acid.
  • nucleic acids and peptides refers to amino acids of a peptide in an order derived from the sequence of a nucleic acid or the complement of the nucleic acid.
  • an "antibody” refers to a protein consisting of one or more polypeptides substantially encoded by imrnunoglobulin genes or fragments of immunoglobulin genes.
  • the recognized imrnunoglobulin genes include the kappa, lambda, alpha, gamma, delta, epsilon and mu constant region genes, as well as myriad immunoglobulin variable region genes.
  • Light chains are classified as either kappa or lambda.
  • Heavy chains are classified as gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, IgG, IgM, IgA, IgD and IgE, respectively.
  • a typical immunoglobulin (antibody) structural unit is known to comprise a tetramer.
  • Each tetramer is composed of two identical pairs of polypeptide chains, each pair having one "light” (about 25 kD) and one "heavy” chain (about 50-70 kD).
  • the N-terminus of each chain defines a variable region of about 100 to 110 or more amino acids primarily responsible for antigen recognition.
  • the terms variable light chain (V L ) and variable heavy chain (V H ) refer to these light and heavy chains respectively.
  • Antibodies exist as intact immunoglobulins or as a number of well characterized fragments produced by digestion with various peptidases.
  • pepsin digests an antibody below the disulfide linkages in the hinge region to produce F(ab)' 2 , a dimer of Fab which itself is a light chain joined to V H -C H 1 by a disulfide bond.
  • the F(ab)' 2 may be reduced under mild conditions to break the disulfide linkage in the hinge region thereby converting the (Fab') dimer into a Fab' monomer.
  • the Fab' monomer is essentially a Fab with part of the hinge region (see, Fundamental Immunology, W.E. Paul, ed., Raven Press, N.Y. (1993), for a more detailed description of other antibody fragments).
  • antibody as used herein also includes antibody fragments either produced by the modification of whole antibodies or synthesized de novo using recombinant DNA methodologies.
  • Preferred antibodies include single chain antibodies (antibodies that exist as a single polypeptide chain), more preferably single chain Fv antibodies (sFv or scFv) in which a variable heavy and a variable light chain are joined together (directly or through a peptide linker) to form a continuous polypeptide.
  • the single chain Fv antibody is a covalently linked V H -V L heterodimer which may be expressed from a nucleic acid including V H - and V - encoding sequences either joined directly or joined by a peptide-encoding linker.
  • the first functional antibody molecules to be expressed on the surface of filamentous phage were single-chain Fv's (scFv), however, alternative expression strategies have also been successful.
  • Fab molecules can be displayed on phage if one of the chains (heavy or light) is fused to g3 capsid protein and the complementary chain exported to the periplasm as a soluble molecule.
  • the two chains can be encoded on the same or on different replicons; the important point is that the two antibody chains in each Fab molecule assemble post-translationally and the dimer is incorporated into the phage particle via linkage of one of the chains to, e.g., g3p (see, e.g., U.S. Patent No: 5733743).
  • scFv antibodies and a number of other structures converting the naturally aggregated, but chemically separated light and heavy polypeptide chains from an antibody V region into a molecule that folds into a three dimensional structure substantially similar to the structure of an antigen-binding site are known to those of skill in the art (see e.g., U.S. Patent Nos. 5,091,513, 5,132,405, and 4,956,778).
  • Particularly preferred antibodies should include all that have been displayed on phage (e.g., scFv, Fv, Fab and disulfide linked Fv (Reiter et al. (1995) Protein Eng. 8: 1323- 1331).
  • binding preference e.g., affinity for the target molecule/sequence is at least 2 fold, more preferably at least 5 fold, and most preferably at least 10 or 20 fold over a nonspecific (e.g. randomly generated molecule lacking the specifically recognized amino acid or amino acid sequence) target molecule.
  • hybridizing specifically to or “specific hybridization” or “selectively hybridize to” refer to the binding, duplexing, or hybridizing of a nucleic acid molecule preferentially to a particular nucleotide sequence under stringent conditions when that sequence is present in a complex mixture (e.g., total cellular) DNA or RNA.
  • stringent conditions refers to conditions under which a probe will hybridize preferentially to its target subsequence, and to a lesser extent to, or not at all to, other sequences.
  • Stringent hybridization and “stringent hybridization wash conditions” in the context of nucleic acid hybridization experiments such as Southern and northern hybridizations are sequence dependent, and are different under different environmental parameters. An extensive guide to the hybridization of nucleic acids is found in Tijssen
  • An example of stringent hybridization conditions for hybridization of complementary nucleic acids which have more than 100 complementary residues on a filter in a Southern or northern blot is 50% formamide with 1 mg of heparin at 42DC, with the hybridization being carried out overnight.
  • An example of highly stringent wash conditions is 0.15 M NaCI at 72°C for about 15 minutes.
  • An example of stringent wash conditions is a 0.2x SSC wash at 65°C for 15 minutes (see, Sambrook et al. (1989) Molecular Cloning - A Laboratory Manual (2nd ed.) Vol. 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor Press, NY, (Sambrook et al.) supra for a description of SSC buffer).
  • a high stringency wash is preceded by a low stringency wash to remove background probe signal.
  • An example medium stringency wash for a duplex of, e.g., more than 100 nucleotides, is lx SSC at 45 ⁇ C for 15 minutes.
  • An example low stringency wash for a duplex of, e.g., more than 100 nucleotides, is 4-6x SSC at 40 DC for 15 minutes.
  • a signal to noise ratio of 2x (or higher) than that observed for an unrelated probe in the particular hybridization assay indicates detection of a specific hybridization. Nucleic acids which do not hybridize to each other under stringent conditions are still substantially identical if the polypeptides which they encode are substantially identical. This occurs, e.g., when a copy of a nucleic acid is created using the maximum codon degeneracy permitted by the genetic code.
  • stringent conditions are characterized by hybridization in 1 M NaCI, 10 mM Tris-HCl, pH 8.0, 0.01% Triton X-100, 0.1 mg/ml fragmented herring sperm DNA with hybridization at 45°C with rotation at 50 RPM followed by washing first in 0.9 M NaCI, 0.06 M NaH 2 PO 4 , 0.006 M EDTA, 0.01% Tween-20 at 45°C for 1 hr, followed by 0.075 M NaCI, 0.005 M NaH 2 PO 4 , 0.5 mM EDTA at 45°C for 15 minutes.
  • nucleic acids or polypeptide sequences refer to two or more sequences or subsequences that are the same or have a specified percentage of amino acid residues or nucleotides that are the same, when compared and aligned for maximum correspondence, as measured using one of the following sequence comparison algorithms or by visual inspection.
  • substantially identical in the context of two nucleic acids or polypeptides, refers to two or more sequences or subsequences that have at least 60%, preferably 80%, most preferably 90-95% nucleotide or amino acid residue identity, when compared and aligned for maximum correspondence, as measured using one of the following sequence comparison algorithms or by visual inspection.
  • the substantial identity exists over a region of the sequences that is at least about 50 residues in length, more preferably over a region of at least about 100 residues, and most preferably the sequences are substantially identical over at least about 150 residues.
  • the sequences are substantially identical over the entire length of the coding regions.
  • sequence comparison typically one sequence acts as a reference sequence, to which test sequences are compared.
  • test and reference sequences are input into a computer, subsequence coordinates are designated, if necessary, and sequence algorithm program parameters are designated.
  • sequence comparison algorithm then calculates the percent sequence identity for the test sequence(s) relative to the reference sequence, based on the designated program parameters.
  • Optimal alignment of sequences for comparison can be conducted, e.g., by the local homology algorithm of Smith & Waterman, Adv. Appl. Math. 2:482 (1981), by the homology alignment algorithm of Needleman & Wunsch, J. Mol. Biol. 48:443 (1970), by the search for similarity method of Pearson & Lipman (1988) Proc.
  • PILEUP creates a multiple sequence alignment from a group of related sequences using progressive, pairwise alignments to show relationship and percent sequence identity. It also plots a tree or dendogram showing the clustering relationships used to create the alignment. PILEUP uses a simplification of the progressive alignment method of Feng & Doolittle (1987) J. Mol. Evol. 35:351-360.
  • the method used is similar to the method described by Higgins & Sharp (1989) CABIOS 5: 151-153.
  • the program can align up to 300 sequences, each of a maximum length of 5,000 nucleotides or amino acids.
  • the multiple alignment procedure begins with the pairwise alignment of the two most similar sequences, producing a cluster of two aligned sequences. This cluster is then aligned to the next most related sequence or cluster of aligned sequences.
  • Two clusters of sequences are aligned by a simple extension of the pairwise alignment of two individual sequences.
  • the final alignment is achieved by a series of progressive, pairwise alignments.
  • the program is run by designating specific sequences and their amino acid or nucleotide coordinates for regions of sequence comparison and by designating the program parameters. For example, a reference sequence can be compared to other test sequences to determine the percent sequence identity relationship using the following parameters: default gap weight (3.00), default gap length weight (0.10), and weighted end gaps.
  • HSPs high scoring sequence pairs
  • initial neighborhood word hits act as seeds for initiating searches to find longer HSPs containing them.
  • the word hits are then extended in both directions along each sequence for as far as the cumulative alignment score can be increased. Cumulative scores are calculated using, for nucleotide sequences, the parameters M (reward score for a pair of matching residues; always > 0) and N (penalty score for mismatching residues; always ⁇ 0). For amino acid sequences, a scoring matrix is used to calculate the cumulative score. Extension of the word hits in each direction are halted when: the cumulative alignment score falls off by the quantity X from its maximum achieved value; the cumulative score goes to zero or below, due to the accumulation of one or more negative- scoring residue alignments; or the end of either sequence is reached.
  • the BLAST algorithm parameters W, T, and X determine the sensitivity and speed of the alignment.
  • the BLASTP program uses as defaults a wordlength (W) of 3, an expectation (E) of 10, and the BLOSUM62 scoring matrix (see Henikoff & Henikoff (1989) Proc. Natl. Acad. Sci. USA 89:10915).
  • the BLAST algorithm In addition to calculating percent sequence identity, the BLAST algorithm also performs a statistical analysis of the similarity between two sequences (see, e.g., Karlin & Altschul (1993) Proc. Natl. Acad. Sci. USA ,90: 5873-5787).
  • One measure of similarity provided by the BLAST algorithm is the smallest sum probability (P(N)), which provides an indication of the probability by which a match between two nucleotide or amino acid sequences would occur by chance.
  • P(N) the smallest sum probability
  • a nucleic acid is considered similar to a reference sequence if the smallest sum probability in a comparison of the test nucleic acid to the reference nucleic acid is less than about 0.1, more preferably less than about 0.01, and most preferably less than about 0.001.
  • small organic molecule refers to a molecule of a size comparable to those organic molecules generally used in pharmaceuticals.
  • Preferred small organic molecules range in size up to about 5000 Da, more preferably up to 2000 Da, and most preferably up to about 1000 Da.
  • Figure IA show the . hPHTl EST map.
  • Figure IB shows the alignments of the ESTs with the main contig.
  • Figure IC shows a schematic of hPHTl splice variants. The sequence missing in splice variant B is identified. Identical sequence between splice Variant A & A' is identified. Unique sequence in splice variant A' and in splice variant A is indicated.
  • Figure ID shows splice variant(hPHTl).
  • Figure IE shows splice variant A(hPHTlvA) with alignment to hPHTl.
  • Figure IF shows splice variant A'(hPHTlvA') with alignment to hPHTl.
  • Figure 1G shows splice variant B(hPHTlvB) with alignment to hPHTl.
  • Figure 2 illustrates the structure of the hPHT2 g. Sequences of exons are provided in sequence listing (SEQ ID NOS: 9-16). Sequences of introns are provided in sequence listing (SEQ ID NOS: 17-23).
  • Figure 3A provides multiple sequence alignments for PHT branch. (Click alignment to see PDF).
  • Figure 3B provides multiple sequence alignments for human PHT+PepT branch. Black: 50% or higher identity; Gray: conservative and similar substitutions.
  • Figure 4A provides hydropathy plots (Kyte Doolittle method) for hPHTl
  • FIG. 4B shows topo diagrams for hPHTl and hPHT2.
  • Figure 5A shows RT-PCR analysis of hPHTl extracted from human small intestine tissue sample.
  • Figure 5B shows Northern blot analysis of hPHTl, hPHT2, and actin expression in human tissues.
  • Figure 6 shows the structure and splicing of the hPEPTl gene.
  • hPEPTl and hPEPTl-RF completely share exons 4-6 and partially share exons 3 and 7, where the alternative splice sites are located (indicated by the vertical lines).
  • Exon T represents a repetitive element. Arrows indicate the translation start and stop sites.
  • the Exonic sequences are shown in the sequence listings as SEQ ID Nos: 11 through 16 (hpeptl-rf exon 3 through hpeptl-rf exon 7').
  • Figure 7 illustrates the membrane topology prediction for hPEPTl. The prediction was carried out using the TOPPRED program. The exon boundaries are indicated by alternating shading.
  • Figure 8 illustrates the membrane topology prediction for hPEPTl- RF. The prediction was carried out using the TOPPRED program. The exon boundaries are indicated by alternating shading.
  • Figure 9 shows the putative promoter region of the human PEPT1 gene. Sequence of nucleotides upstream of the translations (f)(ATG)is shown. The numbering starts (+1) from the transcription start site (f ) to the negative values in the promoter region. The sites for the transcription factors in the promoter region are underlined and the corresponding indicated.
  • This invention provides new members of the h+/oligopeptide transporter gene family.
  • the new genes designated herein as hHPTl and hHPT2.
  • the new genes appear to be members of the POT family of peptide transporters.
  • the human POT family appears to contain at least four genes encoding peptide transporters and, without being bound to a particular theory, it is believed that each is likely to display a distinct pattern of tissue expression.
  • Tissue distribution of POT gene expression is of particular interest for achieving oral bioavailability or for targeting drugs to tumor tissues.
  • each member of the peptide transporter family is believed to exhibit some selectivity for peptides, peptoid drugs, and other agents.
  • hPepTl is highly expressed in pancreatic and colon adenocarcinomas, including liver metastases considerably above the level seen in surrounding normal tissues.
  • substrates for hPepTl can be "specifically" delivered to these tissues.
  • drugs e.g. drugs, prodrugs, etc.
  • hPHTl or hPHT2 e.g. drugs, prodrugs, etc.
  • drugs or prodrugs can be engineered with domains/sites, preferentially transported by hPHTl or hPHT2 and thereby enhance availability of these agents to various tissues.
  • agents that modulate e.g. up-regulate or downregulate expression or activity of hPHTl or hPHTl.
  • agents that modulate can be administered along with drugs transported by hPHTl or hPHT2 transporters to either enhance availability of the drug (e.g. upregulate hPHTl or hPHT2 expression) or diminish availability of the drug (e.g. down-regulate hPHTl or hPHT2) to tissues harboring hPHTl or hPHT2 genes.
  • gene therapy methods can be used to specifically deliver and express hPHTl or hPHT2 to preselected target tissues and thereby increase the availability of an hPHTl or hPHT2 transported agent to that tissue.
  • Nucleic acids encoding H + /oligopeptide transporters proteases The nucleic acid and amino acid sequences of hPHTl and hPHT2 and primers sufficient to amplify the nucleic acid sequences are provided herein (see. e.g. SEQ ID NO: 17 for the full hPHTl sequence). It is noted that there are two splice variants of hPHTl. The sequence listing aligns the splice variants with each other. (A vs B). Another splice variant is actually a sequence obtained from a PCR amplification product using cDNAs from skeletal muscle.
  • PCR run gave a band with the expected sequence (509 bps of hPHTl), and a faster moving band (PCR-2) with 169 bps missing (probable frameshift mutation). Its alignment with hPHTl is given in the sequence listing as well.
  • nucleic acids encoding the full length peptide transporters or fragments of such nucleic acids are prepared using standard methods well known to those of skill in the art.
  • the nucleic acid(s) may be cloned, or amplified by in vitro methods, such as the polymerase chain reaction (PCR), the ligase chain reaction (LCR), the transcription-based amplification system (TAS), the self-sustained sequence replication system (SSR), etc.
  • PCR polymerase chain reaction
  • LCR ligase chain reaction
  • TAS transcription-based amplification system
  • SSR self-sustained sequence replication system
  • the transporter DNA, or their subsequences are to be used as nucleic acid probes, it is often desirable to label the nucleic acids with detectable labels.
  • the labels may be incorporated by any of a number of means well known to those of skill in the art.
  • the label is simultaneously incorporated during the amplification step in the preparation of the sample nucleic acids.
  • PCR polymerase chain reaction
  • transcription amplification using a labeled nucleotide incorporates a label into the transcribed nucleic acids.
  • a label may be added directly to an original nucleic acid sample (e.g., mRNA, polyA mRNA, cDNA, etc.) or to the amplification product after the amplification is completed.
  • Means of attaching labels to nucleic acids are well known to those of skill in the art and include, for example nick translation or end-labeling (e.g. with a labeled RNA) by kinasing of the nucleic acid and subsequent attachment (ligation) of a nucleic acid linker joining the sample nucleic acid to a label (e.g., a fluorophore). Suitable labels are described below.
  • proton/oligopeptide transporters Cloning and expression of proton/oligopeptide transporters. It is desirable express the proton/oligopeptide transporters of this invention in heterologous cells for use in the assays described herein. In addition, it is also useful to express the transporter proteins, or fragments thereof, to generate antibodies for a variety of applications (e.g. determining transporter expression level, etc.).
  • hPHTl and or hPHT2 polypeptides and various fragments thereof can be conveniently produced using synthetic chemical processes or recombinant expression methodologies.
  • the transporter polypeptides of this invention or fragments thereof may be synthesized using standard chemical peptide synthesis techniques. Where the desired subsequences are relatively short (e.g., when a particular antigenic determinant is desired) the molecule may be synthesized as a single contiguous polypeptide. Where larger molecules are desired, subsequences can be synthesized separately (in one or more units) and then fused by condensation of the amino terminus of one molecule with the carboxyl terminus of the other molecule thereby forming a peptide bond.
  • Solid phase synthesis in which the C-terminal amino acid of the sequence is attached to an insoluble support followed by sequential addition of the remaining amino acids in the sequence is the preferred method for the chemical synthesis of the polypeptides of this invention.
  • Techniques for solid phase synthesis are described by Barany and Merrifield, Solid-Phase Peptide Synthesis; pp. 3-284 in The Peptides: Analysis, Synthesis, Biology. Vol. 2: Special Methods in Peptide Synthesis, Part A., Merrifield, et al. (1963) J. Am. Chem. Soc, 85: 2149-2156, and Stewart et al. (1984) Solid Phase Peptide Synthesis, 2nd ed. Pierce Chem. Co., Rockford, 111.
  • the transporter proteins of this invention are synthesized using recombinant expression systems. Generally this involves creating a DNA sequence that encodes the desired protein, placing the DNA in an expression cassette under the control of a particular promoter, and expressing the protein in a host cell. The host cell can then be used in the assays described herein. Alternatively, were isolated transporter proteins are desired, the expressed transporter can be recovered from the cell.
  • DNA encoding the transporter proteins described herein can be prepared by any suitable method as described above, including, for example, cloning and restriction of appropriate sequences or direct chemical synthesis by methods such as the phosphotriester method of Narang et al. (1979) Meth. Enzymol. 68: 90-99; the phosphodiester method of Brown et ⁇ /.(1979) Meth. Enzymol. 68: 109-151; the diethylphosphoramidite method of Beaucage et al. (1981) Tetra. Lett, 22: 1859-1862; and the solid support method of U.S. Patent No. 4,458,066.
  • Chemical synthesis produces a single stranded oligonucleotide. This may be converted into double stranded DNA by hybridization with a complementary sequence, or by polymerization with a DNA polymerase using the single strand as a template.
  • a complementary sequence or by polymerization with a DNA polymerase using the single strand as a template.
  • One of skill would recognize that while chemical synthesis of DNA is limited to sequences of about 100 bases, longer sequences may be obtained by the ligation of shorter sequences. Alternatively, subsequences may be cloned and the appropriate subsequences cleaved using appropriate restriction enzymes. The fragments may then be ligated to produce the desired DNA sequence.
  • the nucleic acids of this invention can be cloned using DNA amplification methods such as polymerase chain reaction (PCR).
  • PCR polymerase chain reaction
  • the nucleic acid sequence or subsequence is PCR amplified, using a sense primer containing one restriction site (e.g., Ndel) and an antisense primer containing another restriction site (e.g., HindlTJ).
  • a sense primer containing one restriction site e.g., Ndel
  • an antisense primer containing another restriction site e.g., HindlTJ
  • This nucleic acid can then be easily ligated into a vector that can be transfected into an appropriate host cell (e.g. an oocyte, a mammalian somatic cell, etc.)
  • Suitable PCR primers can be determined by one of skill in the art using the sequence information provided herein and representative primers are illustrated herein as well.
  • Appropriate restriction sites can also be added to the nucleic acid encoding the transporter protein or protein subsequence by site-directed mutagenesis.
  • the nucleic acid sequences encoding human transporter proteins or protein subsequences may be expressed in a variety of host cells, including E. coli, other bacterial hosts, yeast, and various higher eukaryotic cells such as the COS, CHO and HeLa cells lines and myeloma cell lines, and various vertebrate oocytes (e.g. Xenopus oocytes).
  • the recombinant protein gene will be operably linked to appropriate expression control sequences for each host cell.
  • this includes a promoter such as the T7, trp, or lambda promoters, a ribosome binding site and preferably a transcription termination signal.
  • control sequences will include a promoter and often an enhancer (e.g., an enhancer derived from immunoglobulin genes, SV40, cytomegalo virus, etc.), and a polyadenylation sequence, and may include splice donor and acceptor sequences.
  • an enhancer e.g., an enhancer derived from immunoglobulin genes, SV40, cytomegalo virus, etc.
  • a polyadenylation sequence may include splice donor and acceptor sequences.
  • the vectors of the invention can be transferred into the chosen host cell by well-known methods such as calcium chloride transformation for E. coli and calcium phosphate treatment, microinjection, or electroporation for vertebrate cells.
  • Cells transformed by the plasmids can be selected by resistance to antibiotics conferred by genes contained on the plasmids, such as the amp, gpt, neo and hyg genes.
  • the recombinant hPHTl and/or hPHT2 protein(s) can be purified according to standard procedures of the art, including ammonium sulfate precipitation, affinity columns, column chromatography, gel electrophoresis and the like (see, generally, R. Scopes, (1982) Protein Purification, Springer- Verlag, N.Y.; Deutscher (1990) Methods in Enzymology Vol. 182: Guide to Protein Purification., Academic Press, Inc. N.Y.). Substantially pure compositions of at least about 90 to 95% homogeneity are preferred, and 98 to 99% or more homogeneity are most preferred. Once purified, partially or to homogeneity as desired, the polypeptides may then be used (e.g., as immunogens for antibody production).
  • modifications can be made to the transporter proteins without diminishing their biological activity. Some modifications may be made to facilitate the cloning, expression, or incorporation of the targeting molecule into a fusion protein. Such modifications are well known to those of skill in the art and include, for example, a methionine added at the amino terminus to provide an initiation site, or additional amino acids (e.g., poly His) placed on either terminus to create conveniently located restriction sites or termination codons or purification sequences.
  • compositions transported by hPHTl or hPHT2 transporters are transported by hPHTl or hPHT2 transporters. Having discovered new human peptide transporters, it is possible to screen for compounds specifically transported by these transporters. Other transporters have been shown to transport a wide variety of compositions in addition to peptides and/or amino acids. Such compounds include, but are not limited to, antibiotics (including several oral .beta.- lactams), oral angiotensin converting enzyme (ACE) inhibitors, oral renin inhibitors and the like. Thus, the transporters of this invention can readily be utilized in a screening system to identify molecules that they transport.
  • antibiotics including several oral .beta.- lactams
  • ACE angiotensin converting enzyme
  • oral renin inhibitors include, but are not limited to, antibiotics (including several oral .beta.- lactams), oral angiotensin converting enzyme (ACE) inhibitors, oral renin inhibitors and the like.
  • such assays involve expressing the transporters of this invention in a cell contacting the cell with the agent(s) it is desired to screen for the ability to be transported by the transporters of this invention and detecting and/or quantifying the amount of the agent(s) that are transported into the cell.
  • the amount of transported agent can be compared to the amount of that agent transported by cells lacking the transporter and/or to the amount of an agent known not to be transported by the transporters of this invention (negative controls). Preferred embodiments, can also include a comparison to the amount of an agent transported by the cells where it is known that that agent is transported by the transporters of this invention (positive controls).
  • the assay is typically scored as positive where there is a difference between the amount of test agent(s) transported and the negative control(s), preferably where the difference is statistically significant (e.g. at greater than 80%, preferably greater than about 90%, more preferably greater than about 98%, and most preferably greater than about 99% confidence level).
  • Cells suitable for such screening systems preferably include vertebrate cells (e.g., amphibian cells, mammalian cells, etc.) and, in certain embodiments, more preferably include mammalian cells of the tissue to which it is ultimately desired to deliver the test agent(s).
  • the cells may be cells of heart tissue.
  • the assays can be convenientlyl run using oocytes. It is possible to simply inject mRNAs encoding the transporters of these cells into oocytes where they are expressed thereby providing a convenient system for the cellular assay. While the present invention contemplates the use of oocytes isolated from any non-human vertebrate organism, preferred embodiments of the assay feature amphibian oocytes, particularly oocytes which are approximately the same size, or larger, than oocytes which can be isolated from frog species of the genus Xenopus, e.g. Xenopus laevis. In general, the larger oocytes are preferred for ease of manipulation. Furthermore, expression of recombinant proteins and cell culturing techniques are each better characterized for amphibian oocytes, and a greater diversity of expression vectors are available for these systems.
  • Xenopus oocytes can be harvested from female Xenopus laevis and processed using published techniques (Coleman et al, eds., Transcription and Translation: A Practical Approach. IRL Press, pp. 271-302; and Williams et al. (1988) Proc. Natl. Acad. Sci., USA, 85: 4939-4943).
  • preparation of the assay includes obtaining oocytes from the excised ovaries of female frogs anesthetized by hypothermia and from which follicle cells have been removed by treatment with collagenase.
  • Oocytes at a particular stage e.g. Dumont stage V, can be selected and microinjected with the mRNA to be tested, e.g. for in vitro transcribed RNA ("cRNA").
  • cRNA in vitro transcribed RNA
  • Isolation of other suitable oocytes can be, as a matter of course, carried out by one of ordinary skill in the art.
  • techniques routinely used in generation of transgenic animals such as protocols for inducing superovulation and isolating fertilized eggs from various mammals (e.g. mice, rabbits, rats, sheep, goats or pigs) can be slightly modified (i.e. no fertilization step) in order to allow for isolation of mammalian oocytes for use in the subject method (see, e.g., U.S. Pat. No. 4,994,384).
  • protocols exist for in vitro maturation of mammalian oocytes such as mature metaphase II oocytes.
  • telomeres Several methods for expressing recombinant proteins in oocytes (and other cells) are generally known in the art.
  • expression of the recombinant protein(s) to be tested in the subject assay can be carried out by microinjection of cRNA encoding the protein, or by microinjection (or by other form of transfection) of an expression vector encoding the protein of interest. Either method can be carried out by employing the basics of expression cloning strategies known in the art.
  • cDNA libraries are cloned into vectors that can be used for in vitro RNA synthesis.
  • the pCS2+/- vector contains SP6, T7 and T3 promoters that have been introduced upstream and downstream of a cloning site in order to permit in vitro RNA synthesis upon linearization of the plasmid.
  • a plasmid containing the cDNA to be tested can be linearized by cutting downstream from the cDNA insert with a restriction enzyme. The post-restriction digest is digested with Proteinase K and then extracted with two phenol: chloroform (1:1) extractions. The resulting DNA fragments are then ethanol precipitated.
  • the precipitated fragments are mixed with either T3 RNA polymerase (to make sense strand), or T7 RNA polymerase (to make anti-sense strand), plus rATP, rCTP, rGTP, rUTP, and RNase inhibitor. Simultaneously, capped RNA can be produced in vitro (Krieg and Melton, (1987) Meth Enzymol 155: 397-415; and Richardson et al. (1988) Bio/Technology 6: 565-570).
  • Other exemplary vectors useful in the subject assay include: the pSP64T vector (Kreig et al.
  • a marker gene in the oocyte may be desirable to co-express a marker gene in the oocyte in order to standardize the comparison of effects based on level of expression occurring in the oocytes.
  • an ⁇ -amylase gene construct can be provided in the oocyte, and the amylase activity measured in the oocyte (Urnes et al. (1990) Gene 95: 267-274.
  • the level of expression for other proteins can therefore be standardized based on the amount of recombinant amylase produced.
  • Dose response curves can be constructed based on the level of expression of the amylase reporter in the oocyte.
  • the cell expressing the peptide transporter(s) of this invention can be contacted with the agent(s) to be screened and the amount of agent that is internalized is detected.
  • The is routinely accomplished by either measuring depletion of the agent in the media contacting the cell or measuring the amount of the agent internalized by the cell.
  • the test agent(s) are labeled with a detectable label to facilitate their detection in the subject cell and/or media.
  • Detectable labels suitable for use in the present invention include any composition detectable by spectroscopic, photochemical, biochemical, immunochemical, electrical, optical or chemical means.
  • Useful labels in the present invention include biotin for staining with labeled streptavidin conjugate, magnetic beads (e.g., Dynabeads ), fluorescent dyes (e.g., fluorescein, texas red, rhodamine, green fluorescent protein, and the like, see, e.g., Molecular Probes, Eugene, Oregon, USA), radiolabels (e.g., 3 H, 125 1, 35 S, 14 C, or 32 P), enzymes (e.g., horse radish peroxidase, alkaline phosphatase and others commonly used in an ELISA), and colorimetric labels such as colloidal gold (e.g., gold particles in the 40 -80 nm diameter size range scatter green light with high efficiency) or colored glass or plastic (e.g., polystyrene, poly
  • Patents teaching the use of such labels include U.S. Patent Nos. 3,817,837; 3,850,752; 3,939,350; 3,996,345; 4,277,437; 4,275,149; and 4,366,241. Fluorescent labels, colorimetric labels, and radiolabels are particularly preferred.
  • assays are intended to be illustrative and not limiting. Using the teachings provided herein, other variations of such assays will be apparent to one of skill in the art. Such variations include, but are not limited to,, the use of a tissues and/or cells that express endogenous transporters of this invention in the assays described above. It is also noted that assays for screening of uptake of test agents by various peptide transporters is described in U.S. Patents 6,020,479 , 5,919,699, and 5,919,628.
  • this invention is premised, in part, on the discovery of new h+/oligopeptide transporter (e.g. hPHTl and/or hPHT2).
  • hPHTl and/or hPHT2 agents that downregulate expression of decrease the bioavailability of compounds internalized by these receptors, while agents that upregulate hPHTl or hPHT2 increase the bioavailability of compounds internalized by these transporters.
  • this invention provides methods of screening for agents that modulate expression and/or activity.
  • the methods involve detecting the expression level and/or activity level of hPHTl or hPHT2 genes or gene products (e.g. hPHTl or hPHT2 mRNA or proteins) in the presence of the agent(s) in question.
  • a reduced hPHTl or hPHT2 expression level or activity level in the presence of the agent as compared to a negative control where the test agent is absent or at reduced concentration indicates that the agent downregulates hPHTl or hPHT2 activity or expression.
  • hPHTl or hPHT2 expression level or activity level in the presence of the agent as compared to a negative control where the test agent is absent or at reduced concentration indicates that the agent up-regulates hPHTl or hPHT2 activity or expression
  • Expression levels of a gene can be altered by changes in the transcription of the gene product (i.e. transcription of mRNA), and/or by changes in translation of the gene product (i.e. translation of the protein), and/or by post-translational modification(s) (e.g. protein folding, glycosylation, etc.).
  • preferred assays of this invention include assaying for level of transcribed mRNA (or other nucleic acids derived from the hPHTl or hPHT2 genes), level of translated protein, activity of translated protein, etc. Examples of such approaches are described below.
  • Changes in expression level can be detected by measuring changes in hPHTl and/or hPHT2 genomic DNA or a nucleic acid derived from the genomic DNA (e.g., hPHTl or hPHT2 mRNA, reverse-transcribed cDNA, etc.).
  • a nucleic acid sample for such analysis.
  • the nucleic acid is found in or derived from a biological sample.
  • biological sample refers to a sample obtained from an organism or from components (e.g., cells) of an organism. The sample may be of any biological tissue or fluid. Biological samples may also include organs or sections of tissues such as frozen sections taken for histological purposes.
  • the nucleic acid (e.g., mRNA or a nucleic acid derived from an mRNA) is, in certain preferred embodiments, isolated from the sample according to any of a number of methods well known to those of skill in the art. Methods of isolating mRNA are well known to those of skill in the art. For example, methods of isolation and purification of nucleic acids are described in detail in by Tijssen ed., (1993) Chapter 3 of Laboratory Techniques in Biochemistry and Molecular Biology: Hybridization With Nucleic Acid Probes, Part I. Theory and Nucleic Acid Preparation, Elsevier, N.Y. and Tijssen ed.
  • the "total" nucleic acid is isolated from a given sample using, for example, an acid guanidinium-phenol-chloroform extraction method and polyA+ mRNA is isolated by oligo dT column chromatography or by using (dT)n magnetic beads (see, e.g., Sambrook et al, Molecular Cloning: A Laboratory Manual (2nd ed.), Vols. 1-3, Cold Spring Harbor Laboratory, (1989), or Current Protocols in Molecular Biology, F. Ausubel et al., ed. (1987) Greene Publishing and Wiley-Interscience, New York).
  • PCR polymerase chain reaction
  • LCR ligase chain reaction
  • the nucleic acid sample is one in which the concentration of the hPHTl and/or hPHT2 mRNA transcript(s), or the concentration of the nucleic acids derived from the hPHTl and/or hPHT2 mRNA transcript(s), is proportional to the transcription level (and therefore expression level) of that gene.
  • the hybridization signal intensity be proportional to the amount of hybridized nucleic acid.
  • the proportionality be relatively strict (e.g., a doubling in transcription rate results in a doubling in mRNA transcript in the sample nucleic acid pool and a doubling in hybridization signal), one of skill will appreciate that the proportionality can be more relaxed and even non-linear. Thus, for example, an assay where a 5 fold difference in concentration of the target mRNA results in a 3 to 6 fold difference in hybridization intensity is sufficient for most purposes.
  • the hPHTl and/or bPHT2-containing nucleic acid sample is the total mRNA or a total cDNA isolated and/or otherwise derived from a biological sample.
  • the nucleic acid may be isolated from the sample according to any of a number of methods well known to those of skill in the art as indicated above.
  • hPHTl and/or hPHT2 Using the known sequence of hPHTl and/or hPHT2 (see sequence listing) detecting and/or quantifying the hPHTl and/or hPHT2 transcript(s) can be routinely accomplished using nucleic acid hybridization techniques (see, e.g., Sambrook et al. supra). For example, one method for evaluating the presence, absence, or quantity of hPHTl and/or hPHT2 genomic DNA or reverse-transcribed cDNA involves a "Southern Blot". In a Southern Blot, the DNA typically fragmented and separated on an electrophoretic gel, is hybridized to a probe specific for hPHTl and/or hPHT2.
  • Comparison of the intensity of the hybridization signal from the hPHTl and/or hPHT2 probe with a "control" probe provides an estimate of the relative expression level of the target nucleic acid.
  • the hPHTl and/or hPHT2 mRNA can be directly quantified in a Northern blot.
  • the mRNA is isolated from a given cell sample using, for example, an acid guanidinium-phenol-chloroform extraction method. The mRNA is then electrophoresed to separate the mRNA species and the mRNA is transferred from the gel to a nitrocellulose membrane.
  • labeled probes are used to identify and/or quantify the target hPHTl and/or hPHT2 mRNA.
  • Appropriate controls e.g. probes to housekeeping genes provide a reference for evaluating relative expression level.
  • in situ hybridization An alternative means for determining the hPHTl and/or hPHT2 expression level is in situ hybridization.
  • In situ hybridization assays are well known (e.g., Angerer (1987) Meth. Enzymol 152: 649).
  • in situ hybridization comprises the following major steps: (1) fixation of tissue or biological structure to be analyzed; (2) prehybridization treatment of the biological structure to increase accessibility of target DNA, and to reduce nonspecific binding; (3) hybridization of the mixture of nucleic acids to the nucleic acid in the biological structure or tissue; (4) post-hybridization washes to remove nucleic acid fragments not bound in the hybridization and (5) detection of the hybridized nucleic acid fragments.
  • the reagent used in each of these steps and the conditions for use vary depending on the particular application.
  • tRNA, human genomic DNA, or Cot-1 DNA is used to block non-specific hybridization.
  • amplification-based assays can be used to measure hPHTl and/or hPHT2 expression (transcription) level.
  • the target nucleic acid sequences i.e., hPHTl and/or hPHT2
  • act as template(s) in amplification reaction(s) e.g. Polymerase Chain Reaction (PCR) or reverse-transcription PCR (RT-PCR)
  • PCR Polymerase Chain Reaction
  • RT-PCR reverse-transcription PCR
  • the amount of amplification product will be proportional to the amount of template (e.g., hPHTl and/or hPHT2 mRNA) in the original sample.
  • PCR Polymerase Chain Reaction
  • RT-PCR reverse-transcription PCR
  • Quantitative amplification involves simultaneously co-amplifying a known quantity of a control sequence using the same primers. This provides an internal standard that may be used to calibrate the PCR reaction.
  • Detailed protocols for quantitative PCR are provided in Innis et al. (1990) PCR Protocols, A Guide to Methods and Applications, Academic Press, Inc. N.Y.).
  • One approach for example, involves simultaneously co-amplifying a known quantity of a control sequence using the same primers as those used to amplify the target. This provides an internal standard that may be used to calibrate the PCR reaction.
  • One preferred internal standard is a synthetic AW106 cRNA.
  • the AW106 cRNA is combined with RNA isolated from the sample according to standard techniques known to those of skill in the art.
  • the RNA is then reverse transcribed using a reverse transcriptase to provide copy DNA.
  • the cDNA sequences are then amplified (e.g., by PCR) using labeled primers.
  • the amplification products are separated, typically by electrophoresis, and the amount of labeled nucleic acid (proportional to the amount of amplified product) is determined.
  • the amount of mRNA in the sample is then calculated by comparison with the signal produced by the known AW106 RNA (or other) standard.
  • PCR Protocols A Guide to Methods and Applications, Innis et al. (1990) Academic Press, Inc. N.Y..
  • the nucleic acid sequence(s) for hPHTl and hPHT2 provided herein are sufficient to enable one of skill to routinely select primers to amplify any portion of the gene.
  • the methods of this invention can be utilized in array- based hybridization formats.
  • Arrays are a multiplicity of different "probe” or “target” nucleic acids (or other compounds) attached to one or more surfaces (e.g., solid, membrane, or gel).
  • the multiplicity of nucleic acids (or other moieties) is attached to a single contiguous surface or to a multiplicity of surfaces juxtaposed to each other.
  • "low density" arrays can simply be produced by spotting (e.g. by hand using a pipette) different nucleic acids at different locations on a solid support (e.g. a glass surface, a membrane, etc.).
  • a solid support e.g. a glass surface, a membrane, etc.
  • Arrays can also be produced using oligonucleotide synthesis technology.
  • U.S. Patent No. 5,143,854 and PCT Patent Publication Nos. WO 90/15070 and 92/10092 teach the use of light-directed combinatorial synthesis of high density oligonucleotide arrays. Synthesis of high density arrays is also described in U.S. Patents 5,744,305, 5,800,992 and 5,445,934.
  • nucleic acid hybridization formats are known to those skilled in the art.
  • common formats include sandwich assays and competition or displacement assays.
  • assay formats are generally described in Hames and Higgins (1985) Nucleic Acid Hybridization, A Practical Approach, IRL Press; Gall and Pardue (1969) Proc. Natl. Acad. Sci. USA 63: 378-383; and John et al. (1969) Nature 223: 582-587.
  • Sandwich assays are commercially useful hybridization assays for detecting or isolating nucleic acid sequences. Such assays utilize a "capture" nucleic acid covalently immobilized to a solid support and a labeled "signal" nucleic acid in solution. The sample will provide the target nucleic acid. The "capture” nucleic acid and “signal” nucleic acid probe hybridize with the target nucleic acid to form a "sandwich” hybridization complex. To be most effective, the signal nucleic acid should not hybridize with the capture nucleic acid. Typically, labeled signal nucleic acids are used to detect hybridization.
  • Complementary nucleic acids or signal nucleic acids may be labeled by any one of several methods typically used to detect the presence of hybridized polynucleotides. The most common method of detection is the use of autoradiography with 3 H, 125 1, 35 S, 14 C, or 32 P- labelled probes or the like. Other labels include ligands that bind to labeled antibodies, fluorophores, chemi-luminescent agents, enzymes, and antibodies which can serve as specific binding pair members for a labeled ligand. Detection of a hybridization complex may require the binding of a signal generating complex to a duplex of target and probe polynucleotides or nucleic acids.
  • such binding occurs through ligand and anti-ligand interactions as between a ligand-conjugated probe and an anti-ligand conjugated with a signal.
  • the sensitivity of the hybridization assays may be enhanced through use of a nucleic acid amplification system that multiplies the target nucleic acid being detected. Examples of such systems include the polymerase chain reaction (PCR) system and the ligase chain reaction (LCR) system.
  • PCR polymerase chain reaction
  • LCR ligase chain reaction
  • Other methods recently described in the art are the nucleic acid sequence based amplification (NASBAO, Cangene, Mississauga, Ontario) and Q Beta Replicase systems.
  • Nucleic acid hybridization simply involves providing a denatured probe and target nucleic acid under conditions where the probe and its complementary target can form stable hybrid duplexes through complementary base pairing. The nucleic acids that do not form hybrid duplexes are then washed away leaving the hybridized nucleic acids to be detected, typically through detection of an attached detectable label. It is generally recognized that nucleic acids are denatured by increasing the temperature or decreasing the salt concentration of the buffer containing the nucleic acids, or in the addition of chemical agents, or the raising of the pH. Under low stringency conditions (e.g., low temperature and/or high salt and/or high target concentration) hybrid duplexes (e.g., DNA:DNA,
  • hybridization conditions may be selected to provide any degree of stringency. In a preferred embodiment, hybridization is performed at low stringency to ensure hybridization and then subsequent washes are performed at higher stringency to eliminate mismatched hybrid duplexes. Successive washes may be performed at increasingly higher stringency (e.g., down to as low as 0.25 X SSPE at 37°C to 70°C) until a desired level of hybridization specificity is obtained. Stringency can also be increased by addition of agents such as formamide. Hybridization specificity may be evaluated by comparison of hybridization to the test probes with hybridization to the various controls that can be present.
  • the wash is performed at the highest stringency that produces consistent results and that provides a signal intensity greater than approximately 10% of the background intensity.
  • the hybridized array may be washed at successively higher stringency solutions and read between each wash. Analysis of the data sets thus produced will reveal a wash stringency above which the hybridization pattern is not appreciably altered and which provides adequate signal for the particular probes of interest.
  • background signal is reduced by the use of a blocking reagent (e.g., tRNA, sperm DNA, cot-1 DNA, etc.) during the hybridization to reduce non-specific binding.
  • a blocking reagent e.g., tRNA, sperm DNA, cot-1 DNA, etc.
  • the use of blocking agents in hybridization is well known to those of skill in the art (see, e.g., Chapter 8 in P. Tijssen, supra.) Methods of optimizing hybridization conditions are well known to those of skill in the art (see, e.g., Tijssen (1993) Laboratory Techniques in Biochemistry and Molecular Biology, Vol. 24: Hybridization With Nucleic Acid Probes, Elsevier, N.Y.).
  • Optimal conditions are also a function of the sensitivity of label (e.g., fluorescence) detection for different combinations of substrate type, fluorochrome, excitation and emission bands, spot size and the like.
  • label e.g., fluorescence
  • Low fluorescence background surfaces can be used (see, e.g., Chu (1992) Electrophoresis 13:105-114).
  • the sensitivity for detection of spots ("target elements") of various diameters on the candidate surfaces can be readily determined by, e.g., spotting a dilution series of fluorescently end labeled DNA fragments. These spots are then imaged using conventional fluorescence microscopy.
  • the sensitivity, linearity, and dynamic range achievable from the various combinations of fluorochrome and solid surfaces can thus be determined.
  • Serial dilutions of pairs of fluorochrome in known relative proportions can also be analyzed. This determines the accuracy with which fluorescence ratio measurements reflect actual fluorochrome ratios over the dynamic range permitted by the detectors and fluorescence of the substrate upon which the probe has been fixed. d) Labeling and detection of nucleic acids.
  • the probes used herein for detection of hPHTl and/or hPHT2 expression levels can be full length or less than the full length of the hPHTl and/or hPHT2 mRNA. Shorter probes are empirically tested for specificity. Preferred probes are sufficiently long so as to specifically hybridize with the hPHTl and/or hPHT2 target nucleic acid(s) under stringent conditions.
  • the preferred size range is from about 20 bases to the length of the hPHTl and/or hPHT2 mRNA, more preferably from about 30 bases to the length of the hPHTl and/or hPHT2 mRNA, and most preferably from about 40 bases to the length of the hPHTl and/or hPHT2 mRNA.
  • the probes are typically labeled, with a detectable label. Detectable labels suitable for use in the present invention include any composition detectable by spectroscopic, photochemical, biochemical, immunochemical, electrical, optical or chemical means.
  • Useful labels in the present invention include biotin for staining with labeled streptavidin conjugate, magnetic beads (e.g., DynabeadsTM), fluorescent dyes (e.g., fluorescein, texas red, rhodamine, green fluorescent protein, and the like, see, e.g., Molecular Probes, Eugene, Oregon, USA), radiolabels (e.g., 3 H, 125 1, 35 S, 14 C, or 32 P), enzymes (e.g., horse radish peroxidase, alkaline phosphatase and others commonly used in an ELISA), and colorimetric labels such as colloidal gold (e.g., gold particles in the 40 -80 nm diameter size range scatter green light with high efficiency) or colored glass or plastic (e.g., polystyrene, polypropylene, latex, etc.) beads.
  • Patents teaching the use of such labels include U.S. Patent Nos. 3,817,837; 3,850
  • a fluorescent label is preferred because it provides a very strong signal with low background. It is also optically detectable at high resolution and sensitivity through a quick scanning procedure.
  • the nucleic acid samples can all be labeled with a single label, e.g., a single fluorescent label.
  • different nucleic acid samples can be simultaneously hybridized where each nucleic acid sample has a different label. For instance, one target could have a green fluorescent label and a second target could have a red fluorescent label. The scanning step will distinguish sites of binding of the red label from those binding the green fluorescent label.
  • Each nucleic acid sample (target nucleic acid) can be analyzed independently from one another.
  • Suitable chromogens which can be employed include those molecules and compounds which absorb light in a distinctive range of wavelengths so that a color can be observed or, alternatively, which emit light when irradiated with radiation of a particular wave length or wave length range, e.g., fluorescent molecules.
  • fluorescent labels should absorb light above about 300 nm, preferably about 350 nm, and more preferably above about 400 nm, usually emitting at wavelengths greater than about 10 nm higher than the wavelength of the light absorbed. It should be noted that the absorption and emission characteristics of the bound dye can differ from the unbound dye. Therefore, when referring to the various wavelength ranges and characteristics of the dyes, it is intended to indicate the dyes as employed and not the dye which is unconjugated and characterized in an arbitrary solvent. Fluorescent labels are generally preferred because by irradiating a fluorescent molecule with light, one can obtain a plurality of emissions. Thus, a single label can provide for a plurality of measurable events.
  • Detectable signal can also be provided by chemi luminescent and bioluminescent sources.
  • Chemiluminescent sources include a compound which becomes electronically excited by a chemical reaction and can then emit light which serves as the detectable signal or donates energy to a fluorescent acceptor.
  • luciferins can be used in conjunction with luciferase or lucigenins to provide bioluminescence.
  • Spin labels are provided by reporter molecules with an unpaired electron spin which can be detected by electron spin resonance (ESR) spectroscopy.
  • exemplary spin labels include organic free radicals, transitional metal complexes, particularly vanadium, copper, iron, and manganese, and the like.
  • exemplary spin labels include nitroxide free radicals.
  • the label may be added to the target (sample) nucleic acid(s) prior to, or after the hybridization.
  • direct labels are detectable labels that are directly attached to or incorporated into the target (sample) nucleic acid prior to hybridization.
  • indirect labels are joined to the hybrid duplex after hybridization.
  • the indirect label is attached to a binding moiety that has been attached to the target nucleic acid prior to the hybridization.
  • the target nucleic acid may be biotinylated before the hybridization. After hybridization, an avidin-conjugated fluorophore will bind the biotin bearing hybrid duplexes providing a label that is easily detected.
  • Fluorescent labels are easily added during an in vitro transcription reaction.
  • fluorescein labeled UTP and CTP can be incorporated into the RNA produced in an in vitro transcription.
  • the labels can be attached directly or through a linker moiety.
  • the site of label or linker-label attachment is not limited to any specific position.
  • a label may be attached to a nucleoside, nucleotide, or analogue thereof at any position that does not interfere with detection or hybridization as desired.
  • certain Label-On Reagents from Clontech provide for labeling interspersed throughout the phosphate backbone of an oligonucleotide and for terminal labeling at the 3' and 5' ends.
  • labels can be attached at positions on the ribose ring or the ribose can be modified and even eliminated as desired.
  • the base moieties of useful labeling reagents can include those that are naturally occurring or modified in a manner that does not interfere with the purpose to which they are put.
  • Modified bases include but are not limited to 7-deaza A and G, 7-deaza-8-aza A and G, and other heterocyclic moieties.
  • fluorescent labels are not to be limited to single species organic molecules, but include inorganic molecules, multi -molecular mixtures of organic and/or inorganic molecules, crystals, heteropolymers, and the like.
  • CdSe-CdS core-shell nanocrystals enclosed in a silica shell can be easily derivatized for coupling to a biological molecule (Bruchez et al.
  • alterations in expression of hPHTl and/or hPHT2 transporters can be detected and/or quantified by detecting and/or quantifying the amount and/or activity of translated hPHTl and or hPHT2 polypeptide or fragments thereof.
  • the polypeptide(s) encoded by the hPHTl and/or hPHT2 gene(s) can be detected and quantified by any of a number of methods well known to those of skill in the art. These may include analytic biochemical methods such as electrophoresis, capillary electrophoresis, high performance liquid chromatography (HPLC), thin layer chromatography (TLC), hyperdiffusion chromatography, and the like, or various immunological methods such as fluid or gel precipitin reactions, immunodiffusion (single or double), immunoelectrophoresis, radioimmunoassay (RIA), enzyme-linked immunosorbent assays (ELISAs), immunofluorescent assays, western blotting, and the like.
  • analytic biochemical methods such as electrophoresis, capillary electrophoresis, high performance liquid chromatography (HPLC), thin layer chromatography (TLC), hyperdiffusion chromatography, and the like
  • various immunological methods such as fluid or gel precipitin reactions, immuno
  • the hPHTl and/or hPHT2 polypeptide(s) are detected quantified in an electrophoretic protein separation (e.g. a 1- or 2-dimensional electrophoresis).
  • electrophoretic protein separation e.g. a 1- or 2-dimensional electrophoresis.
  • Means of detecting proteins using electrophoretic techniques are well known to those of skill in the art (see generally, R. Scopes (1982) Protein Purification, Springer- Verlag, N.Y.; Deutscher, (1990) Methods in Enzymology Vol. 182: Guide to Protein Purification, Academic Press, Inc., N.Y.).
  • Western blot (immunoblot) analysis is used to detect and quantify the presence of polypeptide(s) of this invention in the sample.
  • This technique generally comprises separating sample proteins by gel electrophoresis on the basis of molecular weight, transferring the separated proteins to a suitable solid support, (such as a nitrocellulose filter, a nylon filter, or derivatized nylon filter), and incubating the sample with the antibodies that specifically bind the target polypeptide(s).
  • the antibodies specifically bind to the target polypeptide(s) and may be directly labeled or alternatively may be subsequently detected using labeled antibodies (e.g., labeled sheep anti-mouse antibodies) that specifically bind to the a domain of the antibody.
  • labeled antibodies e.g., labeled sheep anti-mouse antibodies
  • an immunoassay is an assay that utilizes an antibody to specifically bind to the analyte (e.g., the target polypeptide(s)).
  • the immunoassay is thus characterized by detection of specific binding of a polypeptide of this invention to an antibody as opposed to the use of other physical or chemical properties to isolate, target, and quantify the analyte.
  • Immunological binding assays typically utilize a "capture agent" to specifically bind to and often immobilize the analyte (hPHTl and/or hPHT2 polypeptide).
  • the capture agent is an antibody.
  • Immunoassays also often utilize a labeling agent to specifically bind to and label the binding complex formed by the capture agent and the analyte.
  • the labeling agent may itself be one of the moieties comprising the antibody/analyte complex.
  • the labeling agent may be a labeled polypeptide or a labeled antibody that specifically recognizes the already bound target polypeptide.
  • the labeling agent may be a third moiety, such as another antibody, that specifically binds to the capture agent /polypeptide complex.
  • proteins capable of specifically binding immunoglobulin constant regions such as protein A or protein G may also be used as the label agent. These proteins are normal constituents of the cell walls of streptococcal bacteria. They exhibit a strong non- immunogenic reactivity with immunoglobulin constant regions from a variety of species (see, generally Kronval, et al. (1973) J. Immunol, 111: 1401-1406, and Akerstrom (1985) J. Immunol, 135: 2589-2542).
  • Preferred immunoassays for detecting the target polypeptide(s) are either competitive or noncompetitive.
  • Noncompetitive immunoassays are assays in which the amount of captured analyte is directly measured.
  • the capture agents can be bound directly to a solid substrate where they are immobilized. These immobilized antibodies then capture the target polypeptide present in the test sample. The target polypeptide thus immobilized is then bound by a labeling agent, such as a second antibody bearing a label.
  • the amount of analyte (hPHTl and/or hPHT2 polypeptide) present in the sample is measured indirectly by measuring the amount of an added (exogenous) analyte displaced (or competed away) from a capture agent (antibody) by the analyte present in the sample.
  • a known amount of, in this case, labeled hPHTl and/or hPHT2 polypeptide is added to the sample and the sample is then contacted with a capture agent.
  • the amount of labeled polypeptide bound to the antibody is inversely proportional to the concentration of target hPHTl and/or hPHT2 polypeptide present in the sample.
  • the antibody is immobilized on a solid substrate.
  • the amount of target polypeptide bound to the antibody may be determined either by measuring the amount of target polypeptide present in a polypeptide/antibody complex, or alternatively by measuring the amount of remaining uncomplexed polypeptide.
  • the immunoassay methods of the present invention include an enzyme immunoassay (EIA) which utilizes, depending on the particular protocol employed, unlabeled or labeled (e.g., enzyme-labeled) derivatives of polyclonal or monoclonal antibodies or antibody fragments or single-chain antibodies that bind hPHTl and/or hPHT2 polypeptide(s), either alone or in combination.
  • EIA enzyme immunoassay
  • unlabeled or labeled e.g., enzyme-labeled derivatives of polyclonal or monoclonal antibodies or antibody fragments or single-chain antibodies that bind hPHTl and/or hPHT2 polypeptide(s)
  • a different detectable marker for example, an enzyme-labeled antibody capable of binding to the monoclonal antibody which binds the hPHTl and/or hPHT2 polypeptide, may be employed.
  • EIA enzyme-linked immunoabsorbent assay
  • ELISA enzyme-linked immunoabsorbent assay
  • immunoblotting immunoassay techniques such as western blotting employing an enzymatic detection system.
  • the immunoassay methods of the present invention may also be other known immunoassay methods, for example, fluorescent immunoassays using antibody conjugates or antigen conjugates of fluorescent substances such as fluorescein or rhodamine, latex agglutination with antibody-coated or antigen-coated latex particles, haemagglutination with antibody-coated or antigen-coated red blood corpuscles, and immunoassays employing an avidin-biotin or strepavidin-biotin detection systems, and the like.
  • the particular parameters employed in the immunoassays of the present invention can vary widely depending on various factors such as the concentration of antigen in the sample, the nature of the sample, the type of immunoassay employed and the like.
  • the amount of antibody that binds hPHTl and/or hPHT2 polypeptide is typically selected to give 50% binding of detectable marker in the absence of sample. If purified antibody is used as the antibody source, the amount of antibody used per assay will generally range from about 1 ng to about 100 ng.
  • Typical assay conditions include a temperature range of about 4°C. to about 45°C, preferably about 25°C to about 37°C, and most preferably about 25°C, a pH value range of about 5 to 9, preferably about 7, and an ionic strength varying from that of distilled water to that of about 0.2M sodium chloride, preferably about that of 0.15M sodium chloride.
  • Times will vary widely depending upon the nature of the assay, and generally range from about 0.1 minute to about 24 hours.
  • buffers for example PBS
  • other reagents such as salt to enhance ionic strength, proteins such as serum albumins, stabilizers, biocides and non-ionic detergents may also be included.
  • the assays of this invention are scored (as positive or negative or quantity of target polypeptide) according to standard methods well known to those of skill in the art.
  • the particular method of scoring will depend on the assay format and choice of label.
  • a Western Blot assay can be scored by visualizing the colored product produced by the enzymatic label. A clearly visible colored band or spot at the correct molecular weight is scored as a positive result, while the absence of a clearly visible spot or band is scored as a negative.
  • the intensity of the band or spot can provide a quantitative measure of target polypeptide concentration.
  • Antibodies for use in the various immunoassays described herein are commercially available or can be produced as described below.
  • Either polyclonal or monoclonal antibodies may be used in the immunoassays of the invention described herein.
  • Polyclonal antibodies are preferably raised by multiple injections (e.g. subcutaneous or intramuscular injections) of substantially pure polypeptides (hPHTl and/or hPHT2 or fragments thereof) or antigenic polypeptides into a suitable non-human mammal.
  • the antigenicity of the target peptides can be determined by conventional techniques to determine the magnitude of the antibody response of an animal that has been immunized with the peptide.
  • the peptides that are used to raise antibodies for use in the methods of this invention should generally be those which induce production of high titers of antibody with relatively high affinity for target polypeptides encoded by hPHTl and/or hPHT2.
  • the immunizing peptide may be coupled to a carrier protein by conjugation using techniques that are well-known in the art.
  • a carrier protein such commonly used carriers which are chemically coupled to the peptide include keyhole limpet hemocyanin (KLH), thyroglobulin, bovine serum albumin (BSA), and tetanus toxoid.
  • KLH keyhole limpet hemocyanin
  • BSA bovine serum albumin
  • tetanus toxoid tetanus toxoid.
  • the coupled peptide is then used to immunize the animal (e.g. a mouse or a rabbit).
  • the antibodies are then obtained from blood samples taken from the mammal.
  • the techniques used to develop polyclonal antibodies are known in the art (see, e.g.,
  • Polyclonal antibodies produced by the animals can be further purified, for example, by binding to and elution from a matrix to which the peptide to which the antibodies were raised is bound.
  • Those of skill in the art will know of various techniques common in the immunology arts for purification and/or concentration of polyclonal antibodies, as well as monoclonal antibodies see, for example, Coligan, et al. (1991) Unit 9, Current Protocols in Immunology, Wiley Interscience).
  • the antibodies produced will be monoclonal antibodies ("mAb's").
  • mAb's monoclonal antibodies
  • immunization of a mouse or rat is preferred.
  • antibody as used in this invention includes intact molecules as well as fragments thereof, such as, Fab and F(ab') 2 , and/or single-chain antibodies (e.g. scFv) which are capable of binding an epitopic determinant.
  • hybridomas secreting mAbs The general method used for production of hybridomas secreting mAbs is well known (Kohler and Milstein (1975) Nature, 256:495). Briefly, as described by Kohler and Milstein the technique comprises fusing an antibody-secreting cell (e.g. a splenocyte) with an immortalized cell (e.g. a myeloma cell). Hybridomas are then screened for production of antibodies that bind to hPHTl and/or hPHT2 or a fragment thereof.
  • an antibody-secreting cell e.g. a splenocyte
  • an immortalized cell e.g. a myeloma cell
  • Confirmation of specificity among mAb's can be accomplished using relatively routine screening techniques (such as the enzyme-linked immunosorbent assay, or "ELISA", BiaCore, etc.) to determine the binding specificity and/or avidity of the mAb of interest.
  • Antibodies fragments e.g. single chain antibodies (scFv or others), can also be produced/selected using phage display technology.
  • the ability to express antibody fragments on the surface of viruses that infect bacteria (bacteriophage or phage) makes it possible to isolate a single binding antibody fragment, e.g., from a library of greater than 10 10 nonbinding clones.
  • an antibody fragment gene is inserted into the gene encoding a phage surface protein (e.g., plU) and the antibody fragment-pHI fusion protein is displayed on the phage surface (McCafferty et al. (1990) Nature, 348: 552-554; Hoogenboom et al. (1991) Nucleic Acids Res. 19: 4133-4137).
  • a phage surface protein e.g., plU
  • phage bearing antigen binding antibody fragments can be separated from non-binding phage by antigen affinity chromatography (McCafferty et al. (1990) Nature, 348: 552-554).
  • affinity chromatography McCafferty et al. (1990) Nature, 348: 552-554
  • enrichment factors of 20 fold - 1,000,000 fold are obtained for a single round of affinity selection.
  • more phage can be grown and subjected to another round of selection. In this way, an enrichment of 1000 fold in one round can become 1,000,000 fold in two rounds of selection (McCafferty et al. (1990) Nature, 348: 552-554).
  • Human antibodies can be produced without prior immunization by displaying very large and diverse V-gene repertoires on phage (Marks et al. (1991) J. Mol. Biol. 222: 581-597).
  • natural VH and VL repertoires present in human peripheral blood lymphocytes are were isolated from unimmunized donors by PCR.
  • the V-gene repertoires were spliced together at random using PCR to create a scFv gene repertoire which is was cloned into a phage vector to create a library of 30 million phage antibodies (Id.).
  • binding antibody fragments have been isolated against more than 17 different antigens, including haptens, polysaccharides and proteins (Marks et al. (1991) J. Mol. Biol. 222: 581-597; Marks et al. (1993). Bio/Technology. 10: 779-783; Griffiths et al. (1993) EMBO J. 12: 725-734; Clackson et al. (1991) Nature. 352: 624-628). Antibodies have been produced against self proteins, including human thyroglobulin, immunoglobulin, tumor necrosis factor and CEA (Griffiths et al. (1993) EMBO J. 12: 725-734).
  • antibodies can be prepared by any of a number of commercial services (e.g., Berkeley antibody laboratories, Bethyl Laboratories, Anawa, Eurogenetec, etc.).
  • hPHTl and/or hPHT2 are transporters.
  • endogenous hPHTl and/or hPHT2 activity in a cell can be readily measured by providing a suitable substrate (e.g. one identified according to the methods described herein) and detecting the uptake of that substrate by hPHTl or hPHT2.
  • test agents for the ability to interact with (e.g. specifically bind to) an hPHTl or hPHT2 nucleic acid or polypeptide. Specifically, binding test agents are more likely to interact with and thereby modulate hPHTl or hPHT2 expression and/or activity.
  • the test agent(s) are pre-screened for binding to hPHTl and/or hPHT2 nucleic acids or to hPHTl and/or hPHT2 proteins before performing the more complex assays described above.
  • such pre-screening is accomplished with simple binding assays.
  • Means of assaying for specific binding or the binding affinity of a particular ligand for a nucleic acid or for a protein are well known to those of skill in the art.
  • the hPHTl and/or hPHT2 protein or protein fragment, or nucleic acid is immobilized and exposed to a test agent (which can be labeled), or alternatively, the test agent(s) are immobilized and exposed to an hPHTl and/or hPHT2 protein (or fragment) or to an hPHTl or hPHT2 nucleic acid or fragment thereof (which can be labeled).
  • the immobilized moiety is then washed to remove any unbound material and the bound test agent or bound hPHTl or hPHT2 nucleic acid or protein is detected (e.g. by detection of a label attached to the bound molecule).
  • the amount of immobilized label is proportional to the degree of binding between the hPHTl and/or hPHT2 protein or nucleic acid and the test agent.
  • the assays for modulators of peptide transporter expression and/or activity or for agents transported by the transporters of this invention are also amenable to "high- throughput" modalities.
  • new chemical entities with useful properties e.g., modulation of transporter activity or expression, or ability to be transported by the transporters of this invention
  • a chemical compound called a "lead compound”
  • HTS high throughput screening
  • high throughput screening methods involve providing a library containing a large number of compounds (candidate compounds) potentially having the desired activity. Such “combinatorial chemical libraries” are then screened in one or more assays, as described herein, to identify those library members (particular chemical species or subclasses) that display a desired characteristic activity. The compounds thus identified can serve as conventional "lead compounds" or can themselves be used as potential or actual therapeutics.
  • a combinatorial chemical library is a collection of diverse chemical compounds generated by either chemical synthesis or biological synthesis by combining a number of chemical "building blocks" such as reagents.
  • a linear combinatorial chemical library such as a polypeptide library is formed by combining a set of chemical building blocks called amino acids in every possible way for a given compound length (i.e., the number of amino acids in a polypeptide compound). Millions of chemical compounds can be synthesized through such combinatorial mixing of chemical building blocks. For example, one commentator has observed that the systematic, combinatorial mixing of 100 interchangeable chemical building blocks results in the theoretical synthesis of 100 million tetrameric compounds or 10 billion pentameric compounds (Gallop et al. (1994) 37(9): 1233-1250).
  • combinatorial chemical libraries include, but are not limited to, peptide libraries (see, e.g., U.S. Patent 5,010,175, Furka (1991) Int. J. Pept. Prot. Res., 37: 487-493, Houghton et al. (1991) Nature, 354: 84-88).
  • Peptide synthesis is by no means the only approach envisioned and intended for use with the present invention.
  • Other chemistries for generating chemical diversity libraries can also be used. Such chemistries include, but are not limited to: peptoids (PCT Publication No WO 91/19735, 26 Dec.
  • nucleic acid libraries see, e.g., Strategene, Corp.
  • peptide nucleic acid libraries see, e.g., U.S. Patent 5,539,083
  • antibody libraries see, e.g., Vaughn et al. (1996) Nature Biotechnology, 14(3): 309-314
  • PCT/US96/10287 carbohydrate libraries
  • carbohydrate libraries see, e.g., Liang et al. (1996) Science, 274: 1520-1522, and U.S. Patent 5,593,853
  • small organic molecule libraries see, e.g., benzodiazepines, Baum (1993) C&EN, Jan 18, page 33, isoprenoids U.S.
  • Patent 5,569,588, thiazolidinones and metathiazanones U.S. Patent 5,549,974, pyrrolidines
  • U.S. Patents 5,525,735 and 5,519,134, morpholino compounds U.S. Patent 5,506,337, benzodiazepines 5,288,514, and the like.
  • Devices for the preparation of combinatorial libraries are commercially available (see, e.g., 357 MPS, 390 MPS, Advanced Chem Tech, Louisville KY, Symphony, Rainin, Woburn, MA, 433A Applied Biosystems, Foster City, CA, 9050 Plus, Millipore, Bedford, MA).
  • a number of well known robotic systems have also been developed for solution phase chemistries. These systems include automated workstations like the automated synthesis apparatus developed by Takeda Chemical Industries, LTD. (Osaka, Japan) and many robotic systems utilizing robotic arms (Zymate ⁇ , Zymark Corporation, Hopkinton, Mass.; Orca, Hewlett-Packard, Palo Alto, Calif.) which mimic the manual synthetic operations performed by a chemist. Any of the above devices are suitable for use with the present invention. The nature and implementation of modifications to these devices (if any) so that they can operate as discussed herein will be apparent to persons skilled in the relevant art.
  • Preferred assays thus detect inhibition of transcription (i.e., inhibition of mRNA production) by the test compound(s), inhibition of protein expression by the test compound(s), binding to the gene (e.g., gDNA, or cDNA) or gene product (e.g., mRNA or expressed protein) by the test compound(s) in the case of expression assays, while transport assays preferably measure internalization of the test agent.
  • High throughput assays for the presence, absence, or quantification of particular nucleic acids or protein products are well known to those of skill in the art.
  • binding assays are similarly well known.
  • U.S. Patent 5,559,410 discloses high throughput screening methods for proteins
  • U.S. Patent 5,585,639 discloses high throughput screening methods for nucleic acid binding (i.e., in arrays)
  • U.S. Patents 5,576,220 and 5,541,061 disclose high throughput methods of screening for ligand/antibody binding.
  • high throughput screening systems are commercially available (see, e.g., Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, OH; Beckman Instruments, Inc. Fullerton, CA; Precision Systems, Inc., Natick, MA, etc.). These systems typically automate entire procedures including all sample and reagent pipetting, liquid dispensing, timed incubations, and final readings of the microplate in detector(s) appropriate for the assay.
  • These configurable systems provide high throughput and rapid start up as well as a high degree of flexibility and customization. The manufacturers of such systems provide detailed protocols the various high throughput.
  • Zymark Corp. provides technical bulletins describing screening systems for detecting the modulation of gene transcription, ligand binding, and the like. VI. Kits.
  • kits for isolation and/or detection and/or cloning of the transporter genes of this invention are provided for the practice of any of the assay methods described herein.
  • the kits comprise one or more containers containing nucleic acids encoding one or more of the H+/oligopeptide transporters or fragments thereof, or (optionally labeled) probes that specifically bind to one or more of the H+/oligopeptide transporters of this invention.
  • kits comprise one or more containers containing a vector encoding one or more of the transporters of this invention and/or cells or cell lines optionally transfected with one or more of these vectors.
  • the kit contain mRNA(s) encoding one or more of the transporters of this invention and/or cells suitable for transfection with such mRNAs.
  • the kits may optionally contain DNA template(s) suitable for preparation of such mRNAs.
  • the kits may optionally include one or more reagents for use in the methods of this invention. Such “reagents” may include, but are not limited to, cells and/or cell lines, transfection reagents (e.g.
  • kits may include instructional materials containing directions
  • kits for creating or modifying cells encoding one or more of the transporters of this invention, and or utilizing the kit contents for measuring expression of one or more of the transporters of this invention, or for screening for agents that are transported by one or more of the transporters of this invention.
  • the instructional materials typically comprise written or printed materials they are not limited to such. Any medium capable of storing such instructions and communicating them to an end user is contemplated by this invention. Such media include, but are not limited to electronic storage media (e.g., magnetic discs, tapes, cartridges, chips), optical media (e.g., CD ROM), and the like. Such media may include addresses to internet sites that provide such instructional materials.
  • the proton-dependent oligopeptide transporters (POT) gene family currently consists of -70 cloned cDNAs derived from diverse organisms. In mammals, two genes encoding peptide transporters, PepTI and PepT2 have been cloned in several species including humans, in addition to a rat histidine/peptide transporter (rPHTI). Because the Candida elegans genome contains five putative POT genes, we searched the available protein and nucleic acid databases for additional mammalian/human POT genes, using iterative BLAST runs and the human expressed sequence tags (EST) database. The apparent human orthologue of rPHTI (expression largely confined to rat brain and retina) was represented by numerous ESTs originating from many tissues.
  • EST human expressed sequence tags
  • HVdipeptide transporter protein PepTI
  • PepTI HVdipeptide transporter protein
  • hPepTl is a member of a well defined small gene family, the proton-dependent oligopeptide transporters (POT, also referred to as PTR), with ancestral roots that can be traced to bacterial, fungal, and plant peptide transporters (Graul and Sadee (1997) Pharm. Res., 14: 388-400; Fei et al. (1998) Prog. Nucleic Acid Res. Mol. Biol, 58: 239-261; Paulsen and Skurray (1994) Trends Biochem. Sci., 19: 404; Steiner et al. (1995) Mol. Microbiol, 16: 825-834).
  • POT proton-dependent oligopeptide transporters
  • This class of secondary active transporters has broad selectivity for di- and tripeptides, whereas ability to transport longer peptides decreases drastically with increasing length.
  • Most of the POT members share a common structural architecture with -12 predicted transmembrane domains (TMDs), but among the dipeptide transporters in distant phyla, variations on this theme do occur.
  • TMDs transmembrane domains
  • PepTI and PepT2 the main renal peptide transporters
  • Substrates include important drug classes, such as ⁇ -lactam and cephalosporin antibiotics, renin inhibitors, ACE inhibitors, and 5'-nucleoside esters of amino acids, such as valcyclovir (Han et al. (1998) Pharm. Res., 15: 1154-1159).
  • *POT indicates proton-dependent oligopeptide transporters; TMD indicates transmembrane domain. fPossible alternative splicing product of hPepTl.
  • a greater repertoire of the dipeptide transporter gene family in humans must be considered in the interpretation of pharmacological studies with peptoid drugs and could also serve for targeting drugs to specific tissues. Moreover, sequence variations in these transporters could account for interindividual genetic differences in the disposition of peptoid drugs.
  • TMHMM http://www.cbs.dtu.dk/services/TMHMM-1.0/
  • TMPRED http://www.ch.embnet.org/software/TMPRED_form.html
  • HMMTOP http://www.enzim.hu/hmmtop/
  • MEMSTAT http://globin.bio. warwick.ac.uk/psipred/
  • the transmembrane topology schematic was rendered using TOPO (S.J. Johns and R.C. Speth, Transmembrane protein display software, http://www.sacs.ucsf.edu/TOPO/topo.html).
  • Sequence identities were calculated using the Smith Waterman algorithm (Smith and Waterman (1981) J. Mol. Biol, 147: 195-197) by the program ssearch3, a component of the FASTA programs (Pearson (1991) Genomics, 11: 635-650).
  • RNA samples extracted from human intestinal biopsy specimen were analyzed by reverse transcriptase-polymerase chain reaction (RT-PCR).
  • RT-PCR reverse transcriptase-polymerase chain reaction
  • RT-PCR was performed with the GeneAmp RNA PCR Kit Part No. N808-0017 from Perkin Elmer (Wellesley, MA) using 0.5ul AmpliTaq® DNA Polymerase.
  • cDNA samples from skeletal muscle and pancreas purchased from CLONTECH Laboratories, Palo Alto, CA
  • the thermocycle included heating at 95°C, annealing at 60°C, reaction temperature at 70°C, for 35 cycles.
  • 3'MHPHT1 (GAGGATGAGCACAGCATCAA, SEQ ED NO: , right primer, at position
  • the amplified product was electrophoresed on a 1% Agarose gel, extracted, and sequenced by the University of California San Francisco Human Genetics Sequencing Service, San Francisco, CA.
  • Membrane blots containing size-fractionated poly(A)+ mRNA from 12 tissues of human origin were purchased from CLONTECH Laboratories.
  • the accession codes of the ESTs used as probes are W53019 and AA242853 for hPHTl and hPHT2, respectively.
  • These cDNA's were labeled with ⁇ 32 P-dATP (3000 Ci/mmol; Amersham, Piscataway, NJ) according to the random priming method, using a kStrip-EZ DNA kit (Ambion, Austin, TX). Hybridization was performed at 42°C overnight after purifying 32 -P- cDNA on a Sephadex G-50 spin column (mini Quick Spin Columns; Boehringer Mannheim, Indianapolis, IN).
  • the blots were washed twice at 42°C for 5 min with low stringency solution (Ambion) and for 15 min with high stringency solution (Ambion).
  • the membranes were exposed against x-ray film at -80°C for 3 to 7 days with intensifying screens.
  • the hybridized probe was removed from the membranes by using a Strip-EZ DNA kit (Ambion) before rehybridization.
  • each of the 68 members of the core cluster was run against the human EST database, and the results tabulated such that the identified ESTs were listed with the core sequence providing the highest score.
  • a bacterial drug-resistance transporter in the protein core cluster (bold-face); this sequence would have identified more members of the core cluster in a third iteration (not done here).
  • These include bacterial drug resistance transporters which are currently listed outside the core cluster. Putative POT members from C elegans are shown in bold-face with italics.
  • the human ESTs mainly cluster with sequences 11 (hPepT2), 50 (rPHT), and 51 (mouse cAMP-inducible 1 protein).
  • RCH2 protein [Brassica napus] 123 6e-27
  • Table 2 contains only two iterations. A number of these drug-resistance transporters of the major facilitator type transporter family appear in the list of neighbor sequences outside the core cluster (Table 2).
  • the core cluster contains 5 putative POT genes from the completely sequenced genome of C elegans (Table 2, bold-face and italics). Each of these deduced proteins has high similarity to hPepTl. This finding suggests that the human genome may also contain more POT members than are currently cloned.
  • Table 2 includes a number of deposited sequences encoding the main intestinal and renal transporters, hPepTl (sequences 1-3) and hPepT2 (sequence 12), and their orthologues in other mammalian species .
  • the pH sensing regulatory factor of peptide transporter (sequence 20) (Saito et al.
  • rPHTI rat peptide/histidine transporter
  • rPHTI and mouse cAMP-inducible 1 protein have apparent human orthologues, which we term hPHTl and hPHT2, respectively.
  • rPHTI and mouse cAMP-inducible 1 protein represent two distinct but closely related genes belonging to the POT family.
  • the core cluster of POT sequences contains one additional human sequence, namely, erythroid differentiation-related factor 2 (sequence 42.41, Table 2).
  • the E value (4 x 10 " ) suggests probable homology to a putative POT transporter of Bacillus subtilis (sequence 42; 13 predicted TMDs). However, this sequence is rather short (107 residues), showing good sequence similarity with TMD11 and adjacent loop of the B subtilis transporter.
  • Second Pass INCA Scanning the Human EST Database
  • PepT2 vs human EST gi
  • a schematics of the hPHTl contig assembly Figure IA contains the minimum number of EST's spanning the length of the deduced hPHTl sequence. To view the EST coverage of each segment, click on the EST/region of interest. This will reveal segments with numerous overlapping ESTs. Because multiple ESTs cover the same regions of hPHTl, one can deduce possible sequence variations in the human population where EST sequences are not identical. Clearly, many of these variations may be due to sequencing errors, but in a few cases a variation occurs more than once at the same location.
  • the contig sequence is closely related to that of rPHTI; however, the first -50 5'-terminal base pairs are missing. (It appears that there is a rare N ⁇ tl site at the 5'-end, which could have caused truncation during preparation for EST sequence analysis.) Thus, the EST contig is likely to represent >95% of the coding region of hPHTl.
  • the deduced hPHTl amino acid sequence is shown in SEQ D NO: 5).
  • the faster migrating band contained a gap of 169 bps in the middle of hPHTl Figure IC, Variant B; for RT-PCR results see Figure 5 A).
  • Some of the putative hPHTl splice variants may introduce a frame shift and would not be expected to result in a functionally active transporter. These variants need to be cloned individually and tested experimentally. The information presented here is therefore important for guiding cloning efforts to produce functional hPHTl protein. Genomic hPHT2 Sequence.
  • the human EST contig sequence corresponding to rat cAMP-inducible 1 protein served to scan the human nr nucleotide databases. This revealed a PAC clone (Pl- derived artificial chromosome) containing human genomic sequences closely related to the mouse cDNA-encoding cAMP-inducible 1 protein.
  • PAC clone Pl- derived artificial chromosome
  • cDNA sequence of cAMP- inducible 1 protein we were able to identify the likely introns and exons representing the presumed hPHT2 gene Figure 2; (SEQ ED Nos: 7-23). Each of the intron-exon boundaries are flanked by GT. . . AG in the intron sequence.
  • the intron structure follows the GT- AG rule, where GT is called the splice donor and AT is called the splice acceptor.
  • the deduced cDNA coding sequence and protein sequence are shown in
  • Table 4 lists the identities and similarities among the protein sequences of the main mammalian members of the POT family. While hPepTl and hPEPT2 represent one branch of this family, hPHTl, hPHT2, rPHT, and mouse cAMP-inducible 1 protein are closely related and form a second branch. hPHTl has 89% identity to rPHTI, while hPHT2 is 81% identical to mouse cAMP-inducible 1 protein. Multiple sequence alignments are provided in Figure 3, including either the PHT branch only (Figure 3A), or both branches ( Figure 3B).
  • the deduced putative cDNA coding sequence is the identical length of the coding sequence suggested by our genomic hPHT2 sequence, and it is identical to hPHT2 over a large portion of the presumed coding region.
  • a fragment of 50 bps in the hPHT2 coding region from position 61-110 is replaced in the cloned cDNA by a 50-bp fragment of low complexity (cg-rich), which is excluded from BLAST analysis by a low complexity filter.
  • This 50-bp cDNA fragment did not recognize any sequence in the PAC clone containing the hPHT2 genomic sequence, but it did recognize fragments in a number of unrelated genes, therefore, possibly representing a low complexity repeat fragment.
  • cDNA sequence was identical to that of hPHT2, except for an insertion of three nucleotides each in three different locations of hPHT2 (at positions 837, 1271, and 1428 of hPHT2). These sequence variations would indicate the presence of three additional amino acids at these respective positions, without disturbing the overall reading frame. It remains to be seen how these changes from our deduced coding sequence came about and whether they are of functional significance. In any case, comparing the cDNA and genomic sequences reveals many details of the possible protein structure not available otherwise.
  • hPHTl was mainly expressed in skeletal muscle, followed by kidney, heart, and liver, with relatively little expression in colon and brain. mRNA bands were detected at apparent molecular weight 2.8 kb and 5.1 kb, indicating the presence of possible mRNA variants.
  • the mRNA tissue distribution of hPHT2 differed significantly from that of hPHTl Figure 5B. A single major band appeared at 2.4 kb, with highest expression in spleen, placenta, lung, and leukocytes, followed by heart, kidney, and liver.
  • hPHTl a contig sequence termed hPHTl
  • EST sequencing is not rigorously quality controlled and single nucleotide variants occur only sporadically, multiple overlapping ESTs could nevertheless assist in finding single nucleotide polymorphisms (SNPs).
  • hPHTl The strong representation of hPHTl in the EST database suggests that the presumed hPHTl is widely expressed in human tissues, largely in the CNS, in contrast to its restricted expression in rats.
  • One of these ESTs stems from a human colon carcinoma, an indication that hPHTl may also be expressed in human intestines.
  • RT-PCR Northern blot analysis has revealed that hPHTl and KPHT2 are not highly expressed in intestines relative to other tissues. Protein expression and functional studies are required to determine whether these transporters, in addition to hPepTl, could play a role in intestinal peptoid drug absorption.
  • hPHTl or 2 could play a role in oral antibiotic bioavailability remains to be seen.
  • Our Northern blot analysis revealed strong expression for hPHTl in skeletal muscle and kidney while hPHT2 was highly expressed in leukocytes, lung placenta, and spleen. Detectable expression of both genes in organs, such as the heart, may be of interest in understanding the efficacy of antibiotic treatment of localized infections -; particularly if the infectious agent resides intracellularly.
  • the tissue distribution of gene expression differs from that of hPepTl (mainly intestinal) and hPepT2 (mainly renal) which underscores the relevance of our findings to targeting therapy to specific organs.
  • hPepTl hPepTl
  • affinity of these nucleoside prodrugs for other peptide transporters remains to be determined.
  • PEPTI Proton-coupled oligopeptide transporter PEPTI facilitates the transport of dipeptides and peptoid drugs (including antibiotics) across the cell membranes of endothelial and epithelial cells.
  • Substrate transport by the proton symport is driven by pH gradients, while the profile of pH sensitivity is regulated by a closely related protein, hPEPTl-RF.
  • hPEPTl-RF a closely related protein
  • hPEPTl is encoded by 23 exons and hPEPTl-RF by 6 exons. Coding sequences of hPEPTl- RF share 3 exons completely and 2 exons partially with hPEPTl.
  • the genomic organization of hPEPTl shows high similarity with its mouse orthologue. Exon-intron boundaries occur mostly in the loops connecting transmembrane segments (TMSs), suggesting a modular gene structure reflecting the TMS-loop repeat units in hPEPTl.
  • TMSs transmembrane segments
  • the putative promoter region of hPEPTl contains TATA boxes and GC-rich regions and a potential insulin responsive element.
  • POTs Proton-coupled oligopeptide transporters
  • A.17 for transporter classification see http://www.biology.ucsd.edu/ ⁇ msaier/transport/titlep age.html.
  • Oligopeptide transporters are symporters driven by the flux of protons; they have a molecular architecture consisting of -12 predicted TMSs (Sadee et al. (1995) Pharm Res. 12: 1823-1837).
  • Members of the POT family include peptide transporter 1 (PEPTI) (Fei et al. (1994) Nature, 368: 563-566; Liang et al. (1995) J. Biol. Chem. 270: 6456-6463), peptide transporter 2 (PEPT2) (Liu et al.
  • Human PEPTI cDNA contains 3105 base pairs (bp), and the predicted protein consists of 708 amino acids.
  • the transporter protein has 12 predicted TMSs and 2 putative protein kinase C phosphorylation sites.
  • the membrane topology of the human dipeptide transporter, hPEPTl was determined by epitope insertions by Covitz et al (Covitz et al. (1998) Biochem. 37: 15214-15221).
  • PEPTI is expressed in the intestine (brush border), early proximal kidney tubuli, liver, placenta, and pancreas (Liang et al. (1995) 7. Biol. Chem. 270: 6456-6463; Shen et al. (1999) Am J Physiol. 276: F658- F665). In the intestines, PEPTI facilitates absorption of digested dipeptides so that most of the dietary nitrogen is absorbed as dipeptides rather than as amino acids (Ganapathy and Leibach (1999) pages 456-467 In: Yamada T, ed. Textbook of Gastroenterology. Philadelphia, PA: Lippincott Williams and Wilkins).
  • Human PEPTI has broad substrate specificity.
  • the substrates include di- and tripeptides and peptoid drugs.
  • PEPTI mediates the high bioavailability of many hydrophilic beta-lactam antibiotics (Terada et al. (1999) Am J Physiol. 276: G1435-G1441).
  • PEPTI is suggested to play a role in intracellular peptide transport, including lysosomal transport (Gonzales et al. (1998) Cancer Res. 519-525). Saito et al. (1997) Biochem Biophys Res Commun. 237: 577-582, have described a highly related transcript, termed hPEPTl-RF, which modulates the activity of human PEPTI.
  • the cDNA for the regulatory factor encodes an open reading frame of 208 amino acids. Residues 18-195 are identical to residues 8-185 in hPEPTl, while sequences 1- 17 and 196-208 are unique. Both hPEPTl and hPEPTl-RF are expressed in Caco-2 cells. Expression studies in Xenopus oocytes and Caco-2 cells showed that the regulatory factor shifted the pH-sensitivity profile of hPEPTl -mediated peptide transport (Saito et al. (1997) Biochem Biophys Res Commun. 237: 577-582). Although somatic cell hybrid analysis and in situ hybridization studies of Liang et al. (1995) J. Biol. Chem.
  • BLAST National Center for Biotechnology Information
  • NCBI National Center for Biotechnology Information
  • BLOSUM62 matrix was used with default parameters. The analysis was done with and without filtering of the low-complexity sequences and without masking of repetitive elements. Queries used the cDNA sequences of human PEPTI (accession number: NM_005073) and hPEPTl-RF (AB001328) and the high-throughput genomic sequence (HTGS) database.
  • sequences 2 kb upstream from the transcription start sites of hPEPTl and hPEPTl -RF were investigated using programs FindPatterns and FitConsensus (Genetics Computer Group, Madison, WI)to locate possible promoters and enhancer sites.
  • Codon phase refers to the codon in the 5' end of the exon.
  • ⁇ xon T is repetitive element UTR.
  • the hPEPTl gene structure shows several interesting features.
  • the start sites of the transcripts for hPEPTl and pH-regulatory factor are located in different exons ( Figure 6).
  • exon 1 located >20 kb upstream of exon 2 contains only the first 4 nucleotides of the hPEPTl coding region.
  • Alternative splicing occurs in exon 3, and 118 bases in the 5' end of exon 3 are spliced out of the mRNA of hPEPTl.
  • Another site for differential splicing is exon 7 of hPEPTl -RF. In this case, 41 bases in the 3' end of the exon are spliced out of hPEPTl hmRNA ( Figure 6).
  • Membrane topology predictions of hPEPTl and hPEPTl -RF proteins are shown in Figures 2 and 3.
  • the transmembrane topology schematics were rendered using TOPO (S.J. Johns and R.C. Speth, Transmembrane protein display software, http://www.sacs.ucsf.edu/TOPO/topo.html, unpublished data).
  • the figures show the peptide sequences that are encoded by each exon.
  • hPEPTl is predicted to have 12 transmembrane segments (TMSs).
  • TMSs transmembrane segments
  • the upstream region (2 kb) from the transcription start sites of hPEPTl is shown in Figure 4. TATA boxes were found about 520 bp upstream from the transcription start site in hPEPTl.
  • the putative regulatory region also contains GC boxes, so several GC boxes are located within 300 bp from the transcription site in hPEPTl. Binding sites for transcription factors did not include any amino acid responsive element. Some other transcription factor binding sites of the regulatory regions are illustrated in Figure 9.
  • the genomic structure of hPEPTl and hPEPTl-RF presented here is based on a sequence in the HTGS database.
  • the HTGS contains yet unordered pieces of genomic sequences.
  • Three introns of hPEPTl include such gaps (indicated by > signs in Table 5), while hPEPTl -RF exons are all located in one contig. Within the contigs the sequences are likely to be unaffected, and intron sizes are reliable
  • PEPT1-RF and PEPTI share 5 identical TMSs, while the extramembraneous terminals differ ( Figure 6, Figure 7, and Figure 8).
  • PEPT1-RF is not capable of transporting substrates across the membrane, but it is thought to sense pH changes and modulate the response of PEPTI to these changes (Saito et al. (1997) Biochem Biophys Res Commun. 237: 577-582).
  • Fei et al. (1998) Biochem Biophys Res Commun. 246: 39-44, have shown by using chimeric PEPT1- PEPT2 proteins that the TMSs 7-9 are important for substrate recognition by hPEPTl.
  • PEPT1-RF does not have these TMSs and does not transport substrates.
  • Insulin regulation was mediated by transporter translocation to the basolateral side of the cells upon release of hPEPTl from the translated intracellular pool to the plasma membrane. Changes in hPEPTl mRNA were not seen in that study. However, the putative insulin responsive element is located upstream from the transcription start site ( Figure 9), suggesting that insulin might be involved in the regulation of hPEPTl transcriptional activity.
  • the genomic organization of hPEPTl and hPEPTl- RF indicates that they are splice variants of the same gene ( Figure 6). Expression of hPEPTl -RF has not been studied in detail. Nevertheless, the splice variants may be expressed in different proportions depending on, for example, the stage of differentiation, hormonal regulation signals, and cell type.
  • Human PEPTI is expressed in several tissues (intestine, kidney, brain, liver) where the pH environment is quite different. Also, an intracellular pool of hPEPTl may be associated with peptide trafficking in lysosomes and endosomes that have different pH depending on the maturity of the vesicle (Gonzales et al. (1998) Cancer Res. 519-525).

Landscapes

  • Chemical & Material Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • Physics & Mathematics (AREA)
  • Nanotechnology (AREA)
  • Biophysics (AREA)
  • Medicinal Chemistry (AREA)
  • Cell Biology (AREA)
  • Immunology (AREA)
  • Toxicology (AREA)
  • Zoology (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Biochemistry (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Mathematical Physics (AREA)
  • Molecular Biology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Theoretical Computer Science (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Peptides Or Proteins (AREA)
  • Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
  • Investigating Or Analysing Biological Materials (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)

Abstract

This invention relates to the field of oligopeptide transporters and drug transport. In particular this invention relates to the discovery of new H+/oligopeptide transporters and their use in drug delivery applications. This invention also provides assays to identify agents that are transported by the newly discovered transporters and/or to identify modulators of transporter expression.

Description

NOVEL MEMBERS OF THE H+/OLIGOPEPTIDE TRANSPORTER
GENE FAMILY
CROSS-REFERENCE TO RELATED APPLICATIONS
This claims priority to and benefit of United States provisional application USSN 60/182,328, filed on February 14, 2000, and corresponding non-provisional U.S. application entitled Novel Members Of The H+/Oligopeptide Transporter Gene Family, filed on February 13, 2001, both of which are incorporated herein by reference in their entirety for all purposes.
STATEMENT AS TO RIGHTS TO INVENTIONS MADE UNDER FEDERALLY SPONSORED RESEARCH AND DEVELOPMENT
T Not Applicable 1
FIELD OF THE INVENTION
This invention relates to the field of oligopeptide transporters and drug transport. In particular this invention relates to the discovery of new H+/oligopeptide transporters and their use in drug delivery applications.
BACKGROUND OF THE INVENTION
In mammalian cells, peptides are transported in and out of cells by several different transport carriers. Functionally, there are transporters responsible for the influx of peptides into the cell and transporters responsible for the efflux of peptides out of the cells. Influx transporters transport small peptides and related compounds into the cytoplasm, and are indirectly linked to an energy source through ion gradients. Efflux transporters consist of several different transporters that function to remove peptides from the cytoplasm. These include the P-glycoprotein that removes a number of oncolytics as well as hydrophobic peptides (Endicott and Ling (1989) Annu. Rev. Biochem. 58:137-171; Sharma et al. (1992) J. Biol. Chem. 267: 5731-5734).
The present invention relates to peptide transporters responsible for influx of peptides into cells or organelles. Peptide transporters are often located in the gastrointestinal tract, kidney, placenta, and liver lysosomes (Ganapathy et al. (1991) Indian J. Biochem. Biophys. 28: 317-323; Skopicki et al. (1991) Am. J. Physiol. 261: F670-F678; Ganapathy et al. (1981) J. Biol. Chem. 256: 118-124; Bird and Lloyd (1990) Biochim. Biophys. Acta 1024:267-270).
Small peptides and peptide-like drugs are often too polar to cross lipid bilayers by simple diffusion and their translocation therefore depends on active transport by a suitable carrier. As a result, orally administered peptoid drugs and mimetics thereof would be poorly absorbed unless transported (Yang (1999) Pharm. Res., 16: 1331-1343). Indeed, it has been demonstrated that many different solutes including small peptides (di- and tripeptides), antibiotics (including several oral .beta.-lactams), oral angiotensin converting enzyme (ACE) inhibitors, and oral renin inhibitors are transported into the cytoplasm of the enterocyte by the influx peptide transporter (Ganapathy and Leibach (1991) Curr. Biol. 3: 695-701; Okano et al. (1986) J. Biol. Chem. 261: 14130-14134; Nakashima et al. (1984) Biochem. Pharm. 33: 3345-3352; Muranushi et al. (1989) Pharm. Res. 6: 308-312; Friedman and Amidon (1989) Pharm. Res. 6: 1043-1047; Friedman and Amidon (1990) J. Control Rel. 13: 141-146; Kramer (1991) 17th International Congress of Chemotherapy, June 23-28, Berlin, F.R.G., Abstract No. 1415).
The main intestinal H+/dipeptide transpoter protein, PepTl, is thought to play a critical role in oral bioavailablity of peptide-like drugs (Dantzig and Bergin (1990) Biochim. Biophy. Acta, 1027: 211-217; Matsumoto et al. (1994) J. Pharmacol. Exp. Ther., 270: 498-504; Wenzel et al. (1996) J. Pharmacol. Exp. Ther., 211 ± 831-839; Kramer et al. (1990) Biochim. Biophys. Acta, 1027: 25-30; Fei et al. (1994) Nature, 68: 563-566; Thwaites et al. (1993) J. Biol. Chem., 268: 7640-7642). hPepTl is a member of a well defined small gene family, the proton-dependent oligopeptide transporters (POT, also referred to as PTR), with ancestral roots that can be traced to bacterial, fungal, and plant peptide transporters (Graul and Sadee (1997) Pharm. Res., 14: 388-400; Fei and Leibach (1998) Prog. Nucleic Acid Res. Mol. Biol, 58: 239-261; Steiner et al. (1995) Mol. Microbiol, 16: 825-834). This class of secondary active transporters has broad selectivity for di- and tripeptides, whereas ability to transport longer peptides decreases with increasing length. Commonly, single amino acids are not substrates, wit ha few exceptions. Most of the POT members share a common structural architecture with -12 predicted transmembrane domains (TMDs), but among the dipeptide transporters in distant phyla, variations of this theme do occur. Until recently, only two POT genes had been identified in mammalian species, Pep TI and PepT2, the main renal peptide transporter (Fei et al. (1994) Nature, 368: 563-566; Liu et al. (1995) Biochim. Biophys. Acta 1235: 461-466). cDNAs of the respective human orthologues have been cloned since (orthologue refers to the same gene in different species) (Liu et al. (1995) Biochim. Biophys. Acta 1235: 461-466; Liang et al. (1995) J. Biol. Chem., 270: 6456-6463). These transporters display overlapping broad substrate selectivity . As a result, a broad spectrum of drugs interacts with them, some with chemical structures quite distinct from peptides. Substrates include important drug classes such as β- lactam and cephalosporin antibiotics, rennin inhibitors, ACE inhibitors, and 5' nucleoside esters of amino acids, such as valcyclovir (Han et al. (1998) Pharm. Res., 15: 1154-1159). Despite these advances, it is desirable to identify other peptide transporters to improve uptake of various drugs or prodrugs, to improve tissue specificity for particular drugs, and the like.
SUMMARY OF THE INVENTION This invention provides new human proton/oligopeptide transporter (POT) genes and uses thereof. Nucleic acid sequences, amino acid sequences, and primers sufficient to amplify transporter nucleic acid and/or probes specific to these nucleic acids and/or splice variants thereof are provided herein. The new transporters are identified as hPHTl and hPHT2. . Thus, in one embodiment, this invention provides an isolated nucleic acid encoding a proton -coupled peptide transporter. The nucleic acid comprises a nucleic acid or the complement of a nucleic acid selected from the group consisting of: a nucleic acid that specifically hybridizes to hPHTl (e.g. SEQ ID NO: 24) or hPHT2 under stringent conditions and that encodes a proton-coupled peptide transporter; a nucleic acid that has 90% or greater sequence identity with hPHTl or hPHT2 and that encodes a proton-coupled peptide transporter; a nucleic acid that encodes an hPHTl peptide transporter protein or an hPHT2 peptide transporter protein; an hPHTl or hPHT2 splice variant; a nucleic acid that is amplified using primers of SEQ ID NO:l and SEQ ID NO:2, and human intestinal DNA as a template; a nucleic acid that is amplified using primers of SEQ ID NO:3 and SEQ ID NO:4, and human intestinal DNA as a template; a nucleic acid that hybridizes under stringent conditions to a nucleic acid amplified using primers of SEQ ID NO: 1 and SEQ ID NO:2, and human intestinal DNA as a template, where the nucleic acid encodes a peptide transporter; a nucleic acid that hybridizes under stringent conditions to a nucleic acid amplified using primers of SEQ ID NO:3 and SEQ ID NO:4, and human intestinal DNA as a template, where the nucleic acid encodes a peptide transporter; a nucleic acid that comprises at least 15 contiguous nucleotides of an a an hPHTl peptide transporter protein or an hPHT2 transporter protein. Also included are (optionally labeled) nucleic acids that specifically hybridize to any of the above-described nucleic acids under stringent conditions (e.g., probes). In preferred embodiments, the nucleic acid is a vector.
Also provided are polypeptides encoded by any of these nucleic acids. Particularly preferred polypeptides include, but are not limited to, polypeptides comprising a proton-coupled peptide transporter. Also provided are fragments of such polypeptides that comprise one or more epitopes specifically recognized by an antibody that specifically binds to the hPHTl and/or hPHT2 transporters. Also provided are antibodies (complete, fragments, or single chain) that specifically bind the hPHTl and/or hPHT2 transporters.
This invention also provides cells that are transfected with one or more of the nucleic acids described herein. Particularly preferred cells express a heterologous peptide transporter (e.g. an hPHTl and/or hPHT2 transporter). The cells preferably include any vertebrate cell and include somatic cells or oocytes. The cells may be transfected with either a DNA or an RNA and, in this context, transfection includes essentially any method of introducing a nucleic acid into a cell (e.g. electroporation, microinjection, lipid complex, etc.).
In still another embodiment, this invention provides a computer readable medium having recorded thereon one or more of the nucleotide sequences described herein. Particularly preferred media, also include an identification of the sequences as transporters or as encoding transporters, or as components of transporters or as components of nucleic acids encoding transporters, or an association to a reference or medium identifying the sequences as encoding transporters. Virtually any computer-readable medium is suitable including, but not limited to a floppy disc, a hard disc, a CD disc, a DVD disc, a random access memory (RAM), a read-only memory (ROM), and a flash memory. The medium can be a component of a nucleic acid and/or peptide synthesizer or compatible with (e.g. able to provide sequence information to) a nucleic acid and/or a peptide synthesizer.
This invention also provides assays for identifying a compound whose cellular uptake is mediated by an hPHTl or hPHT2 peptide transporter. The assays preferably involve i)contacting a cell expressing a peptide transporter selected from the group consisting of an hPHTl transporter, and an hPHT2 transporter with a test compound; and ii) detecting uptake of the test compound by the cell where elevated uptake of the compound by the cell as compared to a cell expressing the peptide transporter at a lower level indicates that said peptide transporter mediates transport of said test compound. The cell can be a cell expressing an endogenous hPHTl and/or hPHT2 and/or a cell transfected with a vector that encodes the hPHTl or hPHT2 peptide transporter. The cell is preferably a vertebrate somatic cell or a vertebrate oocyte. In one embodiment, amphibian oocytes (e.g., Xenopus oocytes) are particularly preferred, while in certain other embodiments, mammalian somatic cells (e.g. heart cells, intestinal cells, etc.) are preferred. The compounds screened may include virtually any compound, however, in preferred embodiments, the compound is a small organic molecule, more preferably a drug or a prodrug.
Also provided are methods of targeting a drug to a tissue that expresses a hPHTl or hPHT2 peptide transporter. These methods preferably involve identifying a drug or prodrug that is transported by a hPHTl or hPHT2 peptide transporter; and contacting the tissue with said drug. This method may further involve identifying a tissue that expresses a hPHTl or hPHT2 peptide transporter. The identification can involve selecting a drug, or prodrug, known to be transported by a hPHTl or hPHT2 peptide transporter or screening for such a drug or prodrug (e.g. as described herein). In one particularly preferred embodiment, the peptide transporter is hPHTl and/or hPHT2 and the tissue is heart. This invention additionally provides methods of identifying agent(s) that modulate expression or activity of an hPHTl and/or hPHT2 peptide transporter. The methods involve contacting a cell comprising a gene encoding an hPHTl and/or an hPHT2 peptide transporter with a test agent; and detecting the expression level or activity of hPHTl or hPHT2 peptide transporter(s) where a difference in expression level or activity of hPHTl or hPHT2 as compared to the expression level, or activity, of hPHTl or hPHT2 in a cell contacted with a different amount of said agent indicates that said agent modulates expression, or activity, of the hPHTl peptide transporter or the hPHT2 peptide transporter. In particularly preferred embodiments, the detecting comprises detecting an hPHTl or hPHT2 nucleic acid (e.g. DNA, mRNA, cDNA, etc.), and/or an hPHTl or hPHT2 protein, and/or transport activity of an hPHTl or hPHT2 protein. In certain embodiments, detecting comprises detecting an hPHTl mRNA or an hPHT2 mRNA (e.g. by hybridizing said mRNA to a probe that specifically hybridizes to an hPHT2 or to an hPHTl nucleic acid). Certain preferred hybridization detection methods include, but are not limited to a Northern blot, a Southern blot using DNA derived from the hPHTl or hPHT2 RNA, an array hybridization, an affinity chromatography, and an in situ hybridization. The hybridization probe can be a single probe or a plurality of probes, e.g. a member of a plurality of probes that forms an array of probes. In certain embodiments, the level of hPHTl mRNA or hPHT2 RNA is measured using a nucleic acid amplification reaction. In certain embodiments, detecting comprises detecting an hPHTl protein or an hPHT2 protein (e.g. via capillary electrophoresis, a Western blot, mass spectroscopy, ELISA, immunochromatography, immunohistochemistry, etc.). In various embodiments, the cell contacted with the different amount of said agent is a negative control that is not contacted with said agent or the cell contacted with said different amount of the agent is a positive control that is contacted with a greater amount of the agent. The cell is preferably a human somatic cell (e.g. a human heart cell, a human intestinal cell, etc.).
In still another embodiment, this invention provides a method of prescreening for an agent that agent that modulates expression or activity of an hPHTl peptide transporter or an hPHT2 peptide transporter. The method involves contacting an hPHTl or hPHT2 nucleic acid (or fragment thereof) or an hPHTl or hPHT2 protein (or fragment thereof) with a test agent; and detecting specific binding of said test agent to said hPHTl or hPHT2 protein or nucleic acid. The method can further involve recording test agents that specifically bind to said hPHTl or hPHT2 nucleic acid or protein in a database of candidate agents that alter peptide transporter activity. In preferred embodiments, the test agent is not an antibody and/or not a protein, and/or not a nucleic acid. Preferred test agents are small organic smolecules.
Also provided are kits comprising a container containing one or more of the nucleic acids and/or proteins and/or cells, and/or antibodies described herein. The kits optionally further comprise instructional materials providing protocols for the assays described herein.
DEFINITIONS
The terms "isolated" "purified" or "biologically pure" refer to material which is substantially or essentially free from components which normally accompany it as found in its native state. In the case of a nucleic acid, an isolated nucleic acid is typically free of the nucleic acid sequences by which it is flanked in nature. An isolated nucleic acid can be reintroduced into a cell and such "heterologous" nucleic acids are regarded herein as isolated. In addition, nucleic acids synthesized de novo or produced by cloning (e.g. recombinant DNA technology) are also regarded as "isolated".
The terms "polypeptide", "peptide" and "protein" are used interchangeably herein to refer to a polymer of amino acid residues. The terms apply to amino acid polymers in which one or more amino acid residue is an artificial chemical analogue of a corresponding naturally occurring amino acid, as well as to naturally occurring amino acid polymers. The term also includes variants on the traditional peptide linkage joining the amino acids making up the polypeptide.
The terms "nucleic acid" or "oligonucleotide" or grammatical equivalents herein refer to at least two nucleotides covalently linked together. A nucleic acid of the present invention is preferably single-stranded or double stranded and will generally contain phosphodiester bonds, although in some cases, as outlined below, nucleic acid analogs are included that may have alternate backbones, comprising, for example, phosphoramide (Beaucage et al. (1993) Tetrahedron 49(10): 1925) and references therein; Letsinger (1970) J. Org. Chem. 35:3800; Sprinzl et al. (1977) Eur. J. Biochem. 81: 579; Letsinger et al. (1986) Nucl. Acids Res. 14: 3487; Sawai et al. (1984) Chem. Lett. 805, Letsinger et al. (1988) J. Am. Chem. Soc. 110: 4470; and Pauwels et al. (1986) Chemica Scripta 26: 1419), phosphorothioate (Mag et al. (1991) Nucleic Acids Res. 19: 1437; and U.S. Patent No. 5,644,048), phosphorodithioate (Briu et al. (1989) J. Am. Chem. Soc. I l l :2321, O- methylphophoroamidite linkages (see Eckstein, Oligonucleotides and Analogues: A
Practical Approach, Oxford University Press), and peptide nucleic acid backbones and linkages (see Egholm (1992) J. Am. Chem. Soc. 114:1895; Meier et al. (1992) Chem. Int. Ed. Engl. 31: 1008; Nielsen (1993) Nature, 365: 566; Carlsson et al. (1996) Nature 380: 207). Other analog nucleic acids include those with positive backbones (Denpcy et al. (1995) Proc. Natl. Acad. Sci. USA 92: 6097; non-ionic backbones (U.S. Patent Nos. 5,386,023,
5,637,684, 5,602,240, 5,216,141 and 4,469,863; Angew. (1991) Chem. Intl. Ed. English 30: 423; Letsinger et al. (1988) J. Am. Chem. Soc. 110:4470; Letsinger et al. (1994) Nucleoside & Nucleotide 13:1597; Chapters 2 and 3, ASC Symposium Series 580, "Carbohydrate Modifications in Antisense Research", Ed. Y.S. Sanghui and P. Dan Cook; Mesmaeker et al. (1994), Bioorganic & Medicinal Chem. Lett. 4: 395; Jeffs et al. (1994) J. Biomolecular NMR 34:17; Tetrahedron Lett. 37:743 (1996)) and non-ribose backbones, including those described in U.S. Patent Nos. 5,235,033 and 5,034,506, and Chapters 6 and 7, ASC Symposium Series 580, Carbohydrate Modifications in Antisense Research, Ed. Y.S. Sanghui and P. Dan Cook. Nucleic acids containing one or more carbocyclic sugars are also included within the definition of nucleic acids (see Jenkins et al. (1995), Chem. Soc. Rev. ppl69-176). Several nucleic acid analogs are described in Rawls, C & E News June 2, 1997 page 35. These modifications of the ribose-phosphate backbone may be done to facilitate the addition of additional moieties such as labels, or to increase the stability and half-life of such molecules in physiological environments.
The term "heterologous" as it relates to nucleic acid sequences such as coding sequences and control sequences, denotes sequences that are not normally associated with a region of a recombinant construct, and/or are not normally associated with a particular cell. Thus, a "heterologous" region of a nucleic acid construct is an identifiable segment of nucleic acid within or attached to another nucleic acid molecule that is not found in association with the other molecule in nature. For example, a heterologous region of a construct could include a coding sequence flanked by sequences not found in association with the coding sequence in nature. Another example of a heterologous coding sequence is a construct where the coding sequence itself is not found in nature (e.g., synthetic sequences having codons different from the native gene). Similarly, a host cell transformed with a construct which is not normally present in the host cell would be considered heterologous for purposes of this invention.
As used herein, the term "non-naturally occurring", in reference to a cell, refers to a cell that has a non-naturally occurring nucleic acid or a non-naturally occurring peptide or is fused to a cell to which it is not fused with in nature.
The term "non-naturally occurring nucleic acid" refers to a portion of genomic nucleic acid, cDNA, semi-synthetic nucleic acid, or a synthetic origin nucleic acid which, by virtue of its origin or manipulation is not associated with all the nucleic acid with which it is associated in nature, or is linked to a nucleic acid or other chemical agent other than that to which it is linked in nature, or is not present in nature.
The term "a non-naturally occurring peptide" refers to a portion of a large naturally occurring peptide or protein, or semi-synthetic or synthetic peptide, which by virtue of its origin or manipulation is not associated with all of a peptide with which it is associated in nature, or is linked to peptides, functional groups or chemical agents other than that to which it is linked in nature, or is present in a purity that is not present in nature, or does not occur in nature. The term "proton" refers to a hydrogen ion and the term "transporter" refers to a composition that participates in the movement of a substrate across a cellular membrane.
A "proton-coupled peptide transporter" transports peptides across cellular membranes, which transport is linked or coupled to the transport of a proton or hydrogen ion across the same membrane. Preferably, the transporter is a protein encoded by a nucleic acid comprising one or more of the nucleic acids described herein, more preferably a PHT1 or PHT2 nucleic acid.
The term ''corresponding" means homologous to or complementary to a particular sequence of nucleic acid. As between nucleic acids and peptides, corresponding refers to amino acids of a peptide in an order derived from the sequence of a nucleic acid or the complement of the nucleic acid.
As used herein, an "antibody" refers to a protein consisting of one or more polypeptides substantially encoded by imrnunoglobulin genes or fragments of immunoglobulin genes. The recognized imrnunoglobulin genes include the kappa, lambda, alpha, gamma, delta, epsilon and mu constant region genes, as well as myriad immunoglobulin variable region genes. Light chains are classified as either kappa or lambda. Heavy chains are classified as gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, IgG, IgM, IgA, IgD and IgE, respectively.
A typical immunoglobulin (antibody) structural unit is known to comprise a tetramer. Each tetramer is composed of two identical pairs of polypeptide chains, each pair having one "light" (about 25 kD) and one "heavy" chain (about 50-70 kD). The N-terminus of each chain defines a variable region of about 100 to 110 or more amino acids primarily responsible for antigen recognition. The terms variable light chain (VL) and variable heavy chain (VH) refer to these light and heavy chains respectively. Antibodies exist as intact immunoglobulins or as a number of well characterized fragments produced by digestion with various peptidases. Thus, for example, pepsin digests an antibody below the disulfide linkages in the hinge region to produce F(ab)'2, a dimer of Fab which itself is a light chain joined to VH-CH1 by a disulfide bond. The F(ab)'2 may be reduced under mild conditions to break the disulfide linkage in the hinge region thereby converting the (Fab') dimer into a Fab' monomer. The Fab' monomer is essentially a Fab with part of the hinge region (see, Fundamental Immunology, W.E. Paul, ed., Raven Press, N.Y. (1993), for a more detailed description of other antibody fragments). While various antibody fragments are defined in terms of the digestion of an intact antibody, one of skill will appreciate that such Fab' fragments may be synthesized de novo either chemically or by utilizing recombinant DNA methodology. Thus, the term antibody, as used herein also includes antibody fragments either produced by the modification of whole antibodies or synthesized de novo using recombinant DNA methodologies. Preferred antibodies include single chain antibodies (antibodies that exist as a single polypeptide chain), more preferably single chain Fv antibodies (sFv or scFv) in which a variable heavy and a variable light chain are joined together (directly or through a peptide linker) to form a continuous polypeptide. The single chain Fv antibody is a covalently linked VH-VL heterodimer which may be expressed from a nucleic acid including VH- and V - encoding sequences either joined directly or joined by a peptide-encoding linker. Huston, et al. (1988) Proc. Nat. Acad. Sci. USA, 85: 5879-5883. While the VH and V are connected to each as a single polypeptide chain, the VH and VL domains associate non-covalently. The first functional antibody molecules to be expressed on the surface of filamentous phage were single-chain Fv's (scFv), however, alternative expression strategies have also been successful. For example Fab molecules can be displayed on phage if one of the chains (heavy or light) is fused to g3 capsid protein and the complementary chain exported to the periplasm as a soluble molecule. The two chains can be encoded on the same or on different replicons; the important point is that the two antibody chains in each Fab molecule assemble post-translationally and the dimer is incorporated into the phage particle via linkage of one of the chains to, e.g., g3p (see, e.g., U.S. Patent No: 5733743). The scFv antibodies and a number of other structures converting the naturally aggregated, but chemically separated light and heavy polypeptide chains from an antibody V region into a molecule that folds into a three dimensional structure substantially similar to the structure of an antigen-binding site are known to those of skill in the art (see e.g., U.S. Patent Nos. 5,091,513, 5,132,405, and 4,956,778). Particularly preferred antibodies should include all that have been displayed on phage (e.g., scFv, Fv, Fab and disulfide linked Fv (Reiter et al. (1995) Protein Eng. 8: 1323- 1331).
The term "specifically binds" when used to refer to binding proteins herein indicates that the binding preference (e.g., affinity for the target molecule/sequence is at least 2 fold, more preferably at least 5 fold, and most preferably at least 10 or 20 fold over a nonspecific (e.g. randomly generated molecule lacking the specifically recognized amino acid or amino acid sequence) target molecule. The phrases "hybridizing specifically to" or "specific hybridization" or "selectively hybridize to", refer to the binding, duplexing, or hybridizing of a nucleic acid molecule preferentially to a particular nucleotide sequence under stringent conditions when that sequence is present in a complex mixture (e.g., total cellular) DNA or RNA. The term "stringent conditions" refers to conditions under which a probe will hybridize preferentially to its target subsequence, and to a lesser extent to, or not at all to, other sequences. "Stringent hybridization" and "stringent hybridization wash conditions" in the context of nucleic acid hybridization experiments such as Southern and northern hybridizations are sequence dependent, and are different under different environmental parameters. An extensive guide to the hybridization of nucleic acids is found in Tijssen
(1993) Laboratory Techniques in Biochemistry and Molecular Biology— Hybridization with Nucleic Acid Probes part I chapter 2 Overview of principles of hybridization and the strategy of nucleic acid probe assays, Elsevier, New York. Generally, highly stringent hybridization and wash conditions are selected to be about 5L-J C lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm is the temperature (under defined ionic strength and pH) at which 50% of the target sequence hybridizes to a perfectly matched probe. Very stringent conditions are selected to be equal to the Tm for a particular probe.
An example of stringent hybridization conditions for hybridization of complementary nucleic acids which have more than 100 complementary residues on a filter in a Southern or northern blot is 50% formamide with 1 mg of heparin at 42DC, with the hybridization being carried out overnight. An example of highly stringent wash conditions is 0.15 M NaCI at 72°C for about 15 minutes. An example of stringent wash conditions is a 0.2x SSC wash at 65°C for 15 minutes (see, Sambrook et al. (1989) Molecular Cloning - A Laboratory Manual (2nd ed.) Vol. 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor Press, NY, (Sambrook et al.) supra for a description of SSC buffer). Often, a high stringency wash is preceded by a low stringency wash to remove background probe signal. An example medium stringency wash for a duplex of, e.g., more than 100 nucleotides, is lx SSC at 45 ϋC for 15 minutes. An example low stringency wash for a duplex of, e.g., more than 100 nucleotides, is 4-6x SSC at 40 DC for 15 minutes. In general, a signal to noise ratio of 2x (or higher) than that observed for an unrelated probe in the particular hybridization assay indicates detection of a specific hybridization. Nucleic acids which do not hybridize to each other under stringent conditions are still substantially identical if the polypeptides which they encode are substantially identical. This occurs, e.g., when a copy of a nucleic acid is created using the maximum codon degeneracy permitted by the genetic code.
In one particularly preferred embodiment, stringent conditions are characterized by hybridization in 1 M NaCI, 10 mM Tris-HCl, pH 8.0, 0.01% Triton X-100, 0.1 mg/ml fragmented herring sperm DNA with hybridization at 45°C with rotation at 50 RPM followed by washing first in 0.9 M NaCI, 0.06 M NaH2PO4, 0.006 M EDTA, 0.01% Tween-20 at 45°C for 1 hr, followed by 0.075 M NaCI, 0.005 M NaH2PO4, 0.5 mM EDTA at 45°C for 15 minutes.
The terms "identical" or percent "identity," in the context of two or more nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences that are the same or have a specified percentage of amino acid residues or nucleotides that are the same, when compared and aligned for maximum correspondence, as measured using one of the following sequence comparison algorithms or by visual inspection.
The phrase "substantially identical," in the context of two nucleic acids or polypeptides, refers to two or more sequences or subsequences that have at least 60%, preferably 80%, most preferably 90-95% nucleotide or amino acid residue identity, when compared and aligned for maximum correspondence, as measured using one of the following sequence comparison algorithms or by visual inspection. Preferably, the substantial identity exists over a region of the sequences that is at least about 50 residues in length, more preferably over a region of at least about 100 residues, and most preferably the sequences are substantially identical over at least about 150 residues. In a most preferred embodiment, the sequences are substantially identical over the entire length of the coding regions.
For sequence comparison, typically one sequence acts as a reference sequence, to which test sequences are compared. When using a sequence comparison algorithm, test and reference sequences are input into a computer, subsequence coordinates are designated, if necessary, and sequence algorithm program parameters are designated. The sequence comparison algorithm then calculates the percent sequence identity for the test sequence(s) relative to the reference sequence, based on the designated program parameters. Optimal alignment of sequences for comparison can be conducted, e.g., by the local homology algorithm of Smith & Waterman, Adv. Appl. Math. 2:482 (1981), by the homology alignment algorithm of Needleman & Wunsch, J. Mol. Biol. 48:443 (1970), by the search for similarity method of Pearson & Lipman (1988) Proc. Natl. Acad. Sci. USA 85:2444, by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, WI), or by visual inspection (see generally Ausubel et al, supra). One example of a useful algorithm is PILEUP. PILEUP creates a multiple sequence alignment from a group of related sequences using progressive, pairwise alignments to show relationship and percent sequence identity. It also plots a tree or dendogram showing the clustering relationships used to create the alignment. PILEUP uses a simplification of the progressive alignment method of Feng & Doolittle (1987) J. Mol. Evol. 35:351-360. The method used is similar to the method described by Higgins & Sharp (1989) CABIOS 5: 151-153. The program can align up to 300 sequences, each of a maximum length of 5,000 nucleotides or amino acids. The multiple alignment procedure begins with the pairwise alignment of the two most similar sequences, producing a cluster of two aligned sequences. This cluster is then aligned to the next most related sequence or cluster of aligned sequences. Two clusters of sequences are aligned by a simple extension of the pairwise alignment of two individual sequences. The final alignment is achieved by a series of progressive, pairwise alignments. The program is run by designating specific sequences and their amino acid or nucleotide coordinates for regions of sequence comparison and by designating the program parameters. For example, a reference sequence can be compared to other test sequences to determine the percent sequence identity relationship using the following parameters: default gap weight (3.00), default gap length weight (0.10), and weighted end gaps.
Another example of algorithm that is suitable for determining percent sequence identity and sequence similarity is the BLAST algorithm, which is described in Altschul et al. (1990) J. Mol. Biol. 215: 403-410. Software for performing BLAST analyses is publicly available through the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). This algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short words of length W in the query sequence, which either match or satisfy some positive-valued threshold score T when aligned with a word of the same length in a database sequence. T is referred to as the neighborhood word score threshold (Altschul et al, supra). These initial neighborhood word hits act as seeds for initiating searches to find longer HSPs containing them. The word hits are then extended in both directions along each sequence for as far as the cumulative alignment score can be increased. Cumulative scores are calculated using, for nucleotide sequences, the parameters M (reward score for a pair of matching residues; always > 0) and N (penalty score for mismatching residues; always < 0). For amino acid sequences, a scoring matrix is used to calculate the cumulative score. Extension of the word hits in each direction are halted when: the cumulative alignment score falls off by the quantity X from its maximum achieved value; the cumulative score goes to zero or below, due to the accumulation of one or more negative- scoring residue alignments; or the end of either sequence is reached. The BLAST algorithm parameters W, T, and X determine the sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) uses as defaults a wordlength (W) of 11, an expectation (E) of 10, M=5, N=-4, and a comparison of both strands. For amino acid sequences, the BLASTP program uses as defaults a wordlength (W) of 3, an expectation (E) of 10, and the BLOSUM62 scoring matrix (see Henikoff & Henikoff (1989) Proc. Natl. Acad. Sci. USA 89:10915).
In addition to calculating percent sequence identity, the BLAST algorithm also performs a statistical analysis of the similarity between two sequences (see, e.g., Karlin & Altschul (1993) Proc. Natl. Acad. Sci. USA ,90: 5873-5787). One measure of similarity provided by the BLAST algorithm is the smallest sum probability (P(N)), which provides an indication of the probability by which a match between two nucleotide or amino acid sequences would occur by chance. For example, a nucleic acid is considered similar to a reference sequence if the smallest sum probability in a comparison of the test nucleic acid to the reference nucleic acid is less than about 0.1, more preferably less than about 0.01, and most preferably less than about 0.001.
The term "small organic molecule" refers to a molecule of a size comparable to those organic molecules generally used in pharmaceuticals. The term excludes biological macromolecules (e.g., proteins, nucleic acids, etc.). Preferred small organic molecules range in size up to about 5000 Da, more preferably up to 2000 Da, and most preferably up to about 1000 Da.
BRIEF DESCRIPTION OF THE DRAWINGS
Figure IA show the . hPHTl EST map. Figure IB shows the alignments of the ESTs with the main contig. Figure IC shows a schematic of hPHTl splice variants. The sequence missing in splice variant B is identified. Identical sequence between splice Variant A & A' is identified. Unique sequence in splice variant A' and in splice variant A is indicated. Figure ID shows splice variant(hPHTl). Figure IE shows splice variant A(hPHTlvA) with alignment to hPHTl. Figure IF shows splice variant A'(hPHTlvA') with alignment to hPHTl. Figure 1G shows splice variant B(hPHTlvB) with alignment to hPHTl.
Figure 2 illustrates the structure of the hPHT2 g. Sequences of exons are provided in sequence listing (SEQ ID NOS: 9-16). Sequences of introns are provided in sequence listing (SEQ ID NOS: 17-23).
Figure 3A provides multiple sequence alignments for PHT branch. (Click alignment to see PDF). Figure 3B provides multiple sequence alignments for human PHT+PepT branch. Black: 50% or higher identity; Gray: conservative and similar substitutions. Figure 4A provides hydropathy plots (Kyte Doolittle method) for hPHTl
(upper left panel), hPHT2 (upper right panel), rPHTl (middle left panel), camp (middle right panel), hPepTl (lower left panel), and hPepT2 (lower right panel). Figure 4B shows topo diagrams for hPHTl and hPHT2.
Figure 5A shows RT-PCR analysis of hPHTl extracted from human small intestine tissue sample. Figure 5B shows Northern blot analysis of hPHTl, hPHT2, and actin expression in human tissues.
Figure 6 shows the structure and splicing of the hPEPTl gene. hPEPTl and hPEPTl-RF completely share exons 4-6 and partially share exons 3 and 7, where the alternative splice sites are located (indicated by the vertical lines). Exon T represents a repetitive element. Arrows indicate the translation start and stop sites. The Exonic sequences are shown in the sequence listings as SEQ ID Nos: 11 through 16 (hpeptl-rf exon 3 through hpeptl-rf exon 7').
Figure 7 illustrates the membrane topology prediction for hPEPTl. The prediction was carried out using the TOPPRED program. The exon boundaries are indicated by alternating shading.
Figure 8 illustrates the membrane topology prediction for hPEPTl- RF. The prediction was carried out using the TOPPRED program. The exon boundaries are indicated by alternating shading.
Figure 9 shows the putative promoter region of the human PEPT1 gene. Sequence of nucleotides upstream of the translations (f)(ATG)is shown. The numbering starts (+1) from the transcription start site (f ) to the negative values in the promoter region. The sites for the transcription factors in the promoter region are underlined and the corresponding indicated. DETAILED DESCRIPTION
This invention provides new members of the h+/oligopeptide transporter gene family. The new genes designated herein as hHPTl and hHPT2. The new genes appear to be members of the POT family of peptide transporters. In view of these results, the human POT family appears to contain at least four genes encoding peptide transporters and, without being bound to a particular theory, it is believed that each is likely to display a distinct pattern of tissue expression.
Tissue distribution of POT gene expression is of particular interest for achieving oral bioavailability or for targeting drugs to tumor tissues. Moreover, each member of the peptide transporter family is believed to exhibit some selectivity for peptides, peptoid drugs, and other agents. For example, we have determined that hPepTl is highly expressed in pancreatic and colon adenocarcinomas, including liver metastases considerably above the level seen in surrounding normal tissues. Thus substrates for hPepTl can be "specifically" delivered to these tissues. Having identified two new transporters, it is now possible to screen for agents
(e.g. drugs, prodrugs, etc.) whose uptake is mediated by these transporters. Having identified such agents they can be selectively prescribed when their deliver to tissues expressing hPHTl or hPHT2. Alternatively, drugs or prodrugs can be engineered with domains/sites, preferentially transported by hPHTl or hPHT2 and thereby enhance availability of these agents to various tissues.
It is also possible to screen for agents that modulate (e.g. up-regulate or downregulate) expression or activity of hPHTl or hPHTl. Such agents can be administered along with drugs transported by hPHTl or hPHT2 transporters to either enhance availability of the drug (e.g. upregulate hPHTl or hPHT2 expression) or diminish availability of the drug (e.g. down-regulate hPHTl or hPHT2) to tissues harboring hPHTl or hPHT2 genes.
In addition, gene therapy methods can be used to specifically deliver and express hPHTl or hPHT2 to preselected target tissues and thereby increase the availability of an hPHTl or hPHT2 transported agent to that tissue.
Nucleic acids encoding H+/oligopeptide transporters proteases. The nucleic acid and amino acid sequences of hPHTl and hPHT2 and primers sufficient to amplify the nucleic acid sequences are provided herein (see. e.g. SEQ ID NO: 17 for the full hPHTl sequence). It is noted that there are two splice variants of hPHTl. The sequence listing aligns the splice variants with each other. (A vs B). Another splice variant is actually a sequence obtained from a PCR amplification product using cDNAs from skeletal muscle. The PCR run gave a band with the expected sequence (509 bps of hPHTl), and a faster moving band (PCR-2) with 169 bps missing (probable frameshift mutation). Its alignment with hPHTl is given in the sequence listing as well.
Using the information provided herein, (e.g. hPHTl or hPHT2 sequences, primers, etc.) the nucleic acids encoding the full length peptide transporters or fragments of such nucleic acids (e.g. useful as probes in isolating transporter genes or measuring hPHTl or hPHT2 expression levels) are prepared using standard methods well known to those of skill in the art.
For example, the nucleic acid(s) may be cloned, or amplified by in vitro methods, such as the polymerase chain reaction (PCR), the ligase chain reaction (LCR), the transcription-based amplification system (TAS), the self-sustained sequence replication system (SSR), etc. A wide variety of cloning and in vitro amplification methodologies are well known to persons of skill in the art. Examples of these techniques and instructions sufficient to direct persons of skill through many cloning exercises are found in Berger and Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzymology 152 Academic Press, Inc., San Diego, CA (Berger); Sambrook et al. (1989) Molecular Cloning - A Laboratory Manual (2nd ed.) Vol. 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor Press, NY, (Sambrook et al); Current Protocols in Molecular Biology, F.M. Ausubel et al, eds., Current Protocols, a joint venture between Greene Publishing Associates, Inc. and John Wiley & Sons, Inc., (1994 Supplement) (Ausubel); Cashion et al., U.S. patent number 5,017,478; and Carr, European Patent No. 0,246,864.
Examples of techniques sufficient to direct persons of skill through in vitro amplification methods are found in Berger, Sambrook, and Ausubel, as well as Mullis et al, (1987) U.S. Patent No. 4,683,202; PCR Protocols A Guide to Methods and Applications (Innis et al. eds) Academic Press Inc. San Diego, CA (1990) (Innis); Arnheim & Levinson (October 1, 1990) C&EN 36-47; The Journal Of N1H Research (1991) 3: 81-94; (Kwoh et al. (1989) Proc. Natl. Acad. Sci. USA 86: 1173; Guatelli et al. (1990) Proc. Natl. Acad. Sci. USA 87, 1874; Lomell et al. (1989) J. Clin. Chem., 35: 1826; Landegren et al, (1988)
Science, 241: 1077-1080; Van Brunt (1990) Biotechnology, 8: 291-294; Wu and Wallace, (1989) Gene, 4: 560; and Barringer et al. (1990) Gene, 89: 117. The isolation and expression of the transporter nucleic acids is illustrated herein as well.
Where the transporter DNA, or their subsequences, are to be used as nucleic acid probes, it is often desirable to label the nucleic acids with detectable labels. The labels may be incorporated by any of a number of means well known to those of skill in the art. In one preferred embodiment, the label is simultaneously incorporated during the amplification step in the preparation of the sample nucleic acids. Thus, for example, polymerase chain reaction (PCR) with labeled primers or labeled nucleotides will provide a labeled amplification product. In another preferred embodiment, transcription amplification using a labeled nucleotide (e.g. fluorescein-labeled UTP and/or CTP) incorporates a label into the transcribed nucleic acids.
Alternatively, a label may be added directly to an original nucleic acid sample (e.g., mRNA, polyA mRNA, cDNA, etc.) or to the amplification product after the amplification is completed. Means of attaching labels to nucleic acids are well known to those of skill in the art and include, for example nick translation or end-labeling (e.g. with a labeled RNA) by kinasing of the nucleic acid and subsequent attachment (ligation) of a nucleic acid linker joining the sample nucleic acid to a label (e.g., a fluorophore). Suitable labels are described below.
II. Cloning and expression of proton/oligopeptide transporters. It is desirable express the proton/oligopeptide transporters of this invention in heterologous cells for use in the assays described herein. In addition, it is also useful to express the transporter proteins, or fragments thereof, to generate antibodies for a variety of applications (e.g. determining transporter expression level, etc.).
As explained below, the hPHTl and or hPHT2 polypeptides and various fragments thereof can be conveniently produced using synthetic chemical processes or recombinant expression methodologies.
A) De novo chemical synthesis.
The transporter polypeptides of this invention or fragments thereof, may be synthesized using standard chemical peptide synthesis techniques. Where the desired subsequences are relatively short (e.g., when a particular antigenic determinant is desired) the molecule may be synthesized as a single contiguous polypeptide. Where larger molecules are desired, subsequences can be synthesized separately (in one or more units) and then fused by condensation of the amino terminus of one molecule with the carboxyl terminus of the other molecule thereby forming a peptide bond.
Solid phase synthesis in which the C-terminal amino acid of the sequence is attached to an insoluble support followed by sequential addition of the remaining amino acids in the sequence is the preferred method for the chemical synthesis of the polypeptides of this invention. Techniques for solid phase synthesis are described by Barany and Merrifield, Solid-Phase Peptide Synthesis; pp. 3-284 in The Peptides: Analysis, Synthesis, Biology. Vol. 2: Special Methods in Peptide Synthesis, Part A., Merrifield, et al. (1963) J. Am. Chem. Soc, 85: 2149-2156, and Stewart et al. (1984) Solid Phase Peptide Synthesis, 2nd ed. Pierce Chem. Co., Rockford, 111.
B) Recombinant expression.
In a preferred embodiment, the transporter proteins of this invention, or subsequences thereof, are synthesized using recombinant expression systems. Generally this involves creating a DNA sequence that encodes the desired protein, placing the DNA in an expression cassette under the control of a particular promoter, and expressing the protein in a host cell. The host cell can then be used in the assays described herein. Alternatively, were isolated transporter proteins are desired, the expressed transporter can be recovered from the cell.
DNA encoding the transporter proteins described herein can be prepared by any suitable method as described above, including, for example, cloning and restriction of appropriate sequences or direct chemical synthesis by methods such as the phosphotriester method of Narang et al. (1979) Meth. Enzymol. 68: 90-99; the phosphodiester method of Brown et α/.(1979) Meth. Enzymol. 68: 109-151; the diethylphosphoramidite method of Beaucage et al. (1981) Tetra. Lett, 22: 1859-1862; and the solid support method of U.S. Patent No. 4,458,066.
Chemical synthesis produces a single stranded oligonucleotide. This may be converted into double stranded DNA by hybridization with a complementary sequence, or by polymerization with a DNA polymerase using the single strand as a template. One of skill would recognize that while chemical synthesis of DNA is limited to sequences of about 100 bases, longer sequences may be obtained by the ligation of shorter sequences. Alternatively, subsequences may be cloned and the appropriate subsequences cleaved using appropriate restriction enzymes. The fragments may then be ligated to produce the desired DNA sequence.
In one embodiment, the nucleic acids of this invention can be cloned using DNA amplification methods such as polymerase chain reaction (PCR). Thus, for example, the nucleic acid sequence or subsequence is PCR amplified, using a sense primer containing one restriction site (e.g., Ndel) and an antisense primer containing another restriction site (e.g., HindlTJ). This will produce a nucleic acid encoding the desired transporter sequence or subsequence and having terminal restriction sites. This nucleic acid can then be easily ligated into a vector that can be transfected into an appropriate host cell (e.g. an oocyte, a mammalian somatic cell, etc.)
Suitable PCR primers can be determined by one of skill in the art using the sequence information provided herein and representative primers are illustrated herein as well.. Appropriate restriction sites can also be added to the nucleic acid encoding the transporter protein or protein subsequence by site-directed mutagenesis.
The nucleic acid sequences encoding human transporter proteins or protein subsequences may be expressed in a variety of host cells, including E. coli, other bacterial hosts, yeast, and various higher eukaryotic cells such as the COS, CHO and HeLa cells lines and myeloma cell lines, and various vertebrate oocytes (e.g. Xenopus oocytes). The recombinant protein gene will be operably linked to appropriate expression control sequences for each host cell. For E. coli this includes a promoter such as the T7, trp, or lambda promoters, a ribosome binding site and preferably a transcription termination signal. For eukaryotic cells, the control sequences will include a promoter and often an enhancer (e.g., an enhancer derived from immunoglobulin genes, SV40, cytomegalo virus, etc.), and a polyadenylation sequence, and may include splice donor and acceptor sequences.
The vectors of the invention can be transferred into the chosen host cell by well-known methods such as calcium chloride transformation for E. coli and calcium phosphate treatment, microinjection, or electroporation for vertebrate cells. Cells transformed by the plasmids can be selected by resistance to antibiotics conferred by genes contained on the plasmids, such as the amp, gpt, neo and hyg genes.
Where it is desired to recover the transporter proteins of this invention, the recombinant hPHTl and/or hPHT2 protein(s) can be purified according to standard procedures of the art, including ammonium sulfate precipitation, affinity columns, column chromatography, gel electrophoresis and the like (see, generally, R. Scopes, (1982) Protein Purification, Springer- Verlag, N.Y.; Deutscher (1990) Methods in Enzymology Vol. 182: Guide to Protein Purification., Academic Press, Inc. N.Y.). Substantially pure compositions of at least about 90 to 95% homogeneity are preferred, and 98 to 99% or more homogeneity are most preferred. Once purified, partially or to homogeneity as desired, the polypeptides may then be used (e.g., as immunogens for antibody production).
One of skill would recognize that modifications can be made to the transporter proteins without diminishing their biological activity. Some modifications may be made to facilitate the cloning, expression, or incorporation of the targeting molecule into a fusion protein. Such modifications are well known to those of skill in the art and include, for example, a methionine added at the amino terminus to provide an initiation site, or additional amino acids (e.g., poly His) placed on either terminus to create conveniently located restriction sites or termination codons or purification sequences.
III. Assays for compositions transported by hPHTl or hPHT2 transporters. Having discovered new human peptide transporters, it is possible to screen for compounds specifically transported by these transporters. Other transporters have been shown to transport a wide variety of compositions in addition to peptides and/or amino acids. Such compounds include, but are not limited to, antibiotics (including several oral .beta.- lactams), oral angiotensin converting enzyme (ACE) inhibitors, oral renin inhibitors and the like. Thus, the transporters of this invention can readily be utilized in a screening system to identify molecules that they transport.
In a preferred embodiment, such assays involve expressing the transporters of this invention in a cell contacting the cell with the agent(s) it is desired to screen for the ability to be transported by the transporters of this invention and detecting and/or quantifying the amount of the agent(s) that are transported into the cell.
In preferred embodiments, the amount of transported agent can be compared to the amount of that agent transported by cells lacking the transporter and/or to the amount of an agent known not to be transported by the transporters of this invention (negative controls). Preferred embodiments, can also include a comparison to the amount of an agent transported by the cells where it is known that that agent is transported by the transporters of this invention (positive controls). The assay is typically scored as positive where there is a difference between the amount of test agent(s) transported and the negative control(s), preferably where the difference is statistically significant (e.g. at greater than 80%, preferably greater than about 90%, more preferably greater than about 98%, and most preferably greater than about 99% confidence level).
Cells suitable for such screening systems preferably include vertebrate cells (e.g., amphibian cells, mammalian cells, etc.) and, in certain embodiments, more preferably include mammalian cells of the tissue to which it is ultimately desired to deliver the test agent(s). Thus, for example, where it is ultimately desired to deliver agents to heart muscle, the cells may be cells of heart tissue.
However, in one particularly preferred embodiment, the assays can be convenientlyl run using oocytes. It is possible to simply inject mRNAs encoding the transporters of these cells into oocytes where they are expressed thereby providing a convenient system for the cellular assay. While the present invention contemplates the use of oocytes isolated from any non-human vertebrate organism, preferred embodiments of the assay feature amphibian oocytes, particularly oocytes which are approximately the same size, or larger, than oocytes which can be isolated from frog species of the genus Xenopus, e.g. Xenopus laevis. In general, the larger oocytes are preferred for ease of manipulation. Furthermore, expression of recombinant proteins and cell culturing techniques are each better characterized for amphibian oocytes, and a greater diversity of expression vectors are available for these systems.
Isolation of amphibian oocytes is well known in the art (see, for example, Soreq i. (1992) Methods in Enzymology 207: 225-265; Wang i. (1991) Int J Biochem 23: 271-276; Sigel (1990) J Membr Biol 117: 201-221; Martial et al. (1991) Biochem Biophys Acta 1090: 86-90; Brockes (1992) Proc. Natl. Acad. Sci., USA, 89:11386-11390; and U.S. Patent Nos. 4,985,352, 5,288,621, 6,020,479 , 5,919,699 5,919,628, and 5,202,257). For example, Xenopus oocytes can be harvested from female Xenopus laevis and processed using published techniques (Coleman et al, eds., Transcription and Translation: A Practical Approach. IRL Press, pp. 271-302; and Williams et al. (1988) Proc. Natl. Acad. Sci., USA, 85: 4939-4943). In one practice of the present invention, preparation of the assay includes obtaining oocytes from the excised ovaries of female frogs anesthetized by hypothermia and from which follicle cells have been removed by treatment with collagenase. Oocytes at a particular stage, e.g. Dumont stage V, can be selected and microinjected with the mRNA to be tested, e.g. for in vitro transcribed RNA ("cRNA").
Isolation of other suitable oocytes can be, as a matter of course, carried out by one of ordinary skill in the art. For instance, techniques routinely used in generation of transgenic animals, such as protocols for inducing superovulation and isolating fertilized eggs from various mammals (e.g. mice, rabbits, rats, sheep, goats or pigs) can be slightly modified (i.e. no fertilization step) in order to allow for isolation of mammalian oocytes for use in the subject method (see, e.g., U.S. Pat. No. 4,994,384). Moreover, protocols exist for in vitro maturation of mammalian oocytes, such as mature metaphase II oocytes. Several methods for expressing recombinant proteins in oocytes (and other cells) are generally known in the art. For example, expression of the recombinant protein(s) to be tested in the subject assay can be carried out by microinjection of cRNA encoding the protein, or by microinjection (or by other form of transfection) of an expression vector encoding the protein of interest. Either method can be carried out by employing the basics of expression cloning strategies known in the art. In one embodiment, cDNA libraries are cloned into vectors that can be used for in vitro RNA synthesis. For instance, the pCS2+/- vector contains SP6, T7 and T3 promoters that have been introduced upstream and downstream of a cloning site in order to permit in vitro RNA synthesis upon linearization of the plasmid. In an illustrative embodiment, a plasmid containing the cDNA to be tested can be linearized by cutting downstream from the cDNA insert with a restriction enzyme. The post-restriction digest is digested with Proteinase K and then extracted with two phenol: chloroform (1:1) extractions. The resulting DNA fragments are then ethanol precipitated. The precipitated fragments are mixed with either T3 RNA polymerase (to make sense strand), or T7 RNA polymerase (to make anti-sense strand), plus rATP, rCTP, rGTP, rUTP, and RNase inhibitor. Simultaneously, capped RNA can be produced in vitro (Krieg and Melton, (1987) Meth Enzymol 155: 397-415; and Richardson et al. (1988) Bio/Technology 6: 565-570). Other exemplary vectors useful in the subject assay include: the pSP64T vector (Kreig et al. (1984) Nuc Acid Res 12:7057-7071) which contains the SP6 promoter and the 5' and 3' untranslated flanking regions of Xenopus β-globin cDNA to provide more stable RNA for translation in injected oocytes; the pOEV expression vector (Pfaff et al. (1990) Anal Biochem 188: 192-199) which permits cloned DNA to be transcribed and translated directly in oocytes; and the pMT2 expression vector (Swick et al. (1992) Proc. Natl. Acad. Sci, USA, 89:1812-1816).
It may be desirable to co-express a marker gene in the oocyte in order to standardize the comparison of effects based on level of expression occurring in the oocytes. For example, an α-amylase gene construct can be provided in the oocyte, and the amylase activity measured in the oocyte (Urnes et al. (1990) Gene 95: 267-274. The level of expression for other proteins can therefore be standardized based on the amount of recombinant amylase produced. Dose response curves can be constructed based on the level of expression of the amylase reporter in the oocyte. As indicated above, the cell expressing the peptide transporter(s) of this invention can be contacted with the agent(s) to be screened and the amount of agent that is internalized is detected. The is routinely accomplished by either measuring depletion of the agent in the media contacting the cell or measuring the amount of the agent internalized by the cell. Typically the test agent(s) are labeled with a detectable label to facilitate their detection in the subject cell and/or media.
Detectable labels suitable for use in the present invention include any composition detectable by spectroscopic, photochemical, biochemical, immunochemical, electrical, optical or chemical means. Useful labels in the present invention include biotin for staining with labeled streptavidin conjugate, magnetic beads (e.g., Dynabeads ), fluorescent dyes (e.g., fluorescein, texas red, rhodamine, green fluorescent protein, and the like, see, e.g., Molecular Probes, Eugene, Oregon, USA), radiolabels (e.g., 3H, 1251, 35S, 14C, or 32P), enzymes (e.g., horse radish peroxidase, alkaline phosphatase and others commonly used in an ELISA), and colorimetric labels such as colloidal gold (e.g., gold particles in the 40 -80 nm diameter size range scatter green light with high efficiency) or colored glass or plastic (e.g., polystyrene, polypropylene, latex, etc.) beads. Patents teaching the use of such labels include U.S. Patent Nos. 3,817,837; 3,850,752; 3,939,350; 3,996,345; 4,277,437; 4,275,149; and 4,366,241. Fluorescent labels, colorimetric labels, and radiolabels are particularly preferred.
The foregoing description of assays is intended to be illustrative and not limiting. Using the teachings provided herein, other variations of such assays will be apparent to one of skill in the art. Such variations include, but are not limited to,, the use of a tissues and/or cells that express endogenous transporters of this invention in the assays described above. It is also noted that assays for screening of uptake of test agents by various peptide transporters is described in U.S. Patents 6,020,479 , 5,919,699, and 5,919,628.
IV. Assays for modulators of hPHTl and/or hPHT2 expression.
As indicated above, in one aspect, this invention is premised, in part, on the discovery of new h+/oligopeptide transporter (e.g. hPHTl and/or hPHT2). Agents that downregulate expression of decrease the bioavailability of compounds internalized by these receptors, while agents that upregulate hPHTl or hPHT2 increase the bioavailability of compounds internalized by these transporters.
Thus, in one embodiment, this invention provides methods of screening for agents that modulate expression and/or activity. The methods involve detecting the expression level and/or activity level of hPHTl or hPHT2 genes or gene products (e.g. hPHTl or hPHT2 mRNA or proteins) in the presence of the agent(s) in question. A reduced hPHTl or hPHT2 expression level or activity level in the presence of the agent as compared to a negative control where the test agent is absent or at reduced concentration indicates that the agent downregulates hPHTl or hPHT2 activity or expression. Conversely, increased hPHTl or hPHT2 expression level or activity level in the presence of the agent as compared to a negative control where the test agent is absent or at reduced concentration indicates that the agent up-regulates hPHTl or hPHT2 activity or expression
Expression levels of a gene can be altered by changes in the transcription of the gene product (i.e. transcription of mRNA), and/or by changes in translation of the gene product (i.e. translation of the protein), and/or by post-translational modification(s) (e.g. protein folding, glycosylation, etc.). Thus preferred assays of this invention include assaying for level of transcribed mRNA (or other nucleic acids derived from the hPHTl or hPHT2 genes), level of translated protein, activity of translated protein, etc. Examples of such approaches are described below.
A) Nucleic-acid based assays.
1) Target molecules.
Changes in expression level can be detected by measuring changes in hPHTl and/or hPHT2 genomic DNA or a nucleic acid derived from the genomic DNA (e.g., hPHTl or hPHT2 mRNA, reverse-transcribed cDNA, etc.). In order to measure the hPHTl or hPHT2 expression level it is desirable to provide a nucleic acid sample for such analysis. In preferred embodiments the nucleic acid is found in or derived from a biological sample. The term "biological sample", as used herein, refers to a sample obtained from an organism or from components (e.g., cells) of an organism. The sample may be of any biological tissue or fluid. Biological samples may also include organs or sections of tissues such as frozen sections taken for histological purposes.
The nucleic acid (e.g., mRNA or a nucleic acid derived from an mRNA) is, in certain preferred embodiments, isolated from the sample according to any of a number of methods well known to those of skill in the art. Methods of isolating mRNA are well known to those of skill in the art. For example, methods of isolation and purification of nucleic acids are described in detail in by Tijssen ed., (1993) Chapter 3 of Laboratory Techniques in Biochemistry and Molecular Biology: Hybridization With Nucleic Acid Probes, Part I. Theory and Nucleic Acid Preparation, Elsevier, N.Y. and Tijssen ed.
In a preferred embodiment, the "total" nucleic acid is isolated from a given sample using, for example, an acid guanidinium-phenol-chloroform extraction method and polyA+ mRNA is isolated by oligo dT column chromatography or by using (dT)n magnetic beads (see, e.g., Sambrook et al, Molecular Cloning: A Laboratory Manual (2nd ed.), Vols. 1-3, Cold Spring Harbor Laboratory, (1989), or Current Protocols in Molecular Biology, F. Ausubel et al., ed. (1987) Greene Publishing and Wiley-Interscience, New York).
Frequently, it is desirable to amplify the nucleic acid sample prior to assaying for expression level. Methods of amplifying nucleic acids are well known to those of skill in the art and include, but are not limited to polymerase chain reaction (PCR, see. e.g, Innis, et al, (1990) PCR Protocols. A guide to Methods and Application. Academic Press, Inc. San Diego,), ligase chain reaction (LCR) (see Wu and Wallace (1989) Genomics 4: 560, Landegren et al. (1988) Science 241: 1077, and Barringer et al. (1990) Gene 89: 117, transcription amplification (Kwoh et al. (1989) Proc. Natl. Acad. Sci. USA_86: 1173), self- sustained sequence replication (Guatelli et al. (1990) Proc. Nat. Acad. Sci. USA 87: 1874), dot PCR, and linker adapter PCR, etc.).
In a particularly preferred embodiment, where it is desired to quantify the transcription level (and thereby expression) of hPHTl and/or hPHT2 in a sample, the nucleic acid sample is one in which the concentration of the hPHTl and/or hPHT2 mRNA transcript(s), or the concentration of the nucleic acids derived from the hPHTl and/or hPHT2 mRNA transcript(s), is proportional to the transcription level (and therefore expression level) of that gene. Similarly, it is preferred that the hybridization signal intensity be proportional to the amount of hybridized nucleic acid. While it is preferred that the proportionality be relatively strict (e.g., a doubling in transcription rate results in a doubling in mRNA transcript in the sample nucleic acid pool and a doubling in hybridization signal), one of skill will appreciate that the proportionality can be more relaxed and even non-linear. Thus, for example, an assay where a 5 fold difference in concentration of the target mRNA results in a 3 to 6 fold difference in hybridization intensity is sufficient for most purposes.
Where more precise quantification is required appropriate controls can be run to correct for variations introduced in sample preparation and hybridization as described herein. In addition, serial dilutions of "standard" target nucleic acids (e.g., mRNAs) can be used to prepare calibration curves according to methods well known to those of skill in the art. Of course, where simple detection of the presence or absence of a transcript or large differences of changes in nucleic acid concentration is desired, no elaborate control or calibration is required.
In the simplest embodiment, the hPHTl and/or bPHT2-containing nucleic acid sample is the total mRNA or a total cDNA isolated and/or otherwise derived from a biological sample. The nucleic acid may be isolated from the sample according to any of a number of methods well known to those of skill in the art as indicated above.
2) Ηybridization-based assays.
Using the known sequence of hPHTl and/or hPHT2 (see sequence listing) detecting and/or quantifying the hPHTl and/or hPHT2 transcript(s) can be routinely accomplished using nucleic acid hybridization techniques (see, e.g., Sambrook et al. supra). For example, one method for evaluating the presence, absence, or quantity of hPHTl and/or hPHT2 genomic DNA or reverse-transcribed cDNA involves a "Southern Blot". In a Southern Blot, the DNA typically fragmented and separated on an electrophoretic gel, is hybridized to a probe specific for hPHTl and/or hPHT2. Comparison of the intensity of the hybridization signal from the hPHTl and/or hPHT2 probe with a "control" probe (e.g. a probe for a "housekeeping gene) provides an estimate of the relative expression level of the target nucleic acid.
Alternatively, the hPHTl and/or hPHT2 mRNA can be directly quantified in a Northern blot. In brief, the mRNA is isolated from a given cell sample using, for example, an acid guanidinium-phenol-chloroform extraction method. The mRNA is then electrophoresed to separate the mRNA species and the mRNA is transferred from the gel to a nitrocellulose membrane. As with the Southern blots, labeled probes are used to identify and/or quantify the target hPHTl and/or hPHT2 mRNA. Appropriate controls (e.g. probes to housekeeping genes) provide a reference for evaluating relative expression level.
An alternative means for determining the hPHTl and/or hPHT2 expression level is in situ hybridization. In situ hybridization assays are well known (e.g., Angerer (1987) Meth. Enzymol 152: 649). Generally, in situ hybridization comprises the following major steps: (1) fixation of tissue or biological structure to be analyzed; (2) prehybridization treatment of the biological structure to increase accessibility of target DNA, and to reduce nonspecific binding; (3) hybridization of the mixture of nucleic acids to the nucleic acid in the biological structure or tissue; (4) post-hybridization washes to remove nucleic acid fragments not bound in the hybridization and (5) detection of the hybridized nucleic acid fragments. The reagent used in each of these steps and the conditions for use vary depending on the particular application.
In some applications it is necessary to block the hybridization capacity of repetitive sequences. Thus, in some embodiments, tRNA, human genomic DNA, or Cot-1 DNA is used to block non-specific hybridization.
3) Amplification-based assays.
In another embodiment, amplification-based assays can be used to measure hPHTl and/or hPHT2 expression (transcription) level. In such amplification-based assays, the target nucleic acid sequences (i.e., hPHTl and/or hPHT2) act as template(s) in amplification reaction(s) (e.g. Polymerase Chain Reaction (PCR) or reverse-transcription PCR (RT-PCR)). In a quantitative amplification, the amount of amplification product will be proportional to the amount of template (e.g., hPHTl and/or hPHT2 mRNA) in the original sample. Comparison to appropriate (e.g. healthy tissue or cells unexposed to the test agent) controls provides a measure of the hPHTl and/or hPHT2 transcript level.
Methods of "quantitative" amplification are well known to those of skill in the art. For example, quantitative PCR involves simultaneously co-amplifying a known quantity of a control sequence using the same primers. This provides an internal standard that may be used to calibrate the PCR reaction. Detailed protocols for quantitative PCR are provided in Innis et al. (1990) PCR Protocols, A Guide to Methods and Applications, Academic Press, Inc. N.Y.). One approach, for example, involves simultaneously co-amplifying a known quantity of a control sequence using the same primers as those used to amplify the target. This provides an internal standard that may be used to calibrate the PCR reaction.
One preferred internal standard is a synthetic AW106 cRNA. The AW106 cRNA is combined with RNA isolated from the sample according to standard techniques known to those of skill in the art. The RNA is then reverse transcribed using a reverse transcriptase to provide copy DNA. The cDNA sequences are then amplified (e.g., by PCR) using labeled primers. The amplification products are separated, typically by electrophoresis, and the amount of labeled nucleic acid (proportional to the amount of amplified product) is determined. The amount of mRNA in the sample is then calculated by comparison with the signal produced by the known AW106 RNA (or other) standard.
Detailed protocols for quantitative PCR are provided in PCR Protocols, A Guide to Methods and Applications, Innis et al. (1990) Academic Press, Inc. N.Y.. The nucleic acid sequence(s) for hPHTl and hPHT2 provided herein are sufficient to enable one of skill to routinely select primers to amplify any portion of the gene.
4) Hybridization Formats and Optimization of hybridization conditions.
a) Array-based hybridization formats.
In one embodiment, the methods of this invention can be utilized in array- based hybridization formats. Arrays are a multiplicity of different "probe" or "target" nucleic acids (or other compounds) attached to one or more surfaces (e.g., solid, membrane, or gel). In a preferred embodiment, the multiplicity of nucleic acids (or other moieties) is attached to a single contiguous surface or to a multiplicity of surfaces juxtaposed to each other.
In an array format a large number of different hybridization reactions can be run essentially "in parallel." This provides rapid, essentially simultaneous, evaluation of a number of hybridizations in a single "experiment". Methods of performing hybridization reactions in array based formats are well known to those of skill in the art (see, e.g., Pastinen (1997) Genome Res. 7: 606-614; Jackson (1996) Nature Biotechnology 14:1685; Chee (1995) Science 274: 610; WO 96/17958, Pinkel et al. (1998) Nature Genetics 20: 207-211). Arrays, particularly nucleic acid arrays can be produced according to a wide variety of methods well known to those of skill in the art. For example, in a simple embodiment, "low density" arrays can simply be produced by spotting (e.g. by hand using a pipette) different nucleic acids at different locations on a solid support (e.g. a glass surface, a membrane, etc.).
This simple spotting, approach has been automated to produce high density spotted arrays (see, e.g., U.S. Patent No: 5,807,522). This patent describes the use of an automated system that taps a microcapillary against a surface to deposit a small volume of a biological sample. The process is repeated to generate high density arrays.
Arrays can also be produced using oligonucleotide synthesis technology. Thus, for example, U.S. Patent No. 5,143,854 and PCT Patent Publication Nos. WO 90/15070 and 92/10092 teach the use of light-directed combinatorial synthesis of high density oligonucleotide arrays. Synthesis of high density arrays is also described in U.S. Patents 5,744,305, 5,800,992 and 5,445,934.
b) Other hybridization formats.
As indicated above a variety of nucleic acid hybridization formats are known to those skilled in the art. For example, common formats include sandwich assays and competition or displacement assays. Such assay formats are generally described in Hames and Higgins (1985) Nucleic Acid Hybridization, A Practical Approach, IRL Press; Gall and Pardue (1969) Proc. Natl. Acad. Sci. USA 63: 378-383; and John et al. (1969) Nature 223: 582-587.
Sandwich assays are commercially useful hybridization assays for detecting or isolating nucleic acid sequences. Such assays utilize a "capture" nucleic acid covalently immobilized to a solid support and a labeled "signal" nucleic acid in solution. The sample will provide the target nucleic acid. The "capture" nucleic acid and "signal" nucleic acid probe hybridize with the target nucleic acid to form a "sandwich" hybridization complex. To be most effective, the signal nucleic acid should not hybridize with the capture nucleic acid. Typically, labeled signal nucleic acids are used to detect hybridization.
Complementary nucleic acids or signal nucleic acids may be labeled by any one of several methods typically used to detect the presence of hybridized polynucleotides. The most common method of detection is the use of autoradiography with 3H, 1251, 35S, 14C, or 32P- labelled probes or the like. Other labels include ligands that bind to labeled antibodies, fluorophores, chemi-luminescent agents, enzymes, and antibodies which can serve as specific binding pair members for a labeled ligand. Detection of a hybridization complex may require the binding of a signal generating complex to a duplex of target and probe polynucleotides or nucleic acids. Typically, such binding occurs through ligand and anti-ligand interactions as between a ligand-conjugated probe and an anti-ligand conjugated with a signal. The sensitivity of the hybridization assays may be enhanced through use of a nucleic acid amplification system that multiplies the target nucleic acid being detected. Examples of such systems include the polymerase chain reaction (PCR) system and the ligase chain reaction (LCR) system. Other methods recently described in the art are the nucleic acid sequence based amplification (NASBAO, Cangene, Mississauga, Ontario) and Q Beta Replicase systems.
c) Optimization of hybridization conditions.
Nucleic acid hybridization simply involves providing a denatured probe and target nucleic acid under conditions where the probe and its complementary target can form stable hybrid duplexes through complementary base pairing. The nucleic acids that do not form hybrid duplexes are then washed away leaving the hybridized nucleic acids to be detected, typically through detection of an attached detectable label. It is generally recognized that nucleic acids are denatured by increasing the temperature or decreasing the salt concentration of the buffer containing the nucleic acids, or in the addition of chemical agents, or the raising of the pH. Under low stringency conditions (e.g., low temperature and/or high salt and/or high target concentration) hybrid duplexes (e.g., DNA:DNA,
RNA:RNA, or RNA:DNA) will form even where the annealed sequences are not perfectly complementary. Thus specificity of hybridization is reduced at lower stringency. Conversely, at higher stringency (e.g., higher temperature or lower salt) successful hybridization requires fewer mismatches. One of skill in the art will appreciate that hybridization conditions may be selected to provide any degree of stringency. In a preferred embodiment, hybridization is performed at low stringency to ensure hybridization and then subsequent washes are performed at higher stringency to eliminate mismatched hybrid duplexes. Successive washes may be performed at increasingly higher stringency (e.g., down to as low as 0.25 X SSPE at 37°C to 70°C) until a desired level of hybridization specificity is obtained. Stringency can also be increased by addition of agents such as formamide. Hybridization specificity may be evaluated by comparison of hybridization to the test probes with hybridization to the various controls that can be present.
In general, there is a tradeoff between hybridization specificity (stringency) and signal intensity. Thus, in a preferred embodiment, the wash is performed at the highest stringency that produces consistent results and that provides a signal intensity greater than approximately 10% of the background intensity. Thus, in a preferred embodiment, the hybridized array may be washed at successively higher stringency solutions and read between each wash. Analysis of the data sets thus produced will reveal a wash stringency above which the hybridization pattern is not appreciably altered and which provides adequate signal for the particular probes of interest.
In a preferred embodiment, background signal is reduced by the use of a blocking reagent (e.g., tRNA, sperm DNA, cot-1 DNA, etc.) during the hybridization to reduce non-specific binding. The use of blocking agents in hybridization is well known to those of skill in the art (see, e.g., Chapter 8 in P. Tijssen, supra.) Methods of optimizing hybridization conditions are well known to those of skill in the art (see, e.g., Tijssen (1993) Laboratory Techniques in Biochemistry and Molecular Biology, Vol. 24: Hybridization With Nucleic Acid Probes, Elsevier, N.Y.).
Optimal conditions are also a function of the sensitivity of label (e.g., fluorescence) detection for different combinations of substrate type, fluorochrome, excitation and emission bands, spot size and the like. Low fluorescence background surfaces can be used (see, e.g., Chu (1992) Electrophoresis 13:105-114). The sensitivity for detection of spots ("target elements") of various diameters on the candidate surfaces can be readily determined by, e.g., spotting a dilution series of fluorescently end labeled DNA fragments. These spots are then imaged using conventional fluorescence microscopy. The sensitivity, linearity, and dynamic range achievable from the various combinations of fluorochrome and solid surfaces (e.g., glass, fused silica, etc.) can thus be determined. Serial dilutions of pairs of fluorochrome in known relative proportions can also be analyzed. This determines the accuracy with which fluorescence ratio measurements reflect actual fluorochrome ratios over the dynamic range permitted by the detectors and fluorescence of the substrate upon which the probe has been fixed. d) Labeling and detection of nucleic acids.
The probes used herein for detection of hPHTl and/or hPHT2 expression levels can be full length or less than the full length of the hPHTl and/or hPHT2 mRNA. Shorter probes are empirically tested for specificity. Preferred probes are sufficiently long so as to specifically hybridize with the hPHTl and/or hPHT2 target nucleic acid(s) under stringent conditions. The preferred size range is from about 20 bases to the length of the hPHTl and/or hPHT2 mRNA, more preferably from about 30 bases to the length of the hPHTl and/or hPHT2 mRNA, and most preferably from about 40 bases to the length of the hPHTl and/or hPHT2 mRNA. The probes are typically labeled, with a detectable label. Detectable labels suitable for use in the present invention include any composition detectable by spectroscopic, photochemical, biochemical, immunochemical, electrical, optical or chemical means. Useful labels in the present invention include biotin for staining with labeled streptavidin conjugate, magnetic beads (e.g., Dynabeads™), fluorescent dyes (e.g., fluorescein, texas red, rhodamine, green fluorescent protein, and the like, see, e.g., Molecular Probes, Eugene, Oregon, USA), radiolabels (e.g., 3H, 1251, 35S, 14C, or 32P), enzymes (e.g., horse radish peroxidase, alkaline phosphatase and others commonly used in an ELISA), and colorimetric labels such as colloidal gold (e.g., gold particles in the 40 -80 nm diameter size range scatter green light with high efficiency) or colored glass or plastic (e.g., polystyrene, polypropylene, latex, etc.) beads. Patents teaching the use of such labels include U.S. Patent Nos. 3,817,837; 3,850,752; 3,939,350; 3,996,345; 4,277,437; 4,275,149; and 4,366,241.
A fluorescent label is preferred because it provides a very strong signal with low background. It is also optically detectable at high resolution and sensitivity through a quick scanning procedure. The nucleic acid samples can all be labeled with a single label, e.g., a single fluorescent label. Alternatively, in another embodiment, different nucleic acid samples can be simultaneously hybridized where each nucleic acid sample has a different label. For instance, one target could have a green fluorescent label and a second target could have a red fluorescent label. The scanning step will distinguish sites of binding of the red label from those binding the green fluorescent label. Each nucleic acid sample (target nucleic acid) can be analyzed independently from one another.
Suitable chromogens which can be employed include those molecules and compounds which absorb light in a distinctive range of wavelengths so that a color can be observed or, alternatively, which emit light when irradiated with radiation of a particular wave length or wave length range, e.g., fluorescent molecules.
Desirably, fluorescent labels should absorb light above about 300 nm, preferably about 350 nm, and more preferably above about 400 nm, usually emitting at wavelengths greater than about 10 nm higher than the wavelength of the light absorbed. It should be noted that the absorption and emission characteristics of the bound dye can differ from the unbound dye. Therefore, when referring to the various wavelength ranges and characteristics of the dyes, it is intended to indicate the dyes as employed and not the dye which is unconjugated and characterized in an arbitrary solvent. Fluorescent labels are generally preferred because by irradiating a fluorescent molecule with light, one can obtain a plurality of emissions. Thus, a single label can provide for a plurality of measurable events.
Detectable signal can also be provided by chemi luminescent and bioluminescent sources. Chemiluminescent sources include a compound which becomes electronically excited by a chemical reaction and can then emit light which serves as the detectable signal or donates energy to a fluorescent acceptor. Alternatively, luciferins can be used in conjunction with luciferase or lucigenins to provide bioluminescence.
Spin labels are provided by reporter molecules with an unpaired electron spin which can be detected by electron spin resonance (ESR) spectroscopy. Exemplary spin labels include organic free radicals, transitional metal complexes, particularly vanadium, copper, iron, and manganese, and the like. Exemplary spin labels include nitroxide free radicals.
The label may be added to the target (sample) nucleic acid(s) prior to, or after the hybridization. So called "direct labels" are detectable labels that are directly attached to or incorporated into the target (sample) nucleic acid prior to hybridization. In contrast, so called "indirect labels" are joined to the hybrid duplex after hybridization. Often, the indirect label is attached to a binding moiety that has been attached to the target nucleic acid prior to the hybridization. Thus, for example, the target nucleic acid may be biotinylated before the hybridization. After hybridization, an avidin-conjugated fluorophore will bind the biotin bearing hybrid duplexes providing a label that is easily detected. For a detailed review of methods of labeling nucleic acids and detecting labeled hybridized nucleic acids see
Laboratory Techniques in Biochemistry and Molecular Biology, Vol. 24: Hybridization With
Nucleic Acid Probes, P. Tijssen, ed. Elsevier, N.Y., (1993)). Fluorescent labels are easily added during an in vitro transcription reaction. Thus, for example, fluorescein labeled UTP and CTP can be incorporated into the RNA produced in an in vitro transcription.
The labels can be attached directly or through a linker moiety. In general, the site of label or linker-label attachment is not limited to any specific position. For example, a label may be attached to a nucleoside, nucleotide, or analogue thereof at any position that does not interfere with detection or hybridization as desired. For example, certain Label-On Reagents from Clontech (Palo Alto, CA) provide for labeling interspersed throughout the phosphate backbone of an oligonucleotide and for terminal labeling at the 3' and 5' ends. As shown for example herein, labels can be attached at positions on the ribose ring or the ribose can be modified and even eliminated as desired. The base moieties of useful labeling reagents can include those that are naturally occurring or modified in a manner that does not interfere with the purpose to which they are put. Modified bases include but are not limited to 7-deaza A and G, 7-deaza-8-aza A and G, and other heterocyclic moieties. It will be recognized that fluorescent labels are not to be limited to single species organic molecules, but include inorganic molecules, multi -molecular mixtures of organic and/or inorganic molecules, crystals, heteropolymers, and the like. Thus, for example, CdSe-CdS core-shell nanocrystals enclosed in a silica shell can be easily derivatized for coupling to a biological molecule (Bruchez et al. (1998) Science, 281: 2013- 2016). Similarly, highly fluorescent quantum dots (zinc sulfide-capped cadmium selenide) have been covalently coupled to biomolecules for use in ultrasensitive biological detection (Warren and Nie (1998) Science, 281: 2016-2018).
B) Polypeptide-based assays -- Polypeptide expression.
1) Assay Formats. In addition to, or in alternative to, the detection of hPHTl and/or hPHT2 nucleic acid expression level(s), alterations in expression of hPHTl and/or hPHT2 transporters can be detected and/or quantified by detecting and/or quantifying the amount and/or activity of translated hPHTl and or hPHT2 polypeptide or fragments thereof.
2) Detection of expressed protein. The polypeptide(s) encoded by the hPHTl and/or hPHT2 gene(s) can be detected and quantified by any of a number of methods well known to those of skill in the art. These may include analytic biochemical methods such as electrophoresis, capillary electrophoresis, high performance liquid chromatography (HPLC), thin layer chromatography (TLC), hyperdiffusion chromatography, and the like, or various immunological methods such as fluid or gel precipitin reactions, immunodiffusion (single or double), immunoelectrophoresis, radioimmunoassay (RIA), enzyme-linked immunosorbent assays (ELISAs), immunofluorescent assays, western blotting, and the like.
In one preferred embodiment, the hPHTl and/or hPHT2 polypeptide(s) are detected quantified in an electrophoretic protein separation (e.g. a 1- or 2-dimensional electrophoresis). Means of detecting proteins using electrophoretic techniques are well known to those of skill in the art (see generally, R. Scopes (1982) Protein Purification, Springer- Verlag, N.Y.; Deutscher, (1990) Methods in Enzymology Vol. 182: Guide to Protein Purification, Academic Press, Inc., N.Y.).
In another preferred embodiment, Western blot (immunoblot) analysis is used to detect and quantify the presence of polypeptide(s) of this invention in the sample. This technique generally comprises separating sample proteins by gel electrophoresis on the basis of molecular weight, transferring the separated proteins to a suitable solid support, (such as a nitrocellulose filter, a nylon filter, or derivatized nylon filter), and incubating the sample with the antibodies that specifically bind the target polypeptide(s).
The antibodies specifically bind to the target polypeptide(s) and may be directly labeled or alternatively may be subsequently detected using labeled antibodies (e.g., labeled sheep anti-mouse antibodies) that specifically bind to the a domain of the antibody.
In preferred embodiments, the hPHTl and/or hPHT2 polypeptide(s) are detected using an immunoassay. As used herein, an immunoassay is an assay that utilizes an antibody to specifically bind to the analyte (e.g., the target polypeptide(s)). The immunoassay is thus characterized by detection of specific binding of a polypeptide of this invention to an antibody as opposed to the use of other physical or chemical properties to isolate, target, and quantify the analyte.
Any of a number of well recognized immunological binding assays (see, e.g., U.S. Patents 4,366,241; 4,376,110; 4,517,288; and 4,837,168) are well suited to detection or quantification of the polypeptide(s) identified herein.. For a review of the general immunoassays, see also Asai (1993) Methods in Cell Biology Volume 37: Antibodies in Cell Biology, Academic Press, Inc. New York; Stites & Terr (1991) Basic and Clinical Immunology 7th Edition. Immunological binding assays (or immunoassays) typically utilize a "capture agent" to specifically bind to and often immobilize the analyte (hPHTl and/or hPHT2 polypeptide). In preferred embodiments, the capture agent is an antibody.
Immunoassays also often utilize a labeling agent to specifically bind to and label the binding complex formed by the capture agent and the analyte. The labeling agent may itself be one of the moieties comprising the antibody/analyte complex. Thus, the labeling agent may be a labeled polypeptide or a labeled antibody that specifically recognizes the already bound target polypeptide. Alternatively, the labeling agent may be a third moiety, such as another antibody, that specifically binds to the capture agent /polypeptide complex.
Other proteins capable of specifically binding immunoglobulin constant regions, such as protein A or protein G may also be used as the label agent. These proteins are normal constituents of the cell walls of streptococcal bacteria. They exhibit a strong non- immunogenic reactivity with immunoglobulin constant regions from a variety of species (see, generally Kronval, et al. (1973) J. Immunol, 111: 1401-1406, and Akerstrom (1985) J. Immunol, 135: 2589-2542).
Preferred immunoassays for detecting the target polypeptide(s) are either competitive or noncompetitive. Noncompetitive immunoassays are assays in which the amount of captured analyte is directly measured. In one preferred "sandwich" assay, for example, the capture agents (antibodies) can be bound directly to a solid substrate where they are immobilized. These immobilized antibodies then capture the target polypeptide present in the test sample. The target polypeptide thus immobilized is then bound by a labeling agent, such as a second antibody bearing a label.
In competitive assays, the amount of analyte (hPHTl and/or hPHT2 polypeptide) present in the sample is measured indirectly by measuring the amount of an added (exogenous) analyte displaced (or competed away) from a capture agent (antibody) by the analyte present in the sample. For example, in one competitive assay, a known amount of, in this case, labeled hPHTl and/or hPHT2 polypeptide is added to the sample and the sample is then contacted with a capture agent. The amount of labeled polypeptide bound to the antibody is inversely proportional to the concentration of target hPHTl and/or hPHT2 polypeptide present in the sample.
In one particularly preferred embodiment, the antibody is immobilized on a solid substrate. The amount of target polypeptide bound to the antibody may be determined either by measuring the amount of target polypeptide present in a polypeptide/antibody complex, or alternatively by measuring the amount of remaining uncomplexed polypeptide.
The immunoassay methods of the present invention include an enzyme immunoassay (EIA) which utilizes, depending on the particular protocol employed, unlabeled or labeled (e.g., enzyme-labeled) derivatives of polyclonal or monoclonal antibodies or antibody fragments or single-chain antibodies that bind hPHTl and/or hPHT2 polypeptide(s), either alone or in combination. In the case where the antibody that binds hPHTl and/or hPHT2 polypeptide is not labeled, a different detectable marker, for example, an enzyme-labeled antibody capable of binding to the monoclonal antibody which binds the hPHTl and/or hPHT2 polypeptide, may be employed. Any of the known modifications of EIA, for example, enzyme-linked immunoabsorbent assay (ELISA), may also be employed. As indicated above, also contemplated by the present invention are immunoblotting immunoassay techniques such as western blotting employing an enzymatic detection system. The immunoassay methods of the present invention may also be other known immunoassay methods, for example, fluorescent immunoassays using antibody conjugates or antigen conjugates of fluorescent substances such as fluorescein or rhodamine, latex agglutination with antibody-coated or antigen-coated latex particles, haemagglutination with antibody-coated or antigen-coated red blood corpuscles, and immunoassays employing an avidin-biotin or strepavidin-biotin detection systems, and the like.. The particular parameters employed in the immunoassays of the present invention can vary widely depending on various factors such as the concentration of antigen in the sample, the nature of the sample, the type of immunoassay employed and the like. Optimal conditions can be readily established by those of ordinary skill in the art. In certain embodiments, the amount of antibody that binds hPHTl and/or hPHT2 polypeptide is typically selected to give 50% binding of detectable marker in the absence of sample. If purified antibody is used as the antibody source, the amount of antibody used per assay will generally range from about 1 ng to about 100 ng. Typical assay conditions include a temperature range of about 4°C. to about 45°C, preferably about 25°C to about 37°C, and most preferably about 25°C, a pH value range of about 5 to 9, preferably about 7, and an ionic strength varying from that of distilled water to that of about 0.2M sodium chloride, preferably about that of 0.15M sodium chloride. Times will vary widely depending upon the nature of the assay, and generally range from about 0.1 minute to about 24 hours. A wide variety of buffers, for example PBS, may be employed, and other reagents such as salt to enhance ionic strength, proteins such as serum albumins, stabilizers, biocides and non-ionic detergents may also be included.
The assays of this invention are scored (as positive or negative or quantity of target polypeptide) according to standard methods well known to those of skill in the art. The particular method of scoring will depend on the assay format and choice of label. For example, a Western Blot assay can be scored by visualizing the colored product produced by the enzymatic label. A clearly visible colored band or spot at the correct molecular weight is scored as a positive result, while the absence of a clearly visible spot or band is scored as a negative. The intensity of the band or spot can provide a quantitative measure of target polypeptide concentration.
Antibodies for use in the various immunoassays described herein, are commercially available or can be produced as described below.
4) Antibodies to hPHTl and/or hPHT2 polypeptides.
Either polyclonal or monoclonal antibodies (anti-hPHTl or anti-hPHT2 antibodies) may be used in the immunoassays of the invention described herein. Polyclonal antibodies are preferably raised by multiple injections (e.g. subcutaneous or intramuscular injections) of substantially pure polypeptides (hPHTl and/or hPHT2 or fragments thereof) or antigenic polypeptides into a suitable non-human mammal. The antigenicity of the target peptides can be determined by conventional techniques to determine the magnitude of the antibody response of an animal that has been immunized with the peptide. Generally, the peptides that are used to raise antibodies for use in the methods of this invention should generally be those which induce production of high titers of antibody with relatively high affinity for target polypeptides encoded by hPHTl and/or hPHT2.
If desired, the immunizing peptide may be coupled to a carrier protein by conjugation using techniques that are well-known in the art. Such commonly used carriers which are chemically coupled to the peptide include keyhole limpet hemocyanin (KLH), thyroglobulin, bovine serum albumin (BSA), and tetanus toxoid. The coupled peptide is then used to immunize the animal (e.g. a mouse or a rabbit).
The antibodies are then obtained from blood samples taken from the mammal. The techniques used to develop polyclonal antibodies are known in the art (see, e.g.,
Methods ofEnzymology, "Production of Antisera With Small Doses of Immunogen: Multiple Intradermal Injections", Langone, et al. eds. (Acad. Press, 1981)). Polyclonal antibodies produced by the animals can be further purified, for example, by binding to and elution from a matrix to which the peptide to which the antibodies were raised is bound. Those of skill in the art will know of various techniques common in the immunology arts for purification and/or concentration of polyclonal antibodies, as well as monoclonal antibodies see, for example, Coligan, et al. (1991) Unit 9, Current Protocols in Immunology, Wiley Interscience).
Preferably, however, the antibodies produced will be monoclonal antibodies ("mAb's"). For preparation of monoclonal antibodies, immunization of a mouse or rat is preferred. The term "antibody" as used in this invention includes intact molecules as well as fragments thereof, such as, Fab and F(ab')2, and/or single-chain antibodies (e.g. scFv) which are capable of binding an epitopic determinant.
The general method used for production of hybridomas secreting mAbs is well known (Kohler and Milstein (1975) Nature, 256:495). Briefly, as described by Kohler and Milstein the technique comprises fusing an antibody-secreting cell (e.g. a splenocyte) with an immortalized cell (e.g. a myeloma cell). Hybridomas are then screened for production of antibodies that bind to hPHTl and/or hPHT2 or a fragment thereof. Confirmation of specificity among mAb's can be accomplished using relatively routine screening techniques (such as the enzyme-linked immunosorbent assay, or "ELISA", BiaCore, etc.) to determine the binding specificity and/or avidity of the mAb of interest. Antibodies fragments, e.g. single chain antibodies (scFv or others), can also be produced/selected using phage display technology. The ability to express antibody fragments on the surface of viruses that infect bacteria (bacteriophage or phage) makes it possible to isolate a single binding antibody fragment, e.g., from a library of greater than 1010 nonbinding clones. To express antibody fragments on the surface of phage (phage display), an antibody fragment gene is inserted into the gene encoding a phage surface protein (e.g., plU) and the antibody fragment-pHI fusion protein is displayed on the phage surface (McCafferty et al. (1990) Nature, 348: 552-554; Hoogenboom et al. (1991) Nucleic Acids Res. 19: 4133-4137).
Since the antibody fragments on the surface of the phage are functional, phage bearing antigen binding antibody fragments can be separated from non-binding phage by antigen affinity chromatography (McCafferty et al. (1990) Nature, 348: 552-554). Depending on the affinity of the antibody fragment, enrichment factors of 20 fold - 1,000,000 fold are obtained for a single round of affinity selection. By infecting bacteria with the eluted phage, however, more phage can be grown and subjected to another round of selection. In this way, an enrichment of 1000 fold in one round can become 1,000,000 fold in two rounds of selection (McCafferty et al. (1990) Nature, 348: 552-554). Thus even when enrichments are low (Marks et al. (1991) J. Mol. Biol. 222: 581-597), multiple rounds of affinity selection can lead to the isolation of rare phage. Since selection of the phage antibody library on antigen results in enrichment, the majority of clones bind antigen after as few as three to four rounds of selection. Thus only a relatively small number of clones (several hundred) need to be analyzed for binding to antigen.
Human antibodies can be produced without prior immunization by displaying very large and diverse V-gene repertoires on phage (Marks et al. (1991) J. Mol. Biol. 222: 581-597). In one embodiment natural VH and VL repertoires present in human peripheral blood lymphocytes are were isolated from unimmunized donors by PCR. The V-gene repertoires were spliced together at random using PCR to create a scFv gene repertoire which is was cloned into a phage vector to create a library of 30 million phage antibodies (Id.). From this single "naive" phage antibody library, binding antibody fragments have been isolated against more than 17 different antigens, including haptens, polysaccharides and proteins (Marks et al. (1991) J. Mol. Biol. 222: 581-597; Marks et al. (1993). Bio/Technology. 10: 779-783; Griffiths et al. (1993) EMBO J. 12: 725-734; Clackson et al. (1991) Nature. 352: 624-628). Antibodies have been produced against self proteins, including human thyroglobulin, immunoglobulin, tumor necrosis factor and CEA (Griffiths et al. (1993) EMBO J. 12: 725-734). It is also possible to isolate antibodies against cell surface antigens by selecting directly on intact cells. The antibody fragments are highly specific for the antigen used for selection and have affinities in the 1 :M to 100 nM range (Marks et al. (1991) J. Mol. Biol. 222: 581-597; Griffiths et al. (1993) EMBO J. 12: 725- 734). Larger phage antibody libraries result in the isolation of more antibodies of higher binding affinity to a greater proportion of antigens.
It will also be recognized that antibodies can be prepared by any of a number of commercial services (e.g., Berkeley antibody laboratories, Bethyl Laboratories, Anawa, Eurogenetec, etc.).
C) Polypeptide-based assays -- Polypeptide activity.
In addition to, or as an alternative to, the assays described above, it is also possible to assay for hPHTl and/or hPHT2 activity. As explained above, hPHTl and/or hPHT2 are transporters. Thus, endogenous hPHTl and/or hPHT2 activity in a cell can be readily measured by providing a suitable substrate (e.g. one identified according to the methods described herein) and detecting the uptake of that substrate by hPHTl or hPHT2.
D) Pre-screening for agents that bind hPHTl or hPHT2 nucleic acids or polypeptides.
In certain embodiments it is desired to pre-screen test agents for the ability to interact with (e.g. specifically bind to) an hPHTl or hPHT2 nucleic acid or polypeptide. Specifically, binding test agents are more likely to interact with and thereby modulate hPHTl or hPHT2 expression and/or activity. Thus, in some preferred embodiments, the test agent(s) are pre-screened for binding to hPHTl and/or hPHT2 nucleic acids or to hPHTl and/or hPHT2 proteins before performing the more complex assays described above.
In one embodiment, such pre-screening is accomplished with simple binding assays. Means of assaying for specific binding or the binding affinity of a particular ligand for a nucleic acid or for a protein are well known to those of skill in the art. In preferred binding assays, the hPHTl and/or hPHT2 protein or protein fragment, or nucleic acid is immobilized and exposed to a test agent (which can be labeled), or alternatively, the test agent(s) are immobilized and exposed to an hPHTl and/or hPHT2 protein (or fragment) or to an hPHTl or hPHT2 nucleic acid or fragment thereof (which can be labeled). The immobilized moiety is then washed to remove any unbound material and the bound test agent or bound hPHTl or hPHT2 nucleic acid or protein is detected (e.g. by detection of a label attached to the bound molecule). The amount of immobilized label is proportional to the degree of binding between the hPHTl and/or hPHT2 protein or nucleic acid and the test agent.
V. High throughput screening for agents transported by peptide transporters and/or for agents that modulate transporter expression.
The assays for modulators of peptide transporter expression and/or activity or for agents transported by the transporters of this invention are also amenable to "high- throughput" modalities. Conventionally, new chemical entities with useful properties (e.g., modulation of transporter activity or expression, or ability to be transported by the transporters of this invention) are generated by identifying a chemical compound (called a "lead compound") with some desirable property or activity, creating variants of the lead compound, and evaluating the property and activity of those variant compounds. However, the current trend is to shorten the time scale for all aspects of drug discovery. Because of the ability to test large numbers quickly and efficiently, high throughput screening (HTS) methods are replacing conventional lead compound identification methods.
In one preferred embodiment, high throughput screening methods involve providing a library containing a large number of compounds (candidate compounds) potentially having the desired activity. Such "combinatorial chemical libraries" are then screened in one or more assays, as described herein, to identify those library members (particular chemical species or subclasses) that display a desired characteristic activity. The compounds thus identified can serve as conventional "lead compounds" or can themselves be used as potential or actual therapeutics.
A) Combinatorial chemical libraries for agents transported by hPHTl or hPHT2 or for modulators of hPHTl or hPHT2 expression.
The likelihood of an assay identifying an agent transported by hPHTl and/or hPHT2 or a modulator of hPHTl and/or hPHT2 expression or activity is increased when the number and types of test agents used in the screening system is increased. Recently, attention has focused on the use of combinatorial chemical libraries to assist in the generation of new chemical compound leads. A combinatorial chemical library is a collection of diverse chemical compounds generated by either chemical synthesis or biological synthesis by combining a number of chemical "building blocks" such as reagents. For example, a linear combinatorial chemical library such as a polypeptide library is formed by combining a set of chemical building blocks called amino acids in every possible way for a given compound length (i.e., the number of amino acids in a polypeptide compound). Millions of chemical compounds can be synthesized through such combinatorial mixing of chemical building blocks. For example, one commentator has observed that the systematic, combinatorial mixing of 100 interchangeable chemical building blocks results in the theoretical synthesis of 100 million tetrameric compounds or 10 billion pentameric compounds (Gallop et al. (1994) 37(9): 1233-1250).
Preparation and screening of combinatorial chemical libraries is well known to those of skill in the art. Such combinatorial chemical libraries include, but are not limited to, peptide libraries (see, e.g., U.S. Patent 5,010,175, Furka (1991) Int. J. Pept. Prot. Res., 37: 487-493, Houghton et al. (1991) Nature, 354: 84-88). Peptide synthesis is by no means the only approach envisioned and intended for use with the present invention. Other chemistries for generating chemical diversity libraries can also be used. Such chemistries include, but are not limited to: peptoids (PCT Publication No WO 91/19735, 26 Dec. 1991), encoded peptides (PCT Publication WO 93/20242, 14 Oct. 1993), random bio-oligomers (PCT Publication WO 92/00091, 9 Jan. 1992), benzodiazepines (U.S. Pat. No. 5,288,514), diversomers such as hydantoins, benzodiazepines and dipeptides (Hobbs et al, (1993) Proc. Nat. Acad. Sci. USA 90: 6909-6913), vinylogous polypeptides (Hagihara et al. (1992) J. Amer. Chem. Soc. 114: 6568), nonpeptidal peptidomimetics with a Beta- D- Glucose scaffolding (Hirschmann et al, (1992) J. Amer. Chem. Soc. 114: 9217-9218), analogous organic syntheses of small compound libraries (Chen et al. (1994) J. Amer. Chem. Soc. 116: 2661), oUgocarbamates (Cho, et al., (1993) Science 261:1303), and/or peptidyl phosphonates (Campbell et al, (1994) 7. Org. Chem. 59: 658). See, generally, Gordon et al, (1994) 7. Med. Chem. 37:1385, nucleic acid libraries (see, e.g., Strategene, Corp.), peptide nucleic acid libraries (see, e.g., U.S. Patent 5,539,083) antibody libraries (see, e.g., Vaughn et al. (1996) Nature Biotechnology, 14(3): 309-314), and PCT/US96/10287), carbohydrate libraries (see, e.g., Liang et al. (1996) Science, 274: 1520-1522, and U.S. Patent 5,593,853), and small organic molecule libraries (see, e.g., benzodiazepines, Baum (1993) C&EN, Jan 18, page 33, isoprenoids U.S. Patent 5,569,588, thiazolidinones and metathiazanones U.S. Patent 5,549,974, pyrrolidines U.S. Patents 5,525,735 and 5,519,134, morpholino compounds U.S. Patent 5,506,337, benzodiazepines 5,288,514, and the like). Devices for the preparation of combinatorial libraries are commercially available (see, e.g., 357 MPS, 390 MPS, Advanced Chem Tech, Louisville KY, Symphony, Rainin, Woburn, MA, 433A Applied Biosystems, Foster City, CA, 9050 Plus, Millipore, Bedford, MA).
A number of well known robotic systems have also been developed for solution phase chemistries. These systems include automated workstations like the automated synthesis apparatus developed by Takeda Chemical Industries, LTD. (Osaka, Japan) and many robotic systems utilizing robotic arms (Zymate π, Zymark Corporation, Hopkinton, Mass.; Orca, Hewlett-Packard, Palo Alto, Calif.) which mimic the manual synthetic operations performed by a chemist. Any of the above devices are suitable for use with the present invention. The nature and implementation of modifications to these devices (if any) so that they can operate as discussed herein will be apparent to persons skilled in the relevant art. In addition, numerous combinatorial libraries are themselves commercially available (see, e.g., ComGenex, Princeton, N.J., Asinex, Moscow, Ru, Tripos, Inc., St. Louis, MO, ChemStar, Ltd, Moscow, RU, 3D Pharmaceuticals, Exton, PA, Martek Biosciences, Columbia, MD, etc.).
B) High throughput assays of chemical libraries for agents transported by hPHTl and/or hPHT2 or for modulators of hPHTl and/or hPHT2 expression. Any of the assays for agents that modulate hPHTl and/or hPHT2 expression or activity and or for agents transported by hPHTl and/or hPHT2 are amenable to high throughput screening. As described above likely modulators either inhibit expression of the gene product, or inhibit the activity of the expressed protein while agents transported by hPHT2 and/or hPHT2 are internalized into cells expressing these proteins. Preferred assays thus detect inhibition of transcription (i.e., inhibition of mRNA production) by the test compound(s), inhibition of protein expression by the test compound(s), binding to the gene (e.g., gDNA, or cDNA) or gene product (e.g., mRNA or expressed protein) by the test compound(s) in the case of expression assays, while transport assays preferably measure internalization of the test agent. High throughput assays for the presence, absence, or quantification of particular nucleic acids or protein products are well known to those of skill in the art. Similarly, binding assays are similarly well known. Thus, for example, U.S. Patent 5,559,410 discloses high throughput screening methods for proteins, U.S. Patent 5,585,639 discloses high throughput screening methods for nucleic acid binding (i.e., in arrays), while U.S. Patents 5,576,220 and 5,541,061 disclose high throughput methods of screening for ligand/antibody binding.
In addition, high throughput screening systems are commercially available (see, e.g., Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, OH; Beckman Instruments, Inc. Fullerton, CA; Precision Systems, Inc., Natick, MA, etc.). These systems typically automate entire procedures including all sample and reagent pipetting, liquid dispensing, timed incubations, and final readings of the microplate in detector(s) appropriate for the assay. These configurable systems provide high throughput and rapid start up as well as a high degree of flexibility and customization. The manufacturers of such systems provide detailed protocols the various high throughput. Thus, for example, Zymark Corp. provides technical bulletins describing screening systems for detecting the modulation of gene transcription, ligand binding, and the like. VI. Kits.
In still another embodiment, this invention provides kits for isolation and/or detection and/or cloning of the transporter genes of this invention. In addition kits are provided for the practice of any of the assay methods described herein. In one preferred embodiment, the kits comprise one or more containers containing nucleic acids encoding one or more of the H+/oligopeptide transporters or fragments thereof, or (optionally labeled) probes that specifically bind to one or more of the H+/oligopeptide transporters of this invention.
In another embodiment, kits comprise one or more containers containing a vector encoding one or more of the transporters of this invention and/or cells or cell lines optionally transfected with one or more of these vectors. In addition to, or alternatively the kit contain mRNA(s) encoding one or more of the transporters of this invention and/or cells suitable for transfection with such mRNAs. Instead of the RNAs, the kits may optionally contain DNA template(s) suitable for preparation of such mRNAs. The kits may optionally include one or more reagents for use in the methods of this invention. Such "reagents" may include, but are not limited to, cells and/or cell lines, transfection reagents (e.g. CaPO4, lipofectin), detectable labels, means for detecting labels, buffers, anti-transporter antibodies, nucleic acid constructs encoding housekeeping genes, bioreactors, syringes, and other devices. In addition, the kits may include instructional materials containing directions
(i.e., protocols) for the practice of the methods of this invention. Preferred instructional materials provide protocols utilizing the kit contents for creating or modifying cells encoding one or more of the transporters of this invention, and or utilizing the kit contents for measuring expression of one or more of the transporters of this invention, or for screening for agents that are transported by one or more of the transporters of this invention. While the instructional materials typically comprise written or printed materials they are not limited to such. Any medium capable of storing such instructions and communicating them to an end user is contemplated by this invention. Such media include, but are not limited to electronic storage media (e.g., magnetic discs, tapes, cartridges, chips), optical media (e.g., CD ROM), and the like. Such media may include addresses to internet sites that provide such instructional materials. EXAMPLES
The following examples are offered to illustrate, but not to limit the claimed invention.
Example 1 Human Proton/Oligopeptide Transporter (POT) Genes: Identification of Putative
Human Genes Using Bioinformatics
The proton-dependent oligopeptide transporters (POT) gene family currently consists of -70 cloned cDNAs derived from diverse organisms. In mammals, two genes encoding peptide transporters, PepTI and PepT2 have been cloned in several species including humans, in addition to a rat histidine/peptide transporter (rPHTI). Because the Candida elegans genome contains five putative POT genes, we searched the available protein and nucleic acid databases for additional mammalian/human POT genes, using iterative BLAST runs and the human expressed sequence tags (EST) database. The apparent human orthologue of rPHTI (expression largely confined to rat brain and retina) was represented by numerous ESTs originating from many tissues. Assembly of these ESTs resulted in a contiguous sequence covering -95% of the suspected coding region. The contig sequences and analyses revealed the presence of several possible splice variants of hPHTl. A second closely related human EST-contig displayed high identity to a recently cloned mouse cDNA encoding cyclic adenosine monophosphate (cAMP)-inducible 1 protein (gi:4580995). This contig served to identify a PAC clone containing deduced exons and introns of the likely human orthologue (termed hPHT2). Northern analyses with EST clones indicated that hPHTl is primarily expressed in skeletal muscle and spleen, whereas hPHT2 is found in spleen, placenta, lung, leukocytes, and heart. These results suggest considerable complexity of the human POT gene family, with relevance to the absorption and distribution of cephalosporins and other peptoid drugs.
Introduction
Small peptides and peptidelike drugs are often too polar to cross lipid bilayers by simple diffusion. Therefore, translocation across membranes depends on transport by a suitable carrier, and orally administered peptoid drugs would be poorly absorbed unless transported (Yang et al. (1999) Pharm. Res. 16: 1331-1343). The main intestinal
HVdipeptide transporter protein, PepTI, is thought to play a critical role in oral bioavailability of peptidelike drugs (Dantzig and Bergin (1990) Biochim. Biophys. Acta, 1027: 211-217; Matsumoto et al. (1994) J. Pharmacol. Exp. Ther., 210: 498-504; Wenzel et al. (1996) J. Pharmacol Exp. Ther., 277: 831-839; Kramer et al. (1990) Biochim. Biophys. Acta, 1027: 25-30; Fei et al. (1994) Nature, 368: 563-566; Thwaites et al. (1993) J. Biol. Chem., 268: 7640-7642). hPepTl is a member of a well defined small gene family, the proton-dependent oligopeptide transporters (POT, also referred to as PTR), with ancestral roots that can be traced to bacterial, fungal, and plant peptide transporters (Graul and Sadee (1997) Pharm. Res., 14: 388-400; Fei et al. (1998) Prog. Nucleic Acid Res. Mol. Biol, 58: 239-261; Paulsen and Skurray (1994) Trends Biochem. Sci., 19: 404; Steiner et al. (1995) Mol. Microbiol, 16: 825-834). This class of secondary active transporters has broad selectivity for di- and tripeptides, whereas ability to transport longer peptides decreases drastically with increasing length. Most of the POT members share a common structural architecture with -12 predicted transmembrane domains (TMDs), but among the dipeptide transporters in distant phyla, variations on this theme do occur. We have confirmed the transmembrane topology of at least a portion of the main intestinal transporter, hPepTl, using an epitope tagging approach (Covitz et al. (1998) Biochemistry, 37: 15214-15221). Until recently, only two POT genes had been identified in mammalian species, PepTI and PepT2, the main renal peptide transporters (Fei et al. (1994) Nature, 368: 563-566; Liu et al. (1995) Biochim Biophys Acta., 1235: 461-466). cDNA's of the respective human orthologues have been cloned (orthologue refers to the same gene in different species) (Liu et al. (1995) Biochim Biophys Acta., 1235: 461-466; Liang et al. (1995) J. Biol. Chem., 270: 6456-6463). These transporters display overlapping, broad substrate selectivity and interact with numerous drugs, some with chemical structures quite distinct from peptides. Substrates include important drug classes, such as β-lactam and cephalosporin antibiotics, renin inhibitors, ACE inhibitors, and 5'-nucleoside esters of amino acids, such as valcyclovir (Han et al. (1998) Pharm. Res., 15: 1154-1159).
With a few exceptions, single amino acids are not substrates. In 1997, a third cDNA was cloned, encoding the rat peptide-histidine transporter, rPHTI (Yamashita et al. (1997) J. Biol. Chem. 272: 10205-10211), with use of sequence alignments of POT members with the expressed sequence tags (EST) database. Currently cloned mammalian POT cDNAs and deduced proteins are summarized in Table 1. Whereas rPHTI is mainly expressed in rat brain and retina, the identity and tissue distribution of its human orthologue remain unknown. We have found numerous human ESTs that on the basis of high sequence identity among them appear to represent the human orthologue of the rat transporter gene rPHTI. Many of these ESTs were isolated from tissues other than brain, e.g, human colon carcinoma, suggesting that the tissue distribution of this gene product in humans might differ from that in rat. This finding cautions against assuming that only one HNdipeptide transporter is expressed in a tissue of interest. Nakanishi et al (1997) have postulated the presence of a distinct peptide carrier in a human fibrosarcoma cell line (Nakanishi et al. (1997) Cancer Res., 57: 4118-4122). Understanding of peptide transport is facilitated by identification and characterization of all of the relevant genes.
Table 1.. Molecular characteristics of cloned POT cDNAs*
Figure imgf000050_0001
*POT indicates proton-dependent oligopeptide transporters; TMD indicates transmembrane domain. fPossible alternative splicing product of hPepTl.
Comprehensive sequence analysis reveals the presence of at least five possible members of the POT family in C elegans (this Example). Therefore, one would expect several POT genes to exist in the human genome, in addition to hPepTl and hPepT2.
Because the POT family shares limited sequence similarity with other transporter families, any newly identified sequences with significant similarity to POT proteins are likely to be members of the POT family. Therefore, our overall objective was to identify all human homologues of the POT family and determine their tissue distribution. In this Example, we show results obtained from a bioinformatics analysis of the available databases, supplemented by assays of gene expression in human tissues.
To find all members of a gene family, we developed a Java-based program, which iteratively searches sequence databases for homologous genes using BLAST. Called INCA (iterative neighborhood cluster analysis; http://itsa.ucsf.edu/~gram/home/inca/), this program identifies a complete cluster of sequence neighbors in the database (Graul and Sadee (1997) Pharm Res., 14: 1533-1541). By applying INCA to search the nonredundant protein sequence database and subsequently the human EST database for novel dipeptide transporters, we have identified two new human genes as possible members of POT and several ESTs representing candidate genes of additional human peptide transporters of the POT family. A greater repertoire of the dipeptide transporter gene family in humans must be considered in the interpretation of pharmacological studies with peptoid drugs and could also serve for targeting drugs to specific tissues. Moreover, sequence variations in these transporters could account for interindividual genetic differences in the disposition of peptoid drugs.
Methods
Iterative Neighborhood Cluster Analysis (INCA) using BLAST
Scanning available databases for all genes and proteins related to each other was accomplished using multiple BLAST runs (basic local alignment search tool; http://www.ncbi.nlm.nih.gov/BLAST/). Each individual run within an INCA analysis used gapped BLAST of the v2.0.9 family of programs (Altschul et al. (1997) Nucleic Acids Res., 25: 3389-3402). With gapped BLAST, we found local alignments of similar sequence fragments while allowing for gaps in the alignment. This approach enhances detection of sequence similarities and exceeds nongapped BLAST searches. We developed an iterative BLAST search termed INCA (http://itsa.ucsf.edu/~gram home/inca/) (Graul and Sadee (1997) Pharm Res., 14: 1533-1541), that automatically performs BLAST on all sequences identified in the first run with a maximal Expect value (10"6 in the present study). In subsequent runs, ESTCA further tabulated any newly identified sequences scoring with E < 10" , until no further sequences were found. The Expect value estimates the probability that a given alignment score describing sequence similarity could have arisen by chance in the database searched. E = 10"6 is often taken as a cutoff below which homology is considered possible while 10"9 indicates probable homology (cutoff E values may change as a function of the type of protein and the analysis used).
The INCA results are tabulated so that sequences found in subsequent runs are listed with the sequence in the first run yielding the lowest E value (highest similarity). Thus, a first pass with multiple iterative BLASTs identifies all known protein sequences belonging to the POT family, regardless of the starter sequence. In a second pass, we used this entire neighborhood cluster, performing BLAST with each sequence, to scan the human EST database (BLAST, database: 'human ests'). EST clones represent individual mRNA species extracted from target tissues and converted to cDNAs that were then subcloned and partially sequenced. Upon completion of the second INCA pass, the program tabulates the results with each EST assigned to the protein sequence giving the lowest E value. Thereby, INCA provided a simple and exhaustive means of finding all relevant ESTs and assigning them to their most closely related protein sequence in the core cluster of POT sequences. This facilitated the search for new genes that might be represented by ESTs in the accessible databases.
Sequence Analysis Sequence analysis programs used are available at http://www.sacs.ucsf.edu/.
Database homology searches were carried out using the NCBI BLAST v2.0.9 family of programs (Altschul et al. (1997) Nucleic Acids Res., 25: 3389-3402). Contigs of EST sequences (the term contig as used here refers to a contiguous sequence assembled from several overlapping sequence fragments) were assembled using CAP3 (Huang and Madan (1999) Genome Res., 9: 868-877) and viewed and edited using Sequencher3.0
(www.genecodes.com). The structure of the hPHT2 gene was characterized with FGENESH. Multiple alignments of protein sequences were produced with the Pileup GCGvlO (Wisconsin Package Version 10.0; Genetics Computer Group (GCG), Madison, WI), using oldpep matrix with default parameters. Hydropathy plots were generated using Kyte and Doolittle hydropathy measure (Kyte and Doolittle (1982) J. Mol. Biol, 157: 105-132) and a window size of 7 by Pepplot GCGvlO (Wisconsin Package Version 10.0; Genetics Computer Group (GCG)) with local modifications. The following transmembrane prediction tools were used to produce computer predicted TM topology:
TMHMM(http://www.cbs.dtu.dk/services/TMHMM-1.0/) TMPRED(http://www.ch.embnet.org/software/TMPRED_form.html),
HMMTOP(http://www.enzim.hu/hmmtop/), and MEMSTAT(http://globin.bio. warwick.ac.uk/psipred/). The transmembrane topology schematic was rendered using TOPO (S.J. Johns and R.C. Speth, Transmembrane protein display software, http://www.sacs.ucsf.edu/TOPO/topo.html). Sequence identities were calculated using the Smith Waterman algorithm (Smith and Waterman (1981) J. Mol. Biol, 147: 195-197) by the program ssearch3, a component of the FASTA programs (Pearson (1991) Genomics, 11: 635-650).
Reverse Transcriptase-Polymerase Chain Reaction Analysis
Poly (A)+ RNA samples extracted from human intestinal biopsy specimen were analyzed by reverse transcriptase-polymerase chain reaction (RT-PCR). RT-PCR was performed with the GeneAmp RNA PCR Kit Part No. N808-0017 from Perkin Elmer (Wellesley, MA) using 0.5ul AmpliTaq® DNA Polymerase. Alternatively, cDNA samples from skeletal muscle and pancreas (purchased from CLONTECH Laboratories, Palo Alto, CA) were analyzed without RT treatment. The thermocycle included heating at 95°C, annealing at 60°C, reaction temperature at 70°C, for 35 cycles. Primers 5'MHPHTl
(CGTTAGGTGGCATTGCCTAT, SEQ ID NO: , left primer at position 562) and
3'MHPHT1 (GAGGATGAGCACAGCATCAA, SEQ ED NO: , right primer, at position
1071) were designed with the Primer3 program from the Whitehead Institute for Biomedical Research, Massachusetts Institute of Technology, Cambridge, MA. The expected product size is 509 base pairs. The amplified product was electrophoresed on a 1% Agarose gel, extracted, and sequenced by the University of California San Francisco Human Genetics Sequencing Service, San Francisco, CA.
Northern Blot Analysis of hPHTl and hPHT2 Expression in Human Tissues
Membrane blots containing size-fractionated poly(A)+ mRNA from 12 tissues of human origin were purchased from CLONTECH Laboratories. The accession codes of the ESTs used as probes are W53019 and AA242853 for hPHTl and hPHT2, respectively. These cDNA's were labeled with α32P-dATP (3000 Ci/mmol; Amersham, Piscataway, NJ) according to the random priming method, using a kStrip-EZ DNA kit (Ambion, Austin, TX). Hybridization was performed at 42°C overnight after purifying 32-P- cDNA on a Sephadex G-50 spin column (mini Quick Spin Columns; Boehringer Mannheim, Indianapolis, IN). The blots were washed twice at 42°C for 5 min with low stringency solution (Ambion) and for 15 min with high stringency solution (Ambion). The membranes were exposed against x-ray film at -80°C for 3 to 7 days with intensifying screens. The hybridized probe was removed from the membranes by using a Strip-EZ DNA kit (Ambion) before rehybridization. Results
Iterative Neighborhood Cluster Analysis (INCA) of the POT Family, First Pass: Nonredundant Protein Databases
Regular BLAST analysis of the protein databases confirmed the presence of three mammalian members reported to belong to the POT family: PepTI, PepT2, and rPHTI. To compile the entire POT family, we then ran INCA with hPepTl as starter sequence (any sequence is suitable as INCA yields the same sequence cluster regardless of the starter sequence). As shown in Table 2, there are 68 members of the core cluster of POT sequences, each connected to at least one other core sequence with E = 10"6 or lower. In the first iteration, 63 sequences were identified by BLAST as possible/probable homologues of hPepTl, while in the second iteration 5 additional sequences appeared in the core cluster of POT.
Table 2.. Iterative BLAST analysis using INCA (18). The protein sequence of hPepTl served as the starter sequence for pass 1 searching the nr protein databases, and two iterations were done. The core cluster contains all sequences scoring with Expect values E < 10" . These are numbered 1-63, or with a second number to indicate the ranking order of a newly added sequence in iteration 2 (e.g, 42.41). Underlining indicates human core sequences; bold-face indicates multi-drug efflux transporter in POT core cluster; bold-face with italics indicates putative C elegans transporters. In pass 2, each of the 68 members of the core cluster was run against the human EST database, and the results tabulated such that the identified ESTs were listed with the core sequence providing the highest score. Note the inclusion of a bacterial drug-resistance transporter in the protein core cluster (bold-face); this sequence would have identified more members of the core cluster in a third iteration (not done here). These include bacterial drug resistance transporters which are currently listed outside the core cluster. Putative POT members from C elegans are shown in bold-face with italics. Further, note that the human ESTs mainly cluster with sequences 11 (hPepT2), 50 (rPHT), and 51 (mouse cAMP-inducible 1 protein). INCA parameters: pass = 1, iterations 2, similar = 0.0, significant = l.OE-6, minimum = 0.0, maximum = 10.0. PROGRAM = blastp, DATALIB = nr; ran BLAST 64 times, found 596 total neighbors, 68 inside cluster. Scores include the No. bits and the E value (seehttp://www.ncbi.nlm.nih.gov/BLAST/) Core Cluster (E < 106)
1 04827008 solute carrier family 15 (oligopeptide transp.) ["Homo sapl 1430 0.0
2 02832268 ( AF043233) Caco-2 oligopeptide transporter .Homo sapl 1430 0.0
3 01136776 (D50306) proton-coupled dipeptide cotransporter lΗomo sapl 1213 0.0
4 02143888 oligopeptide transport protein PepTI [Rat norv] 1210 0.0
5 01730492 OLIGOPEPT. TRANSP., SMALL INTEST. [Rat norv] 1206 0.0
6 00548474 OLIGOPEPT. TRANSP., SMALL INTEST. [Oryct Cunic] 1167 0.0
7 00535426 (U13707) oligopeptide transporter [Oryctolagus cu] 1162 0.0
8 01082041 (L46873) proton-dependent peptide transporter [Rat n]. 792 0.0
9 01585806 peptide transporter [Rattus norvegicus] 760 0.0
10 01172436 OLIGOPEPT. TRANSP., KIDNEY ISOf. [Oryct cunic] 651 0.0
11 02833272 OLIGOPEPT. TRANSP.. KIDNEY ISOF. [Homo sap] 636 0.0
12 02499990 OLIGOPEPT. TRANSP., KIDNEY ISOF. [Rat norv] 622 le-177
13 05901826 (AF181635) BcDNA.GH06717 [Drosoph mel] 495 le-139
14 03449109 (AL031130) EG:EG0002.1 [Drosophila melanogaster] 435 le-121
15 03449108 (AL031130) alternatively spliced form [Drosoph mel] 435 le-121
16 04115344 (U97114) optl short [Drosophila melanogaster] 432 le-120
17 04115343 (U97114) optl long [Drosophila melanogaster] 432 le-120
18 02829749 OLIGOPEPt. TRANSP. 1 (YIN PROTEL] [Dros mel] 424 le-117
19 02811011 HYPOTHET. OLIGOPEPT. TRANSP .[C elegans] 397 le-109
20 02506043 (AB001328) pH-sensing regulat. factor of pept.tr. rHomo sapl 362 4e-99
21 03241977 (AF 000417) high-affinity peptide transp. [C elegans J357 le-97
22 02833290 HYPOTHET. OLIGOPEPT. TRANSP. [C elegans] 357 le-97
23 03241979 (AF000418) low-affinity peptide transporter [C elegans] 351 le-95
24 02833308 HYPOTHET. OLIGOPEPT. TRANSP C elegans] 3193e-86
25 01172704 PEPTIDE TRANSPORTER PTR2-B [A thaliana] 177 3e-43
26 01076331 histidine transport protein -; [Arabidopsis thai] 176 4e-43
27 04406784 (AC006532) putative oligopept. transport prot. [A thaliana] 175 le-42
28 04102839 (AF016713) LeOPTl [Lycopersicon esculentum] 175 le-42
28.26 03335358 (AC003028) putative peptide transporter [Arabid thai thai] 185 8e-46
29 02655098 (AF023472) peptide transporter [Hordeum vulgare] 168 le-40
30 02160144 (AC000375) Strong similarity to Arabidopsis olig] 168 le-40
31 04490321 (AJ011604) nitrate transporter [Arabidopsis] 136 6e-31
32 02760834 (AC003105) putative nitrate transporter [Arab] 136 6e-31
33 01504097 (Y07561) proton-dependent peptide transporter... 135 le-30
34 01172703 PEPTIDE TRANSPORTER PTR2-A > gi|575427 132 le-29
35 04895194 (AC007661) putative peptide trans... 130 3e-29 36 02213590 (AC000348) T7N9.10 [Arabidopsis thaliana] 123 6e-27
37 00602292 (U 17987) RCH2 protein [Brassica napus] 123 6e-27
38 02651310 (AC002336) putative PTR2-B pept. Transp. [A thai] 122 7e-27
39 00544018 NITRATE/CHLORATE TRANSPORTER [A thaliana] 121 2e-26
40 05734721 (AC008075) Similar to gb|AF023472... 118 le-25
41 04490323 (AJ131464) nitrate transporter [A thaliana] 118 2e-25
42 02829802 HYPOTHET. 53.3 KD PROTEIN IN SFP-GE[B subtilis] 116 7e-25 42.41 02828140 (AF040248) erythroid different.-related factor [Homo sap] 63 4e-09
43 04455276 (AL035527) peptide transporter-like prot. [A thaliana] 114 3e-24
4402213586 (AC000348) T7N9.6 [Arabidopsis thaliana] 111 le-23
45 03377517 (AF073361) nitrate transporter NTL1 [Arabid thai] 109 9e-23
46 00548630 PEPTIDE TRANSP. PTR2 [S cerevis]. 108 le-22 53/61:54/62
47 00453646 (LI 1994) permease [Saccharomyces cerevisiae] 1081e-22
48 02829624 HYPOTHETICAL 54.2 KD PROTEIN IN PHRB-N... 107 4e-22 48.2 04062301 (D90709) Hypothetical protein f485 [E coli] 631 le-180
49 00731995 HYPOTHET. 53.1 KD PROT. IN LYSU-C [E coli] 102 le-20
50 02208839 (AB000280) peptide/histidine transporter [Rat n]. 100 5e-20
51 04580995 (AF121080) cAMP inducible 1 protein [Mus m] 100 6e-20
52 02367418 (AF000392) peptide transporter [Lotus japonicus] 97 4e-19
53 04678332 (AL049658) putative peptide transporter [Ar thai] 96 7e-19
54 03859684 (AL033503) peptide transport protein [Candida] 88 2e-16
55 02829654 HYPOTHET. 54.0 KD PROT. IN NTH-GS [E coli] 88 2e-16 55.45 04193955 (AF113952) multidrug-efflux transp. [Camp jejuni] 57 3e-07
56 01172741 PEPTIDE TRANSPORTER PTR2 [Candida alb] 86 le-15
57 01495366 (Z69370) nitrite transporter [Cucumis sativus] 83 9e-15 57.27 05360083 (AF154930) transporter-like protein 88 2e-16
58 02507268 HYPOTHET. 53.7 KD PROT. IN USPA-PRLC [E coli] 82 le-14
59 01073520 hypothetical protein o489 - E coli 82 le-14
6005080808 (AC007258) Similar to nitrate tr. [A thaliana] 79 le-13
61 04322327 (AF080545) peptide transporter [Nepenthes alata] 76 le-12
62 02811053 DI-/TR1PEPT1DE TRANSPORTER [Lactobac helv] 64 3e-09
63 00544192 DI-/TR1PEPTI-DE TRANSP. [Lact lactis] 62 le-08
Displaying neighbors outside cluster (10'6 > E < 0.01)
13.65 00071400 dermal gland protein APEG precursor - African c... 47 6e-04
13.66 00731172 SKIN SECRETORY PROTEIN XP2 PRECURSOR (A... 47 6e-04
13.67 05802676 (AF177977) serum opacity factor pr... 46 0.001
13.68 05701582 (AF026205) No definition line foun... 45 0.002
13.69 03877270 (Z77662) predicted using Genefinder [Caenor... 45 0.002
13.70 02435546 (AF026205) No definition line found [Caenorh... 45 0.002
13.71 05748800 (AF141140) serum opacity factor pr... 45 0.003 13.72 05002375 (AF153315) serum opacity factor pr... 44 0.004
13.73 02435547 (AF026205) No definition line found [Caenorh... 44 0.004
13.74 00477578 sialidase - Actinomyces viscosus > gi|141852 (L0... 44 0.004
13.75 02275336 (AF001978) differentially expressed in relation ... 44 0.005
13.76 02507049 HYPHAL WALL PROTEIN 1 (CELL ELONGATION... 44 0.005
13.77 05139301 (AF157555) serum opacity factor pr... 43 0.007
13.78 01781122 (Z83864) hypothetical protein Rv3835 [Mycobac... 43 0.007
13.79 01480457 (U42640) latex allergen [Hevea brasiliensis] > gi... 43 0.007
13.80 01420865 (X80397) orfl [Streptococcus pyogenes] 43 0.009
22.64 02621135 (AE000800) multidrug transporter homolog [Methan... 52 le-05 42.55 03820455 (AJ007367) multi-drug resistance efflux pump ... 51 2e-05 42.59 02695718 (AJ001694) putative membrane protein [Thermot... 48 2e-04
42.61 04467970 (X76640) hypothetical protein [Myxococcus xan... 46 6e-04
42.62 02618837 (AF017113) YvkA [Bacillus subtilis] > gi|2636047|... 450.001 42.64 05457646 (AJ248283) MULTIDRUG RESISTANCE PROTEIN [Py... 44 0.003 45.59 02827716 (AL021684) predicted protein [Arabidopsis tha... 43 0.005
48.52 02828202 BILE ACID TRANSPORTER >gi|1381569 (U57... 54 3e-06
48.55 01842056 (U87258) cis,cis-muconate transport protein MucK... 49 le-04
48.58 04885441 Na/PO4 cotransporter >gi|4587207... 44 0.003
48.59 02225983 (Z97193) hypothetical protein Rvl877 [Mycobac... 43 0.005 48.61 00586828 HYPOTHETICAL 44.2 KD PROTEIN EN COTF-T... 43 0.005
49.57 00401611 D-GALACTONATE TRANSPORTER >gi|290540 (... 45 0.001
49.58 00586812 HYPOTHETICAL 43.2 KD PROTEEN IN DNAC-R... 43 0.007
53.56 02500934 SfflKXMATE TRANSPORTER >gi|1736645|dbj|... 45 0.001
55.53 04753872 (AL049754) putative transmembrane efflux pr... 49 7e-05
55.61 02078013 (Z95207) efpA [Mycobacterium tuberculosis] 46 8e-04
55.62 01161051 (L39922) efflux protein [Mycobacterium tuberculo... 46 8e-04
55.64 04455672 (AL035472) putative transmembrane efflux pr... 44 0.002
55.65 00586001 METHYL VIOLOGEN RESISTANCE PROTEIN SMV... 44 0.003
55.66 02808775 (AL021411) integral membrane protein [Strepto... 43 0.004
55.67 02695836 (AL021006) hypothetical protein Rvl250 [Mycob... 43 0.004
55.68 02808785 (AL021411) putative transport protein [Strept... 43 0.005 58.43 02650264 (AE001079) oxalate/formate antiporter (oxlT-2) [... 53 5e-06
59.45 02127150 nitrate transporter - Bacillus subtilis (fragment) 506e-05
59.46 01171658 NITRATE TRANSPORTER >gi|1437473|dbj|BA... 50 6e-05 59.53 02222715 (Z97179) hypothetical protein MLCL383.37 [Myc... 46 5e-04 59.56 03257679 (AP000005) 372aa long hypothetical protein [P... 45 0.001
59.58 05777416 (AJ249180) sugar efflux transpoter [Erwinia... 43 0.004
59.59 01787304 (AE000207) orf, hypothetical protein [Escherichi... 43 0.004
59.60 02501163 HYPOTHETICAL 44.4 KD PROTEEN EN GRXB-R... 43 0.004 59.61 02982930 (AE000678) nitrate transporter [Aquifex aeolicus] 43 0.005 63.49 02612908 (AF015825) hexuronate transporter-like protein [... 51 2e-05
63.52 05457697 (AJ248283) TRANSPORTER [Pyrococcus abyssi] 48 le-04
63.53 03116222 (AB007122) transporter [Arthrobacter sp.] 47 3e-04
63.57 01346939 ANTISEPTIC RESISTANCE PROTEIN > gi|7733... 45 0.001 0/0:0/3
63.58 00097843 probable transport protein qacA - Staphylococcu... 44 0.003
63.59 03327943 (AF053771) multidrug efflux protein QacB [Staphy... 43 0.004
63.60 03097809 (L49465) hypothetical metabolite transport prote... 43 0.004
63.62 05650769 (AF089813) nitrate transporter [Sy... 43 0.007
63.63 03327948 (AF053772) multidrug efflux protein QacB [Staphy... 43 0.007
63.64 01706916 FOSMIDOMYCEN RESISTANCE PROTEIN >gi|212... 43 0.007
Pass = 2, iteration = 1, similar = 0.0, significant = l.OE-6, minimum = 0.0, maximum ; 10.0. PROGRAM = tblastn, IPROGRAM = tblastx, DATALIB = est iuman. Ran BLAST 68 times, found 125 total neighbors, 73 inside cluster, 52 outside cluster.
Core human EST cluster, 73 neighbors (E < 10"6)
11.1 03163329 am93cl0.sl Stratagene schizo brain Sll... 277 le-75
11.2 01544978 zf48e05.rl Soares retina N2b4HR Homo s... 213 2e-62
11.3 02575676 ns35dl l.sl NCI_CGAP_GCB 1 Homo sapiens ... 118 5e-32
11.4 00575005 H. sapiens partial cDNA sequence; clon... 118 le-25
11.5 04736822 wb22dl0.xl NCI_CGAP_GC6 Homo sapiens... 106 le-21
11.6 00574634 H. sapiens partial cDNA sequence; clon... 65 7e-17
11.8 01886510 zsl0dl2.sl NCI_CGAP_GCB 1 Homo sapiens ... 63 8e-09 42.1 03894461 ap21el0.xl Schiller oligodendroglioma ... 65 2e-09 48.1 01747695 zplδbll.rl Stratagene fetal retina 937... 98 3e-19
50.1 01349645 zc48dll.rl Soares senescent fibroblasts Nb... 344 le-93
50.2 05396524 wf64b06.xl Soares_NFL_T_GBC_Sl Homo ... 303 3e-81
50.3 05663458 wo90el2.xl NCI_CGAP_Kidll Homo sapie... 301 le-80
50.4 05675802 wp71b07.xl NCI_CGAP_Brn25 Homo sapie... 298 le-79
50.5 05109278 wg25b07.xl Soares_NSF_F8_9W_OT_PA_P_... 296 4e-79
50.6 06037457 xb65b04.xl Soares_NFL_T_GBC_Sl Homo ... 285 9e-76
50.7 04896654 wa76f08.xl Soares_NFL_T_GBC_Sl Homo ... 265 7e-70
50.8 05395461 wf66c05.xl Soares_NFL_T_GBC_S 1 Homo ... 263 4e-69
50.9 05396659 wf65d06.xl Soares_NFL_T_GBC_Sl Homo ... 250 2e-65
50.10 05450343 wf09b03.xl Soares_NFL_T_GBC_Sl Homo ... 247 3e-64
50.11 05391793 tdllgOl.xl NCI_CGAP_CLL1 Homo sapien... 243 3e-63
50.12 00816183 yg78d02.rl Homo sapiens cDNA clone 39680 5... 227 5e-62
50.13 03899028 ql64f04.xl Soares_NhHMPu_Sl Homo sapie... 238 le-61
50.14 03842747 qh34g06.xl Soares_NFL_T_GBC_Sl Homo ... 230 3e-59
50.15 01046571 ysl0b07.rl Homo sapiens cDNA clone 214357 5'. 227 3e-58 50.16 01024648 yr69b03.rl Homo sapiens cDNA clone 210509 ... 218 7e-56
50.17 03895536 ql54el0.xl Soares_NhHMPu_Sl Homo sapie... 211 le-53
50.18 06075659 xd72f06.xl Soares_NFL_T_GBC_Sl Homo ... 207 2e-52
50.19 03933090 qm02b04.xl Soares_NhHMPu_Sl Homo sapie... 205 8e-52
50.20 05542610 tc51g08.xl Soares_NhHMPu_Sl Homo sap... 205 le-51
50.21 05448546 wf04d03.xl Soares_NFL_T_GBC_Sl Homo ... 199 4e-50
50.22 03804209 qg95d09.xl Soares_NFL_T_GBC_Sl Homo sa... 199 6e-50
50.23 05437154 wkl4cl l.xl NCI_CGAP_Lyml2 Homo sapie... 197 2e-49
50.24 04739863 tt41al l.xl NCI_CGAP_GC6 Homo sapiens... 166 3e-48
50.25 01976192 EST26707 Cerebellum π Homo sapiens cD... 185 7e-46 50.28 01616355 zm91d04.rl Stratagene ovarian cancer (... 104 9e-32
50.30 02779368 nz03dl2.sl NCI_CGAP_GCB 1 Homo sapiens ... 132 6e-30
50.31 01933517 zs50g05.sl NCI_CGAP_GCB 1 Homo sapiens ... 99 7e-29
50.33 01470970 zi07e01.rl Soares fetal liver spleen 1... 124 2e-27
50.34 02849230 aa66d09.sl NCI_CGAP_GCB 1 Homo sapiens ... 122 7e-27
50.35 03253724 ow73h02.sl Soares_fetal_liver_spleen_l... 122 le-26
50.36 00750175 ye72bl2.rl Homo sapiens cDNA clone 123263 5'. 112 le-23
50.37 02783232 ny25bl0.sl NCI_CGAP_GCB 1 Homo sapiens ... 112 le-23
50.38 00900981 yp44h09.rl Homo sapiens cDNA clone 190337 5'. 109 5e-23
50.39 00900971 yp44f09.rl Homo sapiens cDNA clone 190313 5'. 109 5e-23 50.42 02018481 EST77078 Pancreas tumor IJJ Homo sapie... 95 2e-18
50.44 01989422 EST41878 Endometrial tumor Homo sapien... 92 le-17
50.45 01024562 yr69b03.sl Homo sapiens cDNA clone 210509 3'. 90 4e-17
50.46 01324288 zc82h02.rl Pancreatic Islet Homo sapiens c... 90 5e-17
50.47 01616244 zm91d04.sl Stratagene ovarian cancer (... 90 7e-17 0/0:2/3
50.49 03076109 omOδfl l.sl Soares_NFL_T_GBC_Sl Homo sa... 81 3e-14
50.50 01858092 zr49e07.rl Soares NhHMPu SI Homo sapie... 51 6e-14
50.51 04688089 ts89f09.xl NCI_CGAP_GC6 Homo sapiens... 80 8e-14 50.53 01959622 EST178192 Colon carcinoma (HCC) cell 1... 75 le-12 50.55 03838659 qh35f01.xl Soares_NFL_T_GBC_Sl Homo ... 69 le-10
50.57 03049244 om32h08.sl Soares_NFL_T_GBC_Sl Homo sa... 65 2e-09
50.58 01056010 ysl0b07.sl Homo sapiens cDNA clone 214357 3'. 57 2e-09
51.1 01873671 zr64d03.rl Soares NhHMPu SI Homo sapie... 238 le-67
51.2 03649141 ot08d05.xl NCI_CGAP_GC3 Homo sapiens c... 234 2e-60
51.3 03920067 qt87b05.xl NCI_CGAP_Col4 Homo sapiens ... 210 4e-53
51.5 05863189 UI-H-BI0-aac-g-02-0-UI.sl NCI_CGAP_S... 169 8e-41
51.6 00668137 yc09b07.rl Homo sapiens cDNA clone 80149 5'. 104 4e-39
51.7 03162566 oq02f07.sl NCI_CGAP_Lu5 Homo sapiens c... 160 4e-38 51.12 01192150 yy94d05.sl Homo sapiens cDNA clone 281193 3'. 140 4e-32 51.14 02899359 od60b01.sl NCI_CGAP_GCB 1 Homo sapiens ... 122 3e-29 51.24 03155247 oq67hl l.sl NCI_CGAP_Kid6 Homo sapiens ... 112 le-23 51.32 04112719 qy04el0.xl NCI_CGAP_Brn23 Homo sapiens... 104 2e-21 51.37 02163000 zx07dll.rl Soares total fetus Nb2HF8 9... 97 5e-19
51.39 03675523 oz69fl l.xl Soares_senescent_fibroblast... 95 2e-18
51.40 00761243 yf26a08.rl Homo sapiens cDNA clone 127958 ... 94 3e-18
51.41 03180750 ou49all.sl NCI_CGAP_Br2 Homo sapiens c... 94 5e-18
51.42 01887135 zr64d03.sl Soares NhHMPu SI Homo sapie... 89 2e-16 51.49 00668009 yc09b07.sl Homo sapiens cDNA clone 80149 3'. 70 5e-l l 51.52 03918968 qw07b08.xl NCI_CGAP_Ut3 Homo sapiens c... 62 2e-08
Displaying human EST neighbors outside cluster (10 6 < E < 0.01)
50.61 02788570 ny06b02.sl NCI_CGAP_GCB 1 Homo sapiens ... 55 2e-06
50.65 02820068 nz64gl0.sl NCI_CGAP_GCB1 Homo sapiens ... 50 8e-05
50.66 02752650 nw56h09.sl NCI_CGAP_GCB1 Homo sapiens ... 50 8e-05
50.67 02167640 zx45c08.rl Soares testis NHT Homo sapi... 45 0.003 55.45.1 05130954 cnl8bl2.xl Normal Human Trabecular B... 55 2e-06
Relationship of POT Family to Other Transporters
Earlier NCA runs had yielded similar results but with fewer sequences in the core cluster. The same analysis performed earlier revealed only 46 core sequences and converged after two iterations; that is, further iterations did not reveal any new sequences scoring with E < 10"6. This suggested that the POT family shows rather unique sequence characteristics, with no sequence from other transporter families scoring with E = 10"6 or lower. However, continued deposition of new sequences has enlarged the POT family and general databases considerably. This could result in the discovery of links to other transporter families possibly related to POT in evolution. Indeed, the recent ENCA run shown in Table 2 identified a distinct sequence with E < 10"6 belonging to the family of bacterial drug resistance transporters, a multidrug-efflux transporter of Campylobacter (Table 2, core cluster; bold-face type). To avoid including numerous multidrug-efflux transporters with the POT core family in a third iteration of BLAST, Table 2 contains only two iterations. A number of these drug-resistance transporters of the major facilitator type transporter family appear in the list of neighbor sequences outside the core cluster (Table 2). Several distinct types of transporters reach E values of -10"5 (e.g, the bile acid transporter gi/1381596 [sequence 48.52, Table 2], and the hexuronate transporter Af015825 [sequence 63.49] reported earlier (Sadee et al. (1999) Membrane Transporters as Drug Targets. New York, Plenum Press, pp 29-58.25). These results support the proposition that the peptide transporter family POT is indeed related in evolution to other transporter families.
Search for New POT Members
The core cluster contains 5 putative POT genes from the completely sequenced genome of C elegans (Table 2, bold-face and italics). Each of these deduced proteins has high similarity to hPepTl. This finding suggests that the human genome may also contain more POT members than are currently cloned. Table 2 includes a number of deposited sequences encoding the main intestinal and renal transporters, hPepTl (sequences 1-3) and hPepT2 (sequence 12), and their orthologues in other mammalian species . The pH sensing regulatory factor of peptide transporter (sequence 20) (Saito et al. (1997) Biochem Biophys Res Commun., 237: 577-582) appears to represent a possible truncated splice isoform of hPepTl. Also shown in the core cluster of Table 2, the rat peptide/histidine transporter (rPHTI) (sequence 50; E = 5 x 10"50 against hPepTl) is adjacent to a new sequence with high similarity to it, namely the mouse cAMP-inducible protein 1 (sequence 51). The latter was recently cloned from a lymphoid cell line and had not been suspected to belong to the POT family. We demonstrate below that both rPHTI and mouse cAMP- inducible 1 protein have apparent human orthologues, which we term hPHTl and hPHT2, respectively. This indicates that rPHTI and mouse cAMP-inducible 1 protein represent two distinct but closely related genes belonging to the POT family. The core cluster of POT sequences contains one additional human sequence, namely, erythroid differentiation-related factor 2 (sequence 42.41, Table 2). The E value (4 x 10" ) suggests probable homology to a putative POT transporter of Bacillus subtilis (sequence 42; 13 predicted TMDs). However, this sequence is rather short (107 residues), showing good sequence similarity with TMD11 and adjacent loop of the B subtilis transporter.
In summary, the ENCA analysis of the protein sequence databases revealed the presence of 4 mammalian sequences as probable members of POT: PepTI, PepT2, PHT1, and cAMP-inducible 1 protein (the latter two termed PHT1 and PHT2 in our nomenclature).
Second Pass INCA: Scanning the Human EST Database
All 68 sequences in the core cluster (Table 2) served in a second ENCA pass to search the human EST database. The resultant list of core ESTs (E < 10"6), sorted by highest similarity to one of the 68 core protein sequences, is also included with Table 2. Curiously, no EST scored best with PepTI, presumed to be the main intestinal transporter in rodents and, by inference, in humans (intestinal ESTs may be underrepresented in the available EST databases). Further, seven ESTs assorted with the cloned human hPepT2; however, it remains to be seen whether all seven are indeed representatives of hPepT2. With E scores exceeding 10"10, it is possible that these ESTs with moderate similarity could represent distinct genes or splice variants (not analyzed further). Numerous human ESTs scored best with the rat peptide histidine transporter, rPHT (sequence 50), and with mouse cAMP-inducible 1 protein (sequence 51). We therefore assembled these ESTs into contiguous sequences to identify the respective human orthologous gene products, termed hPHTl and hPHT2.
In the human EST core cluster (Table 2), two single ESTs scored best with putative peptide transporters from bacteria (sequences 42 and 48 of the core proteins of POT). Additional analysis will be required to ascertain whether the respective human genes suggested by these two ESTs could represent yet additional members of the human POT gene family.
hPHTl and hPHT2 Sequences Assembled from Human ESTs
Scanning the human EST database, we have identified numerous ESTs that appear to represent the human orthologues of rPHT and cAMP-inducible 1 protein. Table 3 lists the ESTs firmly assigned to the presumed human orthologue of rPHTI (hPHTl) and of cAMP-inducible 1 protein (termed here hPHT2) (cutoff E value -10" 20). For comparison, Table 3 also displays the ESTs assigned to hPepT2, whereas no ESTs appeared to represent hPepTl. The tissue source of the numerous ESTs representing hPHTl is of considerable interest because in the rat, rPHT expression is largely confined to the brain and retina. Yet, the hPHTl -related ESTs derive from many body tissues, including colon carcinoma. This raises the possibility that hPHTl is broadly expressed in many human tissues.
Table 3. ESTs from BLAST (blastn) assigned to hPepT2, hPHTl, and hPHT2. Criteria of inclusion for alignments are >90% identity in sequences longer than 60 bps.
PepT2 vs human EST: gi|3163329|gb|AA984804 am93cl0.sl Stratagene schizo br... 277 le-75 gi[1544978|gb|AA054054 zf48e05.rl Soares retina N2b4HR... 213 2e-62 gi 12575676 |gb|AA649247 ns35dll.sl NCI_CGAP_GCBl Homo s... 118 5e-32 gi|575005 j emb | Z45771 | HSCZTG061H. sap. partial cDNA sequence 118 le-25 hPHTl vs human EST (ESTs from single tissues are bold-face)
1349645 gb 530 19 zc48dll.rl Soares senescent fibroblasts 1056 0.0 gi 5396524 gb AI80 9958 wf64b06.xl Soares_NFL_T_GBC_S ... 955 0.0 gi 5663458 gb AI927494 wo90el2.xl NCI_CGAP_Kidney Homo sap 949 0.0 gi 5109278 gb AI740990 wg25b07.xl Soares_NSF_F8_9W_0... 947 0.0 gi 5675802 gb AI936932 wp71b07.xl NCI_CGAP_Brain Homo sap 945 0.0 gi 5450343 gb AI829672 wf09b03. xl Soares_NFL_T_GBC_S . 859 0.0 gi 5396659 gb AI810093 f65d06.xl Soares_NF _T_GBC_S . 859 0.0 gi 4896654 gb AI685360 wa76f08.xl Soares_NF _T_GBC_S . 857 0.0 gi 5395461 gb AI808895.1 wf66c05.xl Soares_NFL_T_GBC_S . 850 0.0 gi 3899028 gb AI276754 ql64f04.xl Soares_NhHMPu_Sl Horn. 800 0.0 gi 1470970 gb AAOO 9923 zi07e01.rl Soares fetal liver spleen 792 0.0 gi 5391793 gb AI805139.1 tdllgOl.xl NCI_CGAP_CLL1 Homo... 790 0.0 gi 3933090 gb AI290316 qm02b04.xl Soares_NhHMPu_Sl Horn... 753 0.0 gi 3842747 gb AI247350.1 qh34g06.xl Soares_NFL_T_GBC_S ... 747 0.0 gi 1024648 gb H65908 yr69b03.rl Homo sapiens cDNA clone ... 745 0.0 gi 1046571 gb H73031 ysl0b07.rl Homo sapiens cDNA clone ... 743 0.0 gi 4739863 gb AI655884.1 tt41all.xl NCI_CGAP_GC6 Homo ... 699 0.0 gi 1616355 gb AA076486 zm91d04.rl Stratagene ovarian cancer 681 0.0 gi 3895536 gb AI273268 ql54el0.xl Soares_NhHMPu_Sl Horn... 665 0.0 gi 816183 gb R54281 yg78d02.rl Homo sapiens cDNA clone 3. 624 e-177 gi 5437154 gb AI818075.11 wkl4cll.xl NCI_CGAP lymph node H sap 590e-166 gi 3804209 gb AI222006 qg95d09.xl Soares_NFL_T_GBC_Sl ... 564 e-158 gi 5448546 gb AI827875.1 wf04d03.xl Soares_NFL_T_GBC_S ... 547 e-153 gi 1324288 gb 40166 zc82h02.rl Pancreatic Islet Homo sap. 543 e-152 gi 1976192 gb AA323865 EST26707 Cerebellum II Homo sap. 531 e-149 gi 2779368 gb AA740776 nz03dl2.sl NCI_CGAP_GCB1 Homo s... 525 e-147 gi 1933517 gb AA287835 zs50g05.sl NCI_CGAP_GCB1 Homo s... 432 e-119 gi 750175 gb R00439 ye72bl2.rl Homo sapiens cDNA clone 1... 420 e-115 gi 3049244 gb AA909954 om32h08.sl Soares_NFL_T_GBC_Sl ... 412 e-113 gi 3253724 gb AI032598 ow73h02.sl Soares_fetal_liver_spleen 368 le-99 gi 900971 gb H30061 yp44f09.rl Homo sapiens cDNA clone 1... 329 5e-88 gi 900981 gb H30071 yp44h09.rl Homo sapiens cDNA clone 1... 329 5e-88 gi 2783232 gb AA743881 ny25bl0.sl NCI_CGAP_GCB1 Homo s... 325 8e-87 gi 1858092 gb AA234173 zr49e07.rl Soares NhHMPu SI Horn... 323 3e-86 gi 2849230 gb AA789110 aa66d09.sl NCI_CGAP_GCB1 Homo sap 293 4e-77 gi 1989422 gb AA337185 EST41878 Endometrial tumor Homo sap 287 3e-75 gi 1959622 gb AA307073 EST178192 Colon carcinoma (HCC) ... 265 le-68 gi 1616244 gb AA076314 zm91d04.sl Stratagene ovarian cancer 261 2e-67 gi 1024562 gb H658 22 yr69b03.sl Homo sapiens cDNA clone .. 230 3e-58 gi 3076109 gb AA927212 o Oδfll.sl Soares_NFL_T_GBC_Sl . 228 le-57 gi 4688089 gb AI636759.1 ts89f09.xl NCI_CGAP_GC6 Homo . 198 2e-48 gi 1056010 gb H77921 ysl0b07.sl Homo sapiens cDNA clone .. 196 6e-48 gi 2752650 gb AA731761 nw56h09.sl NCI_CGAP_GCB1 Homo s. 154 4e-35 gi 2788570 gb AA748612 ny06b02.sl NCI_CGAP_GCB1 Homo s. 151 2e-34 gi 2820068 gb AA768830 nz64gl0.sl NCI_CGAP_GCBl Homo s. 147 3e-33 gi 3075545 gb AA926648 om28bll.sl Soares_NF _T_GBC_Sl . 119 8e-25 gi 3094496 gb AA936578 on78fl2.sl Soares_NF _T_GBC_Sl . 111 2e-22 gi 3052867 gb AA913475 ol30h05.sl Soares NFL T GBC SI . 109 9e-22 hPHT2 vs human EST gi 1873671 gb AA242853 zr64d03.rl Soares NhHMPu SI Horn.. 795 0.0 gi 668137 gb T64272 yc09b07.rl Homo sapiens cDNA clone ! ... 541 e-152 gi 2163000 gb AA448980 zx07dll.rl Soares total fetus N.. 379 e-103 gi 3162566 gb AA984041 oq02f07.sl NCI_CGAP_Lu5 Homo sa .. 311 le-82 gi 761243 gb R09320 yf26a08.rl Homo sapiens cDNA clone 1... 301 le-79 gi | 3649141 | gb | AI141684 ot08d05 . xl NCI_CGAP_GC3 Homo sa . . . 155 le-35
Approximately 50 ESTs served to assemble a contig DNA sequence of the presumed hPHTl nucleotide sequence Figure IA and Figure IC; SEQ ED NOS:5, 6, 7, 8). A schematics of the hPHTl contig assembly Figure IA contains the minimum number of EST's spanning the length of the deduced hPHTl sequence. To view the EST coverage of each segment, click on the EST/region of interest. This will reveal segments with numerous overlapping ESTs. Because multiple ESTs cover the same regions of hPHTl, one can deduce possible sequence variations in the human population where EST sequences are not identical. Clearly, many of these variations may be due to sequencing errors, but in a few cases a variation occurs more than once at the same location. These variations increase the probability that a genetic variant might be involved. The contig sequence is closely related to that of rPHTI; however, the first -50 5'-terminal base pairs are missing. (It appears that there is a rare Nøtl site at the 5'-end, which could have caused truncation during preparation for EST sequence analysis.) Thus, the EST contig is likely to represent >95% of the coding region of hPHTl. The deduced hPHTl amino acid sequence is shown in SEQ D NO: 5).
Putative Splice Variants of hPHTl
Several ESTs appear to span a sequence insert, suggesting the presence of possible splice variants shown inFigure IC, D, E, F, and G. Several variant ESTs assemble into a contig sequence containing an additional coding region of -100 amino acid residues in the predicted extracellular loop between TMDs 11 and 12 (Variant A). It will be interesting to see the functional consequences of this insertion. Yet 3 additional ESTs -; also representing hPHTl -; assembled into a contig sequence suggestive of a variant form with a somewhat smaller insertion in the same location Figure IC. Lastly, RT-PCR analysis of several tissues resulted in two bands, both found to be related to hPHTl upon sequence analysis. The faster migrating band contained a gap of 169 bps in the middle of hPHTl Figure IC, Variant B; for RT-PCR results see Figure 5 A). Some of the putative hPHTl splice variants may introduce a frame shift and would not be expected to result in a functionally active transporter. These variants need to be cloned individually and tested experimentally. The information presented here is therefore important for guiding cloning efforts to produce functional hPHTl protein. Genomic hPHT2 Sequence.
The human EST contig sequence corresponding to rat cAMP-inducible 1 protein served to scan the human nr nucleotide databases. This revealed a PAC clone (Pl- derived artificial chromosome) containing human genomic sequences closely related to the mouse cDNA-encoding cAMP-inducible 1 protein. Using the cDNA sequence of cAMP- inducible 1 protein, we were able to identify the likely introns and exons representing the presumed hPHT2 gene Figure 2; (SEQ ED Nos: 7-23). Each of the intron-exon boundaries are flanked by GT. . . AG in the intron sequence. Thus, the intron structure follows the GT- AG rule, where GT is called the splice donor and AT is called the splice acceptor. The deduced cDNA coding sequence and protein sequence are shown in
(SEQ ED NOS: 5-8). Table 4 lists the identities and similarities among the protein sequences of the main mammalian members of the POT family. While hPepTl and hPEPT2 represent one branch of this family, hPHTl, hPHT2, rPHT, and mouse cAMP-inducible 1 protein are closely related and form a second branch. hPHTl has 89% identity to rPHTI, while hPHT2 is 81% identical to mouse cAMP-inducible 1 protein. Multiple sequence alignments are provided in Figure 3, including either the PHT branch only (Figure 3A), or both branches (Figure 3B).
Table 4 Identities and similarities among six POT protein sequences* hPepTl hPepT2 hPHTl hPHT2 rPHTI camp- inducible 1 protein hPepTl 51.34 20.39 25.96 22.84 24.05 hPepT2 51.34 22.54 22.88 24.17 24.52 hPHTl 20.39 22.54 53.90 89.27 51.96 hPHT2 25.96 23.41 53.90 54.00 80.94 rPHTI 22.84 24.17 89.27 54.00 51.81 cAMP- 24.05 24.52 51.96 80.94 51.81 inducible
1 protein *POT indicates proton-dependent oligopeptide transporters.
Hydropathy analysis corroborates the close similarity within the PHT and PepTI branches of the POT family (Figure 4A). These hydropathy profiles predict the presence of 11-12 transmembrane domains, as reported earlier. Topological predictions are shown in Figure 4B for hPHTl and hPHT2. A cDNA sequence nearly identical to hPHT2 was deposited by K. Ishiabshi and M. Imai: AB020598, Homo sapiens mRNA for peptide transporter 3, complete eds, 2113 bps in length. This defines the 3' and 5' ends of hPHTl as having a coding sequence of 1740 bps. The deduced putative cDNA coding sequence is the identical length of the coding sequence suggested by our genomic hPHT2 sequence, and it is identical to hPHT2 over a large portion of the presumed coding region. However, there are also several remarkable differences. First, a fragment of 50 bps in the hPHT2 coding region from position 61-110 is replaced in the cloned cDNA by a 50-bp fragment of low complexity (cg-rich), which is excluded from BLAST analysis by a low complexity filter. This 50-bp cDNA fragment did not recognize any sequence in the PAC clone containing the hPHT2 genomic sequence, but it did recognize fragments in a number of unrelated genes, therefore, possibly representing a low complexity repeat fragment. The remainder of the cDNA sequence was identical to that of hPHT2, except for an insertion of three nucleotides each in three different locations of hPHT2 (at positions 837, 1271, and 1428 of hPHT2). These sequence variations would indicate the presence of three additional amino acids at these respective positions, without disturbing the overall reading frame. It remains to be seen how these changes from our deduced coding sequence came about and whether they are of functional significance. In any case, comparing the cDNA and genomic sequences reveals many details of the possible protein structure not available otherwise.
RT-PCR and Northern Blot Analysis of hPHTl and hPHT2 mRNA From
Human Tissues
The presence of numerous ESTs from many tissues representing the presumed hPHTl in the databases suggested that hPHTl might be expressed in many human tissues. RT-PCR analysis had revealed detectable expression of hPHTl in each of three tissues tested: intestines, skeletal muscle, and pancreas (for intestines, see Figure 5A). In each of these tissues, the presence of a shorter band suggested a possible splice isoform. The experimentally determined sequence Figure IB shows a deletion of 169 bases.
For Northern analysis we used two ESTs representing hPHTl and 2, which were labeled to detect the presence of mRNA in 12 human tissues Figure 5B. hPHTl was mainly expressed in skeletal muscle, followed by kidney, heart, and liver, with relatively little expression in colon and brain. mRNA bands were detected at apparent molecular weight 2.8 kb and 5.1 kb, indicating the presence of possible mRNA variants. The mRNA tissue distribution of hPHT2 differed significantly from that of hPHTl Figure 5B. A single major band appeared at 2.4 kb, with highest expression in spleen, placenta, lung, and leukocytes, followed by heart, kidney, and liver.
Discussion This study identifies several new putative members of the POT gene family, using a bioinformatics analysis of available sequence databases, including the human EST database. Because the POT family seemingly stands separate from other transporter gene families, searching for POT-related ESTs is facilitated. With an increasing number of sequences present in the publicly accessible databases, however, we now have begun to find transporters outside the POT gene family with alignment scores that support a finding of possible homology. The ENCA search, or any other exhaustive search tool, should be performed periodically to finds the missing links that can document POT's probable homology to other gene families. This will greatly facilitate the study of structure and function of these transporters because one would expect molecular architecture and functional domains to recur in homologous proteins. In this study, we have used gapped
BLAST analyses (Altschul et al. (1997) Nucleic Acids Res., 25: 3389-3402), which permits inclusion of gaps in local alignments. This carries the possible disadvantage that the permutation matrix used to calculate similarity scores may be inappropriate, in particular for the TMD sections. The use of psi BLAST (Altschul et al. (1997) Nucleic Acids Res., 25: 3389-3402) could overcome this problem by generating a position-specific matrix that would account for mutational drift in TMDs and loops separately. However, our iterative ENCA approach could result in overmodeling of the matrix, and thus, possible inclusion of unrelated sequences. Moreover, the primary goal here was to identify new human genes closely related to the known POT family members, rather than probing the most distant relationships.
By far, the largest number of human ESTs aligned best with rPHTI, a peptide-histidine transporter mainly expressed in rat brain. Assembly of numerous ESTs into a contig sequence termed hPHTl permits a number of observations on the putative coding sequence. More than 95% of the coding sequence expected from comparison to rPHTI can be derived from the assembled ESTs. In several regions of the deduced hPHTl sequence, multiple ESTs overlap, thereby providing a first glimpse of possible sequence variations in the human population. While EST sequencing is not rigorously quality controlled and single nucleotide variants occur only sporadically, multiple overlapping ESTs could nevertheless assist in finding single nucleotide polymorphisms (SNPs). There are several such candidate SNPs in the EST alignments: future work will determine whether these do, indeed, represent human sequence variations. However, the presence of possible splice variants of hPHTl was clearly indicated by an insert or gap in two regions of the hPHTl coding region. These were detected either by EST alignments or experimentally by RT-PCR Figure IB and Figure 5A. It will be of interest to determine the tissue expression and function of the splice variants. The insert into the loop between TMDs 11 and 12 introduces an additional 100 residue-sequence fragment into hPHTl. This region contains no homology to any other known protein.
A possible splice variant has previously been detected for hPepTl. Inue and colleagues (Saito et al. (1997) Biochem Biophys Res Commun., 237: 577-582) have cloned a cDNA from human duodenum with 1704 bp, encoding a predicted protein of only 208 residues (hPepTl-RF). Of these, residues 18-195 are identical to an equivalent region of hPepTl. This truncated hPepTl protein appears to lack ability to transport peptides; however, cotransfection of hPepTl-RF with hPepTl affected the pH sensitivity of peptide transport by hPepTl, suggesting a regulatory function for hPepTl-RF of yet unknown mechanism. The functions of splice variants of hPHTl remain to be determined.
Assembly of ESTs into a second contig sequence revealed high identity to the mouse cAMP-inducible 1 protein, which was isolated from lymphocytes as one of the upregulated mRNAs after stimulating with camp. Close similarity to PHT1 suggests that this gene also encodes a peptide transporter. Using this human contig sequence, we have identified a full-length genomic sequence in a PAC clone from which introns and exons can be predicted for the presumed hPHT2 gene. High identity to the entire mouse cAMP- inducible 1 protein suggests that hPHT2 is the human orthologue, closely related to the PHT- like branch of the POT family. This facilitates the cloning and testing of this putative new member of the human POT family.
Our comprehensive ENCA search uncovered several additional ESTs with sequences similar to members of the POT family. Using BLAST analyses with these ESTs as the query suggested the possibility of additional human POT genes. Further work is needed to complete the human POT family. Also, we cannot preclude that the proposed hPHT 1 and 2 genes, although highly similar to genes encoding rPHTI and cyclic AMP- inducible 1 protein, are not the immediate orthologues, but that there are as yet additional closely related human genes not represented in the EST database.
The strong representation of hPHTl in the EST database suggests that the presumed hPHTl is widely expressed in human tissues, largely in the CNS, in contrast to its restricted expression in rats. One of these ESTs stems from a human colon carcinoma, an indication that hPHTl may also be expressed in human intestines. We have corroborated this supposition with the use of RT-PCR; however, Northern blot analysis has revealed that hPHTl and KPHT2 are not highly expressed in intestines relative to other tissues. Protein expression and functional studies are required to determine whether these transporters, in addition to hPepTl, could play a role in intestinal peptoid drug absorption. This could be of considerable pharmacological interest, as previous studies have suggested that PepTI was the sole peptide transporter in rodent intestines, whereas no definitive experimental evidence has been reported on the responsible transporter subtype in humans. Bioavailability studies demonstrate that peptoid drugs, such as certain antibiotics, largely depend on intestinal absorption by peptide transporters to enter the systemic circulation. For example, coadministration of a dipeptide reduced the oral bioavailability of amoxicillin by 80% in human subjects providing strong evidence that a saturable dipeptide transport process is involved (Chulavatnatol and Charles (1994) Europ. J. Pharmacol, 40: 374-378). Whether hPHTl or 2 could play a role in oral antibiotic bioavailability remains to be seen. Our Northern blot analysis revealed strong expression for hPHTl in skeletal muscle and kidney while hPHT2 was highly expressed in leukocytes, lung placenta, and spleen. Detectable expression of both genes in organs, such as the heart, may be of interest in understanding the efficacy of antibiotic treatment of localized infections -; particularly if the infectious agent resides intracellularly. The tissue distribution of gene expression differs from that of hPepTl (mainly intestinal) and hPepT2 (mainly renal) which underscores the relevance of our findings to targeting therapy to specific organs. It is curious to note that neither hPHTl nor hPHT2 are strongly expressed in the brain, while rPHTI is highly expressed in the CNS. It will be of interest to establish the role of any peptide transporters expressed in human brain. Overall, our results indicate that the human POT family contains at least 4 genes encoding possible peptide transporters. Each displays a distinct pattern of tissue expression, providing a possible avenue for drug targeting to select tissues. Tissue distribution of POT gene expression is of particular interest for achieving oral bioavailability or targeting drugs to tumor tissues. For example, Tsuji et al have identified a fibrosarcoma cell line expressing an unusual HVdipeptide transporter activity (Nakanishi et al. (1997) Cancer Res., 57: 4118-4122). We have determined that hPepTl is highly expressed in pancreatic (Gonzales et al. (1998) Cancer Res., 58: 519-525) and colon adenocarcinomas, including liver metastases (M.Y. Covitz and W.S., unpublished data), considerably above the level seen in surrounding normal tissues. Moreover, each member of the peptide transporter family is likely to exhibit distinct selectivities for peptides and peptoid drugs. Thus, amino acid esters of 5'nucleosides (eg, valcyclovir) (Han et al. (1998) Pharm. Res., 15: 1154-1159) are recognized by hPepTl, even though the structure of these prodrugs is quite distinct from dipeptides. However, affinity of these nucleoside prodrugs for other peptide transporters remains to be determined. One could envisage a large variety of amino acid-nucleoside prodrugs of antivirals or anticancer agents to enhance oral bioavailability or target the drug to tumor tissues, as a function of which peptide transporter is expressed in the target tissue. Phenotypic characterization of tumor tissues as to which transporters are expressed and to what extent will become an important question that extends beyond the peptide transporter family, and indeed should be applied to all known drug transporters.
EXAMPLE 2 Genomic Structure of Proton-Coupled Oligopeptide Transporter hPEPTl and pH-
Sensing Regulatory Splice Variant Proton-coupled oligopeptide transporter PEPTI facilitates the transport of dipeptides and peptoid drugs (including antibiotics) across the cell membranes of endothelial and epithelial cells. Substrate transport by the proton symport is driven by pH gradients, while the profile of pH sensitivity is regulated by a closely related protein, hPEPTl-RF. We investigated the genomic structure of hPEPTl and hPEPTl -RF. Analysis of the high- throughput genomic sequence (HTGS) database revealed that hPEPTl and hPEPTl -RF are splice variants encoded by the same gene located in chromosome 13, consisting of 24 exons. hPEPTl is encoded by 23 exons and hPEPTl-RF by 6 exons. Coding sequences of hPEPTl- RF share 3 exons completely and 2 exons partially with hPEPTl. The genomic organization of hPEPTl shows high similarity with its mouse orthologue. Exon-intron boundaries occur mostly in the loops connecting transmembrane segments (TMSs), suggesting a modular gene structure reflecting the TMS-loop repeat units in hPEPTl. The putative promoter region of hPEPTl contains TATA boxes and GC-rich regions and a potential insulin responsive element.
Introduction.
Proton-coupled oligopeptide transporters (POTs) comprise the transport family 2. A.17 (for transporter classification see http://www.biology.ucsd.edu/~msaier/transport/titlep age.html). Oligopeptide transporters are symporters driven by the flux of protons; they have a molecular architecture consisting of -12 predicted TMSs (Sadee et al. (1995) Pharm Res. 12: 1823-1837). Members of the POT family include peptide transporter 1 (PEPTI) (Fei et al. (1994) Nature, 368: 563-566; Liang et al. (1995) J. Biol. Chem. 270: 6456-6463), peptide transporter 2 (PEPT2) (Liu et al.
(1995) Biochem Biophys Acta 1235: 461-466), peptide/histidine transporter 1 (PHT-1) (see Example 1), and peptide/histidine transporter 2 (PHT-2) (see Example 1). Recently, a cDNA termed PET3 (NM_016582), which is largely identical to PHT-2, has been deposited into the nr database (http://www.ncbi.nlm.nih.gov: 80/entrez/query.fcgi). The peptide transporter 1 gene of rabbits was cloned in 1994 (Fei et al. (1994)
Nature, 368: 563-566), and the human orthologue (hPEPTl) was cloned shortly after (Liang et al. (1995) J. Biol. Chem. 270: 6456-6463). Human PEPTI cDNA contains 3105 base pairs (bp), and the predicted protein consists of 708 amino acids. The transporter protein has 12 predicted TMSs and 2 putative protein kinase C phosphorylation sites. The membrane topology of the human dipeptide transporter, hPEPTl, was determined by epitope insertions by Covitz et al (Covitz et al. (1998) Biochem. 37: 15214-15221). PEPTI is expressed in the intestine (brush border), early proximal kidney tubuli, liver, placenta, and pancreas (Liang et al. (1995) 7. Biol. Chem. 270: 6456-6463; Shen et al. (1999) Am J Physiol. 276: F658- F665). In the intestines, PEPTI facilitates absorption of digested dipeptides so that most of the dietary nitrogen is absorbed as dipeptides rather than as amino acids (Ganapathy and Leibach (1999) pages 456-467 In: Yamada T, ed. Textbook of Gastroenterology. Philadelphia, PA: Lippincott Williams and Wilkins).
Human PEPTI has broad substrate specificity. The substrates include di- and tripeptides and peptoid drugs. Thus, PEPTI mediates the high bioavailability of many hydrophilic beta-lactam antibiotics (Terada et al. (1999) Am J Physiol. 276: G1435-G1441). In addition, PEPTI is suggested to play a role in intracellular peptide transport, including lysosomal transport (Gonzales et al. (1998) Cancer Res. 519-525). Saito et al. (1997) Biochem Biophys Res Commun. 237: 577-582, have described a highly related transcript, termed hPEPTl-RF, which modulates the activity of human PEPTI. The cDNA for the regulatory factor encodes an open reading frame of 208 amino acids. Residues 18-195 are identical to residues 8-185 in hPEPTl, while sequences 1- 17 and 196-208 are unique. Both hPEPTl and hPEPTl-RF are expressed in Caco-2 cells. Expression studies in Xenopus oocytes and Caco-2 cells showed that the regulatory factor shifted the pH-sensitivity profile of hPEPTl -mediated peptide transport (Saito et al. (1997) Biochem Biophys Res Commun. 237: 577-582). Although somatic cell hybrid analysis and in situ hybridization studies of Liang et al. (1995) J. Biol. Chem. 270: 6456-6463, positioned hPEPTl to chromosome 13 q33-q34, the genomic structures of human PEPTI and hPEPTl- RF were not known. Genomic organization of the mouse PEPTI gene has been reported recently (Fei et al. (2000) Biochem Biophys Acta. 1492: 145-154) as having a length of 38 kb with 23 exons. The aim of this study was to determine the genomic structure of hPEPTl and hPEPTl-RF. We identified a common gene with 24 exons encoding both hPEPTl and the regulatory factor in clones representing chromosome 13. hPEPTl and hPEPTl -RF are splice variants of the same gene.
Materials and Methods
Advanced (BLAST) analysis was carried out using the National Center for Biotechnology Information (NCBI) Web server (http://ncbi.nlm.nih.gov). BLOSUM62 matrix was used with default parameters. The analysis was done with and without filtering of the low-complexity sequences and without masking of repetitive elements. Queries used the cDNA sequences of human PEPTI (accession number: NM_005073) and hPEPTl-RF (AB001328) and the high-throughput genomic sequence (HTGS) database.
Using the accession number of the mRNA sequence, we retrieved the (CDS) sequence from NCBI and performed a BLAST search of the HTGS database. Results were filtered using the blastflt.py code written by Arne Mueller (BLAST2 Parser ver. 1.2, ©Arne Mueller, http://www.bmm.icnet.uk/people/mueller. The obtained hits were filtered to ensure that only data from the same species and chromosome were used. The alignments served to locate the exons in the genomic sequence. When problems arose, the sequences were examined by hand to attempt to resolve or identify possible alternative splice sites. Membrane topological prediction was done using the (TOPPRED) program. The sequences 2 kb upstream from the transcription start sites of hPEPTl and hPEPTl -RF were investigated using programs FindPatterns and FitConsensus (Genetics Computer Group, Madison, WI)to locate possible promoters and enhancer sites.
Results. Bioinformatic analysis revealed that hPEPTl and hPEPTl -RF are encoded by the same gene located in chromosome 13, clone RP11-56D6 (accession: AL357553). hPEPTl contains 23 exons (Table 5, Figure 6), and hPEPTl -RF contains 6 exons (Table 6, Figure 1). Human PEPTI and hPEPTl -RF share 3 exons completely, and 2 exons are partially shared (Figure 6). Therefore, hPEPTl -RF and hPEPTl are splice variants of the same gene that has in total 24 exons. Over the course of the study, additional genomic clones became available containing all hPEPTl exons in several contiguous fragments, and these served to verify the order and intronic sizes provided in Table 5 and Table 6.
Figure imgf000073_0001
refers to the codon in the 5' end of the exon. Uncertain sizes of introns are indicated by > sign due to the boundary between two fragments of the clone in HTGS database, b Size of whole exon/ size of the part spliced into hPEPTl mRNA.
Table 6.. Exon- intron organization of hPEPTl- RF gene.
Figure imgf000074_0001
Exon boundaries are shown in upper case and intron boundaries in lower case. Codon phase refers to the codon in the 5' end of the exon. Εxon T is repetitive element UTR.
All the exon-intron boundaries for hPEPTl conform to the consensus splice junction sequences (gt ag) for eukaryotic genes (Shapiro and Senapathy (1987) Nucl Acids Res. 15: 7155-7174). The 9 conserved nucleotides in the 5' donor side are (A/C)AG/gt(a/g)agt. These are conserved at 64%, 73%, 50%, 100%, 100%, 86%, 70%, 83%, and 77%, respectively, in hPEPTl. Similarly, we found positions of the 3'-acceptor site, (c/a)ag/(A/G), to be conserved at 73%, 100%, 100%, and 73%, respectively (Table 5). Splice sites are classified in phase 0 (13 sites), phase 1 (4 sites), and phase 2 (5 sites) (Table 5).
The hPEPTl gene structure shows several interesting features. The start sites of the transcripts for hPEPTl and pH-regulatory factor are located in different exons (Figure 6). Moreover, exon 1 located >20 kb upstream of exon 2 contains only the first 4 nucleotides of the hPEPTl coding region. Alternative splicing occurs in exon 3, and 118 bases in the 5' end of exon 3 are spliced out of the mRNA of hPEPTl. Another site for differential splicing is exon 7 of hPEPTl -RF. In this case, 41 bases in the 3' end of the exon are spliced out of hPEPTl hmRNA (Figure 6).
Membrane topology predictions of hPEPTl and hPEPTl -RF proteins are shown in Figures 2 and 3. The transmembrane topology schematics were rendered using TOPO (S.J. Johns and R.C. Speth, Transmembrane protein display software, http://www.sacs.ucsf.edu/TOPO/topo.html, unpublished data). The figures show the peptide sequences that are encoded by each exon. In accordance with earlier information, hPEPTl is predicted to have 12 transmembrane segments (TMSs). Interestingly, comparison of membrane topology with gene structure shows possible functional modularity. Few if any exon-intron boundaries are found within the TMSs, and in most cases each exon encodes for a single TMS-loop unit (Figure 2). Topological predictions suggest that hPEPTl-RF has 5 TMSs with a cytoplasmic N-terminal and extracellular C-terminal.
The upstream region (2 kb) from the transcription start sites of hPEPTl is shown in Figure 4. TATA boxes were found about 520 bp upstream from the transcription start site in hPEPTl. The putative regulatory region also contains GC boxes, so several GC boxes are located within 300 bp from the transcription site in hPEPTl. Binding sites for transcription factors did not include any amino acid responsive element. Some other transcription factor binding sites of the regulatory regions are illustrated in Figure 9.
Discussion.
The genomic structure of hPEPTl and hPEPTl-RF presented here is based on a sequence in the HTGS database. The HTGS contains yet unordered pieces of genomic sequences. We used the August 11, 2000, version of the clone AL357553 in this analysis. It contained 11 contigs, but the true order of these pieces was unknown, and the size of the gaps between them were subject to change. Three introns of hPEPTl include such gaps (indicated by > signs in Table 5), while hPEPTl -RF exons are all located in one contig. Within the contigs the sequences are likely to be unaffected, and intron sizes are reliable
(Table 5). Note also that the order of the exons in the clone matches perfectly the nucleotide sequence of cDNA. Where possible, these predictions have subsequently been verified and the intronic sizes adjusted where needed, on the basis of additional genomic clones deposited in the HTGS database. Human PEPTI is encoded by 23 exons, and the entire gene contains 24 exons. Likewise, mouse PEPTI is encoded by 23 exons (Fei et al. (2000) Biochem Biophys Acta. 1492: 145-154). Comparison of mouse and human genes shows that the sizes of the exons and their relative locations are similar. Identity of mouse and human cDNA for PEPT s is 83% (Fei et al. (2000) Biochem Biophys Acta. 1492: 145-154). A high degree of similarity in both gene clustering and coding sequence confirm that human and mouse PEPTI genes are orthologues. In this study the comparison of membrane topological prediction and genomic structure indicates that human PEPTI gene is modular with each TMS-loop unit encoded by a different exon (Figure 7 and Figure 8). This is in accordance with earlier analysis of peptide transporters that suggested modular structure of transporter genes may have evolved by exon shuffling and rearrangements of functional modules (Graul and Sadee (1997) Pharm Res. 14: 388-400). The /iPEPTlgene also encodes the splice variant hPEPTl-RF. PEPT1-RF and PEPTI share 5 identical TMSs, while the extramembraneous terminals differ (Figure 6, Figure 7, and Figure 8). PEPT1-RF is not capable of transporting substrates across the membrane, but it is thought to sense pH changes and modulate the response of PEPTI to these changes (Saito et al. (1997) Biochem Biophys Res Commun. 237: 577-582). Fei et al. (1998) Biochem Biophys Res Commun. 246: 39-44, have shown by using chimeric PEPT1- PEPT2 proteins that the TMSs 7-9 are important for substrate recognition by hPEPTl. PEPT1-RF does not have these TMSs and does not transport substrates. However, the mechanisms of proton and substrate transfer and the interplay between PEPTI and PEPT1- RF are still elusive. The putative regulatory region of hPEPTl (Figure 9) revealed some similarities with the mouse PEPTI gene (Fei et al. (2000) Biochem Biophys Acta. 1492: 145- 154). TATA boxes are located in unusual locations (511 bp and 517 bp upstream from the transcription start site), while GC boxes are located near the start site (at -29 bp and several others within 300 bp). The location of TATA boxes so far upstream from the transcription start site is not optimal. Therefore, this kind of structure suggests that the GC box is a more important promoter in the regulation of hPEPTl than is the TATA box. Note also that there may be more than one transcription start site for a gene, as shown previously (Pave-Preux M et al. (1990) J Biol Chem. 265: 4444- 4448). Unlike in the mouse genome, amino acid responsive element was not found within 1983 bp from the transcription start site in PEPTI. Human PEPTI expression is known to be upregulated by its substrates, dipeptides, as shown by Walker et al (Walker et al. (1998) J Physiol. 507: 697-706), but the mechanism of this upregulation remains unclear. Insulin regulates the activity of PEPTI in Caco-2 cells (Thamotharan et al. (1999) Am J Physiol. 276: C821-C826). Insulin regulation was mediated by transporter translocation to the basolateral side of the cells upon release of hPEPTl from the translated intracellular pool to the plasma membrane. Changes in hPEPTl mRNA were not seen in that study. However, the putative insulin responsive element is located upstream from the transcription start site (Figure 9), suggesting that insulin might be involved in the regulation of hPEPTl transcriptional activity. The genomic organization of hPEPTl and hPEPTl- RF indicates that they are splice variants of the same gene (Figure 6). Expression of hPEPTl -RF has not been studied in detail. Nevertheless, the splice variants may be expressed in different proportions depending on, for example, the stage of differentiation, hormonal regulation signals, and cell type. Human PEPTI is expressed in several tissues (intestine, kidney, brain, liver) where the pH environment is quite different. Also, an intracellular pool of hPEPTl may be associated with peptide trafficking in lysosomes and endosomes that have different pH depending on the maturity of the vesicle (Gonzales et al. (1998) Cancer Res. 519-525).
It is understood that the examples and embodiments described herein are for illustrative purposes only and that various modifications or changes in light thereof will be suggested to persons skilled in the art and are to be included within the spirit and purview of this application and scope of the appended claims. All publications, patents, and patent applications cited herein are hereby incorporated by reference in their entirety for all purposes.

Claims

CLAIMSWhat is claimed is;
1. An isolated nucleic acid encoding a proton-coupled peptide transporter, said nucleic acid comprising a nucleic acid or the complement of a nucleic acid selected from the group consisting of: a nucleic acid that specifically hybridizes to hPHTl or hPHT2 under stringent conditions and that encodes a proton-coupled peptide transporter; a nucleic acid that has 90% or greater sequence identity with hPHTl or hPHT2 and that encodes a proton-coupled peptide transporter; a nucleic acid that encodes an hPHTl peptide transporter protein or an hPHT2 peptide transporter protein; an hPHTl or hPHT2 splice variant; a nucleic acid comprising the nucleotide sequence of a nucleic acid amplified using primers of SEQ ED NO:l and SEQ ED NO:2, and human intestinal DNA as a template; a nucleic acid comprising the nucleotide sequence of a nucleic acid amplified using primers of SEQ ED NO:3 and SEQ ED NO:4, and human intestinal DNA as a template; a nucleic acid that is amplified using primers of SEQ ED NO:l and SEQ ED NO:2, and human intestinal DNA as a template; a nucleic acid that is amplified using primers of SEQ ED NO:3 and SEQ ED NO:4, and human intestinal DNA as a template; a nucleic acid that hybridizes under stringent conditions to a nucleic acid amplified using primers of SEQ ED NO:l and SEQ ED NO:2, and human intestinal DNA as a template, wherein said nucleic acid encodes a peptide transporter; a nucleic acid that hybridizes under stringent conditions to a nucleic acid amplified using primers of SEQ ED NO:3 and SEQ ED NO:4, and human intestinal DNA as a template, wherein said nucleic acid encodes a peptide transporter; and a nucleic acid that comprises at least 15 contiguous nucleotides of an a an hPHTl peptide transporter protein or an hPHT2 transporter protein.
2. The nucleic acid of claim 1, wherein said nucleic acid is in an expression cassette.
3. The nucleic acid of claim 2, wherein said nucleic acid is operably linked to a constitutive promoter.
4. The nucleic acid of claim 2, wherein said nucleic acid is operably linked to an inducible promoter.
5. The nucleic acid of claim 2, wherein said nucleic acid is operably linked to a tissue-specific promoter.
6. The nucleic acid of claim 1, wherein said nucleic acid is a vector.
7. A cell transfected with the nucleic acid of claim 6.
8. The cell of claim 7, wherein said cell expresses a polypeptide encoded by said nucleic acid.
9. A polypeptide comprising a proton-coupled peptide transporter encoded by a nucleic acid of claim 1.
10. An immunogenic polypeptide comprising a proton-coupled peptide transporter or a fragment thereof encoded by a nucleic acid of claim 1 or a fragment thereof.
11. The polypeptide of claim 10, wherein an antibody that specifically binds to said polypeptide also specifically binds to a full-length proton-coupled peptide transporter.
12. An antibody that specifically binds a polypeptide of claim 9.
13. The antibody of claim 12, wherein said antibody is selected from the group consisting of a complete antibody, an antibody fragment, a single-chain antibody, an antibody displayed on a phage, an antibody displayed on a bacterium.
14. A computer readable medium having recorded thereon the nucleotide sequence of a nucleic acid of claim 1.
15. The computer readable medium of claim 14, wherein said medium is selected from the group consisting of a floppy disc, a hard disc, a CD disc, a DVD disc, a random access memory (RAM), a read-only memory (ROM), and a flash memory.
16. A method of identifying a compound whose cellular uptake is mediated by a hPHTl or hPHT2 peptide transporter, said method comprising: i) contacting a cell expressing a peptide transporter selected from the group consisting of an hPHTl transporter, and a KPHT2 transporter with a test compound; and ii) detecting uptake of said test compound by said cell where elevated uptake of said compound by said cell as compared to a cell expressing the peptide transporter at a lower level indicates that said peptide transporter mediates transport of said test compound.
17. The method of claim 16, wherein said cell is a cell transfected with a vector that expresses the hPHTl or hPHT2 peptide transporter.
18. The method of claim 17, wherein said cell is a human somatic cell.
19. The method of claim 17, wherein said cell is an oocyte.
20. The method of claim 16, wherein said compound is a drug.
21. The method of claim 16, wherein said compound is a prodrug.
22. A method of targeting a drug to a tissue that expresses a hPHTl or hPHT2 peptide transporter, said method comprising identifying a drug that is transported by a hPHTl or hPHT2 peptide transporter; and contacting said tissue with said drug.
23. The method of claim 22, further comprising identifying a tissue that expresses a hPHTl or KPHT2 peptide transporter.
24. The method of claim 22, wherein said identifying comprises selecting a drug known to be transported by a hPHTl or hPHT2 peptide transporter.
25. The method of claim 22, wherein said identifying comprises screening for a drug that is transported by a hPHTl or hPHT2 peptide transporter.
26. The method of claim 25, wherein said screening is according to the method of claim 16.
27. The method of claim 22, wherein said peptide transporter is hPHTl and hPHT2 and said tissue is heart.
28. A method of identifying an agent that modulates expression of an hPHTl peptide transporter or an hPHT2 peptide transporter, said method comprising: contacting a cell comprising a gene encoding an hPHTl peptide transporter or an hPHT2 peptide transporter with a test agent; and detecting the expression level or activity level of hPHTl peptide transporter or hPHT2 peptide transporter where a difference in expression level of hPHTl or hPHT2 as compared to the expression level of hPHTl or hPHT2 in a cell contacted with a different amount of said agent indicates that said agent modulates expression of the hPHTl peptide transporter or the hPHT2 peptide transporter.
29. The method of claim 28, wherein said different amount is the absence of said test agent.
30. The method of claim 28, wherein said detecting comprises detecting an hPHTl mRNA or an /.PHTC mRNA.
31. The method of claim 30, wherein said level of hPHTl mRNA or hPHT2 mRNA is measured by hybridizing said mRNA to a probe that specifically hybridizes to an hPHT2 or to an hPHTl nucleic acid.
32. The method of claim 32, wherein said hybridizing is according to a method selected from the group consisting of a Northern blot, a Southern blot using DNA derived from the hPHTl or hPHT2 RNA, an array hybridization, an affinity chromatography, and an in situ hybridization.
33. The method of claim 30, wherein said probe is a member of a plurality of probes that forms an array of probes.
34. The method of claim 30, wherein the level of hPHTl mRNA or hPHT2 RNA is measured using a nucleic acid amplification reaction.
35. The method of claim 28, wherein said detecting comprises detecting an hPHTl protein or an hPHT2 protein.
36. The method of claim 35, wherein said detecting is via a method selected from the group consisting of capillary electrophoresis, a Western blot, mass spectroscopy, ELISA, immunochromatography, and immunohistochemistry.
37. The method of claim 28, wherein said cell is cultured ex vivo.
38. The method of claim 28, wherein said cell is a human somatic cell.
39. The method of claim 28, wherein said cell is a human intestinal cell.
40. The method of claim 28, wherein said cell is a cell from human heart tissue.
41. The method of claim 28, wherein said test agent is contacted to an animal comprising a cell containing the hPHTl or hPHT2 nucleic acid or the hPHTl or HPHT2 protein.
42. A method of prescreening for an agent that agent that modulates expression or activity of an hPHTl peptide transporter or an hPHT2 peptide transporter, said method comprising: i) contacting an hPHTl or hPHT2 nucleic acid or an hPHTl or hPHT2 protein with a test agent; and ii) detecting specific binding of said test agent to said hPHTl or hPHT2 protein or nucleic acid.
43. The method of claim 42, further comprising recording test agents that specifically bind to said hPHTl or hPHT2 nucleic acid or protein in a database of candidate agents that alter peptide transporter activity.
44. The method of claim 42, wherein said test agent is not an antibody.
45. The method of claim 42, wherein said test agent is not a protein.
46. The method of claim 42, wherein said test agent is not a nucleic acid.
47. The method of claim 42, wherein said test agent is a small organic molecule.
48. The method of claim 42, wherein said detecting comprises detecting specific binding of said test agent to said hPHTl or hPHT2 nucleic acid.
49. The method of claim 48, wherein said binding is detected using a method selected from the group consisting of a Northern blot, a Southern blot using DNA derived from a hPHTl or hPHT2 RNA, an array hybridization, an affinity chromatography, and an in situ hybridization.
50. The method of claim 42, wherein said detecting comprises detecting specific binding of said test agent to said hPHTl or hPHT2 protein.
51. The method of claim 50, wherein said detecting is via a method selected from the group consisting of capillary electrophoresis, a Western blot, mass spectroscopy, ELISA, immunochromatography, and immunohistochemistry.
52. The method of claim 42, wherein said test agent is contacted directly to the hPHTl or hPHT2 nucleic acid or to the hPHTl or hPHT2 protein.
53. The method of claim 42, wherein said test agent is contacted to a cell containing the hPHTl or hPHT2 nucleic acid or the hPHTl or hPHT2 protein.
54. The method of claim 53, wherein said cell is cultured ex vivo.
55. The method of claim 42, wherein said test agent is contacted to an animal comprising a cell containing the hPHTl or hPHT2 nucleic acid or the hPHTl or hPHT2 protein.
56. A kit comprising a container containing a reagent selected from the group consisting of a nucleic acid of claim 1, a cell of claim 7, and antibody of claim 12.
57. The kit of claim 56, further comprising instructional materials describing the assays of claim 16 or claim 22.
PCT/US2001/004799 2000-02-14 2001-02-14 Novel members of the h+/oligopeptide transporter gene family WO2001060854A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU2001238290A AU2001238290A1 (en) 2000-02-14 2001-02-14 Novel members of the h+/oligopeptide transporter gene family

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US18232800P 2000-02-14 2000-02-14
US60/182,328 2000-02-14
US78295901A 2001-02-13 2001-02-13
US09/782,959 2001-02-13

Publications (2)

Publication Number Publication Date
WO2001060854A1 true WO2001060854A1 (en) 2001-08-23
WO2001060854A8 WO2001060854A8 (en) 2001-11-29

Family

ID=26877992

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2001/004799 WO2001060854A1 (en) 2000-02-14 2001-02-14 Novel members of the h+/oligopeptide transporter gene family

Country Status (1)

Country Link
WO (1) WO2001060854A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002024913A2 (en) * 2000-09-25 2002-03-28 Millennium Pharmaceuticals, Inc. 32612, a novel human peptide transporter and uses therefor

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1999066041A1 (en) * 1998-06-16 1999-12-23 Human Genome Sciences, Inc. 94 human secreted proteins

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1999066041A1 (en) * 1998-06-16 1999-12-23 Human Genome Sciences, Inc. 94 human secreted proteins

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002024913A2 (en) * 2000-09-25 2002-03-28 Millennium Pharmaceuticals, Inc. 32612, a novel human peptide transporter and uses therefor
WO2002024913A3 (en) * 2000-09-25 2003-05-15 Millennium Pharm Inc 32612, a novel human peptide transporter and uses therefor

Also Published As

Publication number Publication date
WO2001060854A8 (en) 2001-11-29

Similar Documents

Publication Publication Date Title
Chou et al. Tic40, a membrane‐anchored co‐chaperone homolog in the chloroplast protein translocon
Ham et al. A polypyrimidine tract binding protein, pumpkin RBP50, forms the basis of a phloem-mobile ribonucleoprotein complex
Gottschalk et al. Identification by mass spectrometry and functional analysis of novel proteins of the yeast [U4/U6· U5] tri‐snRNP
Back et al. ER stress signaling by regulated splicing: IRE1/HAC1/XBP1
US7736853B2 (en) Methods of diagnosis of androgen-dependent prostate cancer, prostate cancer undergoing androgen withdrawal, and androgen-independent prostate cancer
Echard et al. Alternative splicing of the human Rab6A gene generates two close but functionally different isoforms
AU2010202722B2 (en) Juvenile hemochromatosis gene (HFE2A), expression products and uses thereof
US6096515A (en) NF-AT polynucleotides
Hendrickson et al. IC138 is a WD-repeat dynein intermediate chain required for light chain assembly and regulation of flagellar bending
Mingot et al. Ambient pH signaling regulates nuclear localization of the Aspergillus nidulans PacC transcription factor
Chambraud et al. FAP48, a new protein that forms specific complexes with both immunophilins FKBP59 and FKBP12: prevention by the immunosuppressant drugs FK506 and rapamycin
CA2459219A1 (en) Methods of diagnosis of cancer compositions and methods of screening for modulators of cancer
US5523227A (en) DNA encoding calcium-signal modulating cyclophilin ligand
AU2008229749A1 (en) Novel compositions and methods for cancer
KR20080042162A (en) Composition and method for diagnosing kidney cancer and estimating kidney cancer patient&#39;s prognosis
US5837514A (en) IκB kinases
Goehring et al. MyRIP anchors protein kinase A to the exocyst complex
Pu et al. The balance of RanBP1 and RCC1 is critical for nuclear assembly and nuclear transport
AU2009202600A1 (en) Novel compositions and methods for cancer
Geng et al. Saccharomyces cerevisiae Rab-GDI displacement factor ortholog Yip3p forms distinct complexes with the Ypt1 Rab GTPase and the reticulon Rtn1p
JP2003159059A (en) Identification and use of molecule associated with pain
Chirivi et al. Characterization of multiple transcripts and isoforms derived from the mouse protein tyrosine phosphatase gene Ptprr
Bonnet‐Corven et al. An analysis of the sequence requirements of EDEN‐BP for specific RNA binding
Harris et al. Interaction between the F plasmid TraA (F‐pilin) and TraQ proteins
AU2009201627B2 (en) Inhibition of tristetraproline for protection of the heart from cardiac injuries

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
AK Designated states

Kind code of ref document: C1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: C1

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

CFP Corrected version of a pamphlet front page
CR1 Correction of entry in section i

Free format text: PAT. BUL. 34/2001 UNDER (30) REPLACE "60/271758" BY "09/782959"

DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

Ref country code: DE

Ref legal event code: 8642

122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase

Ref country code: JP