[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

EP2121920B1 - Method for cloning cognate antibodies - Google Patents

Method for cloning cognate antibodies Download PDF

Info

Publication number
EP2121920B1
EP2121920B1 EP08706942A EP08706942A EP2121920B1 EP 2121920 B1 EP2121920 B1 EP 2121920B1 EP 08706942 A EP08706942 A EP 08706942A EP 08706942 A EP08706942 A EP 08706942A EP 2121920 B1 EP2121920 B1 EP 2121920B1
Authority
EP
European Patent Office
Prior art keywords
cells
variable region
multiplex
sequences
primer
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
EP08706942A
Other languages
German (de)
French (fr)
Other versions
EP2121920A2 (en
Inventor
Per-Johan Meijer
Lars Soegaard Nielsen
Jesper Kastrup
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Symphogen AS
Original Assignee
Symphogen AS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Family has litigation
First worldwide family litigation filed litigation Critical https://patents.darts-ip.com/?family=38561767&utm_source=google_patent&utm_medium=platform_link&utm_campaign=public_patent_search&patent=EP2121920(B1) "Global patent litigation dataset” by Darts-ip is licensed under a Creative Commons Attribution 4.0 International License.
Application filed by Symphogen AS filed Critical Symphogen AS
Priority to PL08706942T priority Critical patent/PL2121920T3/en
Priority to SI200830415T priority patent/SI2121920T1/en
Publication of EP2121920A2 publication Critical patent/EP2121920A2/en
Application granted granted Critical
Publication of EP2121920B1 publication Critical patent/EP2121920B1/en
Priority to CY20111101105T priority patent/CY1112664T1/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • C12N15/1034Isolating an individual clone by screening libraries
    • C12N15/1093General methods of preparing gene libraries, not provided for in other subgroups
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K16/00Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
    • C07K16/005Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies constructed by phage libraries
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K16/00Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
    • C07K16/12Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from bacteria
    • C07K16/1267Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from bacteria from Gram-positive bacteria
    • C07K16/1282Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from bacteria from Gram-positive bacteria from Clostridium (G)
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K16/00Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
    • C07K16/18Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans
    • C07K16/28Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans against receptors, cell surface antigens or cell surface determinants
    • C07K16/2863Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans against receptors, cell surface antigens or cell surface determinants against receptors for growth factors, growth regulators
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6844Nucleic acid amplification reactions
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2317/00Immunoglobulins specific features
    • C07K2317/20Immunoglobulins specific features characterized by taxonomic origin
    • C07K2317/24Immunoglobulins specific features characterized by taxonomic origin containing regions, domains or residues from different species, e.g. chimeric, humanized or veneered
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2317/00Immunoglobulins specific features
    • C07K2317/70Immunoglobulins specific features characterized by effect upon binding to a cell or to an antigen
    • C07K2317/73Inducing cell death, e.g. apoptosis, necrosis or inhibition of cell proliferation
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2317/00Immunoglobulins specific features
    • C07K2317/70Immunoglobulins specific features characterized by effect upon binding to a cell or to an antigen
    • C07K2317/76Antagonist effect on antigen, e.g. neutralization or inhibition of binding
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2537/00Reactions characterised by the reaction format or use of a specific feature
    • C12Q2537/10Reactions characterised by the reaction format or use of a specific feature the purpose or use of
    • C12Q2537/143Multiplexing, i.e. use of multiple primers or probes in a single reaction, usually for simultaneously analyse of multiple analysis

Definitions

  • the present invention relates to a procedure for linking cognate pairs of VH and VL encoding sequences from a population of cells enriched in particular surface antigen markers.
  • the linking procedure involves a multiplex molecular amplification procedure capable of linking nucleotide sequences of interest in connection with the amplification, in particular polymerase chain reaction (multiplex PCR).
  • multiplex PCR polymerase chain reaction
  • the method is particularly advantageous for the generation of cognate pair libraries as well as combinatorial libraries of variable region encoding sequences from immunoglobulin.
  • the invention also relates to methods for generation of chimeric human/non-human antibodies and expression libraries generated by such methods.
  • WO 2005/042774 discloses a method for linking nucleotide sequences of interest, in particular cognate pairs of V H and V L encoding sequences using a multiplex molecular procedure.
  • the sequences of interest are preferably amplified and linked from isolated single cells following limited dilution or other cell separation techniques.
  • the reference discloses various ways of enriching a lymphocyte containing cell population to obtain a population of plasma cells that are particularly suitable for the multiplex molecular amplification procedure.
  • WO 2007/003041 discloses a method of isolating an antibody-secreting cell that has a high likelihood of secreting an antibody specific for a desired antigen, comprising the steps: a) providing a sample from an animal that was recently immunized with the desired antigen, wherein the sample comprises a population of antibody-secreting cells that are secreting antibodies specific for the desired antigen; b) increasing the concentration of antibody-secreting cells in the sample; and c) isolating an antibody-secreting cell.
  • the concentration of antibody-secreting cells in the sample may be increased by enriching for cells or depleting cells in the sample with surface markers.
  • the present invention focuses on methods for generating libraries of immunoglobulin encoding sequences from non-human animals and methods for generating in a few steps adapted for high throughput cloning and screening, libraries of vectors coding for chimeric antibodies comprising human constant regions and non-human variable regions.
  • the invention relates to a method of producing a library of cognate pairs comprising linked variable region encoding sequences, said method comprising:
  • This method provides a library of cognate pair antibodies or antibody fragments.
  • the invention in another aspect relates to a method of randomly linking a plurality non-contiguous nucleotide sequences of interest, wherein said nucleotide sequences of interest comprise variable region encoding sequences and the linkage generates a combinatorial library of pairs of variable region encoding sequences, said method comprising:
  • This method preferably provides a combinatorial library of randomly combined heavy and light chain immunoglobulin variable domains.
  • the method provides several advantages over the alternative, viz generation of hybridoma.
  • the established cell lines When preparing hybridoma from an immunised mouse, the established cell lines will encode a repertoire of different antibody isotypes.
  • the hybridoma cell lines will subsequently have to be screened for both function (e.g. binding to a specific antigen, efficacy in neutralising a pathogen) and for the antibody isotype.
  • function e.g. binding to a specific antigen, efficacy in neutralising a pathogen
  • the antibody isotype Thus it requires a two step screening procedure to select a hybridoma with a particular antibody isotype and with a particular effect in a functional assay.
  • the antibody secreted by the selected hybridoma will need to be cloned and sequenced before it can be transferred to a producer cell line.
  • the antibody isotype is determined by the primers used for the multiplex molecular amplification, and therefore this will not need to be determined.
  • the antibody isotype may be also be determined (and can be changed) by the subsequent linking or splicing of constant domain(s) to cloned variable sequences.
  • the method of the invention provides a library of polynucleotides that can be easily sequenced and/or inserted into vectors, such as expression, transfer, display or shuttle vectors, so that once a particular antibody has been selected, it is already cloned, its sequence is already known and it can be transferred easily to an appropriate expression vector for production of antibody.
  • the cells sorted according to the provided protocol provide a source of high affinity antibodies, potentially with affinities in the picomolar range.
  • Monoclonal antibodies from hybridoma do not possess affinities in the picomolar range, and will need to be synthetically affinity matured to reach these affinities.
  • these methods further comprise assessing prior to multiplex molecular amplification that the population of lymphocyte-comprising cells comprises cells that express CD43 and CD138 antigen or MHCII and B220 antigen, preferably CD43 and CD138. Also the methods may include enriching said lymphocyte-comprising cell fraction for a lymphocyte population expressing CD43 and CD138 antigen or MHCII and B220 prior to multiplex molecular amplification.
  • the methods further comprise isolating from said lymphocyte-comprising population cells expressing CD43 and CD138 antigen or MHCII and B220 antigen prior to multiplex molecular amplification.
  • the isolated cells or subpopulation of cells are CD138 High/CD43 High or CD138 Intermediate/CD43 High relative to the lymphocyte-comprising cell fraction. More preferably the isolated cells or subpopulation of cells are CD138 High/CD43 High relative to the lymphocyte-comprising cell fraction.
  • Enrichment or isolation preferably comprises an automated sorting procedure, such as MACS or FACS.
  • Also described herein is a method for generating a vector encoding a chimeric antibody with human constant regions and non-human variable regions, said method comprising:
  • the non-human animal is a mouse.
  • the methods of the invention are named: Mouse-SymplexTM or mSymplexTM.
  • This method provides a novel method for generation of libraries of chimeric human/non-human antibodies. This is made possible by combining the multiplex molecular amplification and subsequent cloning into a vector backbone with ligation and/or splicing of human heavy and light chain constant domains.
  • the chimerisation has been the last step after hybridoma have been established and screened and the encoded antibody has been cloned. Chimerisation may affect the binding specificity and/or affinity of an antibody, and thus there is a risk that a good monoclonal mouse antibody loses its efficacy when it is chimerised into a human/mouse antibody.
  • the screening can be carried out on products that do not need to be modified further prior to preclinical and clinical development.
  • the constant human regions can be provided in a molecular amplification step or they can be provided as part of a vector-backbone, into which the variable regions are cloned following molecular amplification.
  • the method comprises a further amplification step, wherein a polynucleotide encoding a human constant light chain or a fragment thereof with an overlap capable of providing linkage to the variable light chain, is added to the PCR mixture together with a primer set capable of amplification of a construct comprising in order: a murine VH chain, a linker, a murine VL chain, and a human constant light chain.
  • the method comprises a further amplification step, wherein a polynucleotide encoding human constant heavy chain or a fragment thereof with an overlap capable of providing linkage to the variable heavy chain, is added to the PCR mixture together with a primer set capable of amplification of a construct comprising in order: a human constant heavy chain, a murine VH chain, a linker, and a murine VL chain.
  • the vectors are expression vectors enabling the expression of the antibody members of the library for subsequent screening. More preferably the expression vector is for mammalian expression.
  • the vectors of the library may be obtained by a method of the invention.
  • the light chain constant region is a kappa constant region.
  • the non-human sequences may be from rat, sheep, goat, rabbit, guinea-pig or other suitable animal for which immunisation protocols have been described, for which sequence information is available to allow the design of suitable primers, and for which suitable cell sorting techniques enable sorting of plasma cells for single cell mulitiplex molecular amplification to link cognate pairs of variable region sequences.
  • the non-human sequences are of non-human primate origin, such as cynomolgous monkey, Rhesus monkey, chimpanzee, or maqaque.
  • the non-human sequences are rodent, such as murine or rat.
  • the non-human sequences are rabbit sequences.
  • variable regions of the antibodies are cognate pairs.
  • Also described herein is a sub-library which codes for antibodies exhibiting desired binding specificities directed against a particular target, selected from a library prepared according to the invention.
  • the present invention sets out to provide further possibilities for using the amplification and linkage method disclosed in WO 2005/042774 for providing collections of antibody vectors from non-human animals. These improvements enable the cloning of human/non-human chimeric antibody encoding sequences with cognate pairs of variable regions to be fitted to a high-throughput format. This is basically achieved by providing a new starting material for the amplification and linkage processes and by providing methods for generation of libraries of chimeric human/non-human antibodies with cognate pairs of variable regions.
  • One aspect of the invention is a method of linking heavy and light chain variable sequences comprising amplifying, in a multiplex molecular amplification procedure, the relevant nucleotide sequences using a template derived from an isolated single cell, a population of isogenic cells or a population of genetically diverse cells and effecting a subsequent linkage of the amplified sequences.
  • cognate pair describes an original pair of non-contiguous nucleic acids of interest that are contained within or derived from a single cell.
  • a cognate pair comprises two variable region encoding sequences which together encode a binding protein variable domain and which gene sequences are derived from the same cell.
  • a cognate pair can for example be comprised of an antibody variable heavy chain encoding sequence associated with a variable light chain encoding sequence from the same cell, or a T cell receptor ⁇ chain encoding sequence associated with a ⁇ chain encoding sequence from the same cell.
  • a library of cognate pairs is a collection of such cognate pairs.
  • hot-start polymerase describes polymerases that are inactive or have very low activity at temperatures used for reverse transcription. Such polymerases need to be activated by high temperatures (90 to 95°C) to become functional. This is for example an advantage in single-step RT-PCR procedures, since this prohibits interference of the polymerase with the reverse transcriptase reaction.
  • isogenic population of cells describes a population of genetically identical cells.
  • an isogenic population of cells derived by clonal expansion of an isolated single cell is of interest in the present invention.
  • isolated single cell describes a cell that has been physically separated from a population of cells corresponding to "a single cell in a single vessel'.
  • a population of isolated single cells is obtained.
  • the proportion of vessels with a single cell is not necessarily 100% in order to call it a population of single cells.
  • a segment comprises nucleic acid sequences encoding a variable domain, e.g. an antibody heavy chain variable region associated with an antibody light chain variable region encoding sequence, derived from the same cell.
  • the linkage can either be achieved simultaneously with the amplification or as an immediate step following the amplification. There are no requirements to the form or functionality of the segment, it may be linear, circular, single stranded or double stranded.
  • one of the nucleic acid sequences of interest may be isolated from the segment if desired, one of the variable region encoding sequence may for example be isolated from a cognate pair segment. However, as long as the original variable regions constituting the cognate pair are not scrambled with other variable regions, they are still considered a cognate pair, although not linked together into a single segment.
  • the linkage is preferably a nucleotide phosphodiester linkage. However, linkage can also be obtained by different chemical cross linking procedures.
  • PCR polymerase chain reaction
  • LCR ligase chain reaction
  • SDA strand displacement amplification
  • NASBA nucleic acid based sequence amplification
  • multiplex PCR describes a variant of PCR in which two or more target sequences are amplified simultaneously, by including more than one set of primers in the same reaction, e.g. one primer set adapted for amplification of the heavy chain variable region and one primer set adapted for amplification of the kappa chain variable region in the same PCR reaction. Additionally or alternatively a primer set adapted for amplification of the lambda chain variable region may be combined with these primer sets.
  • multiplex RT-PCR subscribes a multiplex PCR reaction, which is preceded by a reverse transcription (RT) step.
  • the multiplex RT-PCR can either be performed as a two-step process with a separate RT step prior to the multiplex PCR, or as a single-step process where all components for both RT and multiplex PCR are combined in a single tube.
  • multiplex overlap-extension PCR and “multiplex overlap-extension RT-PCR” imply that the multiplex PCR or multiplex RT-PCR is performed utilizing a multiplex overlap-extension primer mix to amplify the target sequences, thereby enabling simultaneous amplification and linkage of the target sequences.
  • a plurality of vessels describes any object (or collection of objects) which enables the physical separation of a single cell from a population of cells.
  • This may be tubes, multiwell plates (e.g. 96-well, 384-well, microtiter plates or other multiwell plates), arrays, microarrays, microchips, gels, or a gel matrix.
  • the object is applicable for PCR amplification.
  • polyclonal protein refers to a protein composition comprising different, but homologous protein molecules, preferably selected from the immunoglobulin superfamily. Thus, each protein molecule is homologous to the other molecules of the composition, but also contains one or more stretches of variable polypeptide sequence, which is/are characterized by differences in the amino acid sequence between the individual members of the polyclonal protein.
  • polyclonal proteins include antibody or immunoglobulin molecules, T cell receptors and B cell receptors.
  • a polyclonal protein may consist of a defined subset of protein molecules, which has been defined by a common feature such as the shared binding activity towards a desired target, e.g. a polyclonal antibody exhibiting binding specificity towards a desired target antigen.
  • a population of genetically diverse cells refers to a cell population where the individual cells in the population differ among each other on the genomic level.
  • a population of genetically diverse cell is for example a population of cells derived from a donor, or a fraction of such cells, e.g. a B lymphocyte or a T lymphocyte containing cell fraction.
  • primer set is used interchangeably with the term "primer pair" and describes two or more primers which together are capable of priming the amplification of a nucleotide sequence of interest (i.e., one member of a cognate pair).
  • a primer set of the present invention may be designed to prime a family of nucleotide sequences containing variable region encoding sequences. Examples of different families are antibody kappa light chains, lambda light chains, heavy chain variable regions, and ⁇ , ⁇ , ⁇ , or ⁇ T cell receptor variable regions.
  • a primer set for the amplification of a family of nucleotide sequences containing variable region encoding sequences often constitutes a plurality of primers where several primers can be degenerate primers.
  • sequence identity is expressed as a percentage which indicates the degree of identity between to nucleic acid sequences over the length of the shortest of the two sequences. It can be calculated as (N ref - N dif ) ⁇ 100%/N ref , wherein N ref is the number of residues in the shorter of the sequences, and wherein N dif is the total number of non-identical residues in an N ref long optimally aligned match between the two sequences.
  • N ref is the number of residues in the shorter of the sequences
  • N dif is the total number of non-identical residues in an N ref long optimally aligned match between the two sequences.
  • randomly or random with respect to linkage refer to linkage of nucleotide sequences which are not derived from the same cell but are linked transversely among a population of genetically diverse cells. If the nucleotide sequences of interest are variable region encoding sequences, this will result in a combinatorial library of linked sequences. If, on the other hand, the nucleotide sequences of interest encode a non-diverse heteromeric protein the randomly linked sequences will appear similar to sequences linked from a single cell.
  • the term "template derived from an isolated single cell”, with regard to reverse transcription, relates to the nucleic acids within such an isolated cell.
  • the nucleic acids can for example be in the form of RNA, mRNA, DNA or genomic DNA.
  • the nucleic acids can either be isolated from the cell or still be with the remaining contents of the cell, where the cell is in an intact form or a lysed form.
  • CD43 refers to a mouse surface antigen known under numerous synonyms including 3E8 antigen, A630014B01Rik, B cell differentiation antigen LP-3, Cd43, CD43 antigen, Galgp, Leucocyte sialoglycoprotein, leukosialin, Leukosialin precursor, Ly48, Ly-48, Sialophorin, as well as to orthologous surface markers from other animals.
  • CD138 refers to a mouse surface antigen known under numerous synonyms including Syndecan-1, AA408134, AA409076, CD138, syn-1, Synd, Synd1, SYND1, Synd-1, Syndecan-1 precursor as well as to orthologous surface markers from other animals.
  • MHCII refers to a mouse surface antigen known under numerous synonyms including CD74 antigen, CLIP, DHLAG, H-2 class II histocompatibility antigen gamma chain, HLADG, HLA-DR-GAMMA, la antigen-associated invariant chain, la-GAMMA, Ii, MHC class II-associated invariant chain as well as to orthologous surface markers from other animals.
  • B220 refers a mouse surface antigen known under numerous synonyms including B220, Cd45, CD45, CD45 antigen, CD45R, L-CA, Leukocyte common antigen precursor, loc, Ly-5, Lymphocyte common antigen Ly-5, Lyt-4, T200 as well as to orthologous surface markers from other animals.
  • One feature of the present invention reduces the number of tubes necessary to amplify the nucleotide sequences of interest, utilizing a variant of PCR in which two or more target sequences are amplified simultaneously in the same tube, by including more than one set of primers, for example all the primers necessary to amplify variable region encoding sequences, in the same reaction.
  • this approach is known as multiplex polymerase chain reaction (multiplex PCR).
  • a further feature of the present invention is that two or more target sequences amplified by multiplex PCR are linked in close proximity to the amplification process. In particular cognate pairs of variable region encoding sequences are linked by this process.
  • One embodiment of the present invention exploits that a multiplex primer mix can be designed to work in an overlap-extension PCR procedure, resulting in a simultaneous amplification and linkage of nucleotide sequences of interest.
  • This multiplex overlap-extension PCR technique serves to reduce the number of reactions necessary to isolate and link nucleotide sequences of interest, in particular cognate pairs of linked variable regions.
  • linkage by ligation or by recombination as an alternative to linkage by multiplex overlap-extension PCR.
  • the linkage is not performed simultaneously with the multiplex PCR amplification, but as an immediate step following the amplification.
  • linkage can still be performed in the same tube as the multiplex PCR was performed in.
  • a multiplex overlap-extension PCR requires the presence of two or more primer sets (a multiplex primer mix), where at least one primer of each set is equipped with an overlap-extension tail.
  • the overlap-extension tails enable the linkage of the products generated by each of the primer sets during amplification.
  • Such a primer mix is called a multiplex overlap-extension primer mix.
  • the multiplex overlap-extension PCR differs from conventional overlap-extension PCR in that the sequences to be linked are generated simultaneously in the same tube, thereby providing immediate linkage of the target sequences during amplification, without any intermediate purification.
  • a further feature of the present invention is a reverse transcription (RT) step preceding the multiplex PCR or multiplex overlap-extension PCR amplification, utilizing a template derived from an isolated single cell or a population of isogenic cells.
  • RT reverse transcription
  • a further feature of the present invention is the use of nucleotide sequences derived from an isolated single cell or a population of isogenic cells as template for the multiplex PCR amplification.
  • RNA from a single cell is reverse transcribed into cDNA prior to the multiplex PCR.
  • genomic DNA may be used as an alternative to mRNA.
  • the present invention facilitates the generation of libraries of linked nucleic acid sequences of interest, in particular combinatorial libraries and libraries of cognate pairs of variable regions. Further, the present invention utilizes nucleic acids derived from single cells, preferably in the form of RNA that does not need to be isolated from the remaining cell contents before it can be utilized as template.
  • One embodiment of the present invention encompasses the linkage of a plurality of non-contiguous nucleotide sequences of interest.
  • the method comprises amplifying, in a multiplex PCR or multiplex RT-PCR amplification procedure, nucleotide sequences of interest using a template derived from an isolated single cell or a population of isogenic cells and effecting linkage of the amplified nucleotide sequences of interest. Further, the method comprises an optional step of performing an additional amplification of the linked products.
  • a further embodiment of the present invention encompasses a method of producing a library of cognate pairs comprising linked variable region encoding sequences.
  • the method comprises providing a lymphocyte-containing cell fraction from a donor, which is optionally enriched for a particular lymphocyte population from said cell fraction, or wherein a particular lymphocyte population has been isolated from said cell fraction. Further, a population of isolated single cells is obtained by distributing cells from the lymphocyte-containing cell fraction, or the enriched cell fraction, individually among a plurality of vessels. Multiplex molecular amplification (multiplex RT-PCR amplification) of the variable region encoding sequences contained in the population of isolated single cells is performed and linkage of pairs of variable region encoding sequences is effected, wherein an individual pair of variable region sequences is derived from a single cell, within the population of isolated single cells.
  • a lymphocyte-containing cell fraction from a donor, which is optionally enriched for a particular lymphocyte population from said cell fraction, or wherein a particular lymphocyte population has been isolated from said cell fraction.
  • a population of isolated single cells is obtained by distributing cells from the lymphocyte-containing cell fraction, or the enriched cell
  • the technique comprises two optional steps: in the first step the individual isolated single cell in the population of single cells is expanded to a population of isogenic cells prior to performing multiplex RT-PCR amplification, thereby obtaining a plurality of vessels with a diverse population of isogenic cells (one population of isogenic cells in one vessel).
  • the second optional step encompasses performing an additional amplification of the linked variable region encoding sequences.
  • an individual member of said library of cognate pairs comprised of an immunoglobulin light chain variable region encoding sequence is associated with an immunoglobulin heavy chain variable region encoding sequence, originating from the same cell or of sequences encoding a T cell receptor binding domain, constituted of an alpha chain variable region associated with a beta chain variable region or a gamma chain variable region associated with a delta chain variable region, where the associated variable regions originate from the same cell.
  • the multiplex RT-PCR amplification of the present invention can be performed either as a two-step process, where reverse transcription (RT) is performed separate from the multiplex PCR amplification (or alternative multiplex molecular amplification), or as a single-step process, where the RT and multiplex PCR amplification steps are performed with the same primers in one tube.
  • RT reverse transcription
  • RT reverse transcription
  • Primers which can be utilized for the reverse transcription are for example oligo-dT primers, random hexamers, random decamers, other random primers, or primers that are specific for the nucleotide sequences of interest.
  • the two-step multiplex RT-PCR amplification procedure allows for the cDNA generated in the RT step to be distributed to more than one vessel, permitting the storage of a template fraction before proceeding with the amplification. Additionally, the distribution of cDNA to more than one tube allows for the performance of more than one multiplex PCR amplification of nucleic acid derived from the same template. Although this results in an increased number of separate reactions, it opens for the possibility to decrease the complexity of the multiplex primer mix if this should be desired.
  • This two-step approach can for example be applied to amplify and link heavy chain variable region and kappa light chain variable region encoding sequences in one tube and heavy chain variable region and lambda light chain variable region encoding sequences in a different tube utilizing the same template.
  • a single cell usually only expresses one of the light chains. However, it will often be easier to perform the reactions simultaneously instead of awaiting the result of one of the reactions before performing the other. Further, the amplification of both kappa and lambda serves as an internal negative control, since it would be expected that only kappa or lambda amplify from a single cell.
  • the composition needed for the single-step multiplex RT-PCR comprises a nucleic acid template, an enzyme with reverse transcriptase activity, an enzyme with DNA polymerase activity, deoxynucleoside triphosphate mix (dNTP mix comprising dATP, dCTP, dGTP and dTTP) and a multiplex primer mix.
  • the nucleic acid template is preferably total RNA or mRNA derived from an isolated single cell either in a purified form, as a lysate of the cell or still within the intact cell.
  • the exact composition of the reaction mixture requires some optimization for each multiplex primer mixture to be used with the present invention. This applies both for the two-step and the single-step multiplex RT-PCR procedures.
  • the nucleotide sequences of interest, amplified by the multiplex RT-PCR can be linked to one another by several methods, such as multiplex overlap-extension RT-PCR, ligation or recombination, using different multiplex primer mixes.
  • the multiplex RT-PCR amplification and linkage process is a single step or a two step process.
  • the linkage process may also be performed as a multi step process, using for example a stuffer fragment to link the nucleic acid sequences of interest, either with PCR, ligation or recombination.
  • a stuffer fragment may contain cis-elements, promoter elements or a relevant coding sequence or recognition sequence.
  • the linkage process is performed in the same vessel as the multiplex RT-PCR amplification.
  • the linkage of a plurality of non-contiguous nucleotide sequences of interest is performed in association with the multiplex PCR amplification, utilizing a multiplex overlap-extension primer mix.
  • the composition needed for the multiplex overlap-extension PCR comprises a nucleic acid template, an enzyme with DNA polymerase activity, deoxynucleoside triphospate mix (dNTP mix comprising dATP, dCTP, dGTP and dTTP) and a multiplex overlap-extension primer mix.
  • the linkage of a plurality of non-contiguous nucleotide sequences of interest is performed by multiplex overlap-extension RT-PCR using a template derived from an isolated single cell or a population of isogenic cells.
  • the method comprises an optional step of performing an additional molecular amplification of linked products.
  • the multiplex overlap-extension RT-PCR is performed as a single-step/one-tube reaction.
  • a multiplex overlap-extension primer mix of the present invention comprises at least two primer sets capable of priming the amplification and linkage of at least two variable region encoding sequences, for example, amplification and linkage of sequences from immunoglobulin heavy chain variable region families with kappa or lambda light chain variable region families, or amplification and linkage of sequences from T cell receptor families ⁇ , ⁇ , ⁇ , or ⁇ .
  • the plurality of nucleotide sequences of interest, amplified by multiplex RT-PCR are linked by ligation.
  • the multiplex primer mix used for the multiplex RT-PCR is designed such that the amplified target sequences can be cleaved with appropriate restriction enzymes, and covalent linkage by DNA ligation can be performed (the primer design is described in the section "Primer Mixtures and Design” ) .
  • the restriction enzymes needed to form compatible ends of the target sequences are added to the mixture together with the ligase. No purification of the PCR products is needed prior to this step, although purification may be performed.
  • the reaction temperature for the combined restriction cleavage and ligation is approximately between 0 and 40°C. However, if the polymerase from the multiplex PCR reaction is still present in the mixture, an incubation temperature below room temperature is preferred, most preferred are temperatures between 4 and 16°C.
  • the plurality of nucleotide sequences of interest, amplified by multiplex RT-PCR are linked by recombination.
  • the target sequences amplified can be joined using identical recombination sites.
  • Linkage is then performed by adding the recombinases facilitating recombination.
  • Some suitable recombinase systems are Flp recombinase with a variety of FRT sites, Cre recombinase with a variety of lox sites, integrase ⁇ C31 which carries out recombination between the attP site and the attB site, the ⁇ -recombinase-six system as well as the Gin-gix system.
  • Linkage by recombination has been exemplified for two nucleotide sequences (V H linked with V L ) ( Chapal, N. et al. 1997 BioTechniques 23, 518-524 ).
  • the nucleotide sequences of interest comprise variable region encoding sequences and the linkage generates a cognate pair of variable region encoding sequences.
  • a cognate pair may comprise one or more constant region encoding sequences in addition to the variable regions.
  • the constant regions are of human origin and the variable region cognate pair is of different origin, such as mouse, rat, or rabbit.
  • the nucleotide sequences of interest comprise immunoglobulin variable region encoding sequences and the linkage generates a cognate pair of light chain variable region and heavy chain variable region encoding sequences.
  • a cognate pair may comprise one or more constant region encoding sequences in addition to the variable regions.
  • such a cognate pair may be isolated from template derived from cells of the B-lymphocyte lineage enriched from a lymphocyte-containing cell fraction, such as whole blood, mononuclear cells or white blood cells.
  • the nucleotide sequences of interest comprise TcR variable region encoding sequences and the linkage generates a cognate pair of ⁇ chain variable region and ⁇ chain variable region encoding sequences or ⁇ chain variable region and ⁇ chain variable region encoding sequences.
  • a cognate pair may comprise one or more constant region encoding sequences in addition to the variable regions.
  • such a cognate pair may be isolated from template derived from cells of the T-lymphocyte lineage enriched from a lymphocyte-containing cell fraction, such as whole blood, mononuclear cells or white blood cells.
  • Another aspect of the present invention is to utilize the multiplex RT-PCR with a population of genetically diverse cells as template source.
  • the majority of heteromeric protein encoding sequences do not vary from cell to cell as is the case with variable region encoding sequences from binding proteins.
  • variable region encoding sequences from binding proteins.
  • a plurality of non-contiguous nucleotide sequences of interest are linked randomly by a method comprising performing multiplex RT-PCR amplification of nucleotide sequences of interest using a template derived from a population of genetically diverse cells and effecting linkage of the amplified nucleotide sequences of interest. Further, the method comprises an optional step of performing an additional amplification of the linked products. As with the single cell approach the linkage can either be performed utilizing a multiplex overlap-extension primer mix for the amplification or alternatively by ligation or recombination.
  • the template derived from the population of cells is not strictly contained within the cells. The population of cells may for example be lysed.
  • the population of cells constitutes cells that express variable region binding proteins, such as B lymphocytes, T lymphocytes, hybridoma cells, plasma cells, plasmablasts, or a mixture of these cells.
  • the population of cells in the above mentioned embodiment can for example be permeabilized or lysed, without additional purification, or the template nucleic acids can be isolated from the cells by standard procedures.
  • the single-step multiplex RT-PCR procedure is preferred. However, the two-step procedure may also be used in the embodiment.
  • An efficient way to increase the specificity, sensitivity, and yield of the multiplex RT-PCR-linkage process is by performing an additional molecular amplification of the linked nucleotide sequences obtained from the multiplex RT-PCR followed by linkage by ligation or recombination or linkage using the multiplex overlap-extension RT-PCR.
  • This additional amplification is preferably performed with PCR amplification, utilizing a primer mix adapted for amplifying the linked nucleic acid sequences of interest.
  • the primer mix utilized may be the outer primers of the multiplex primer mix or multiplex overlap-extension primer mix, meaning the primers which anneal to the outermost 5' end and 3' end of the sense strand of the linked variable region encoding sequences, thereby enabling the amplification of the entire linked product.
  • the outer primers can also be described as the primers of the multiplex overlap-extension primer mixture that do not contain overlap extension tails.
  • a nested or semi-nested primer set can be used for the additional amplification of the linked nucleotide sequences. Such a nested PCR especially serves to increase the specificity of the method as well as to increase the amount of linked product.
  • semi-nested PCR (as described in the section entitled Primer Mixtures and Design) is considered to function as well as the nested PCR.
  • the additional amplification can either be performed directly using a fraction or the entire multiplex overlap-extension RT-PCR reaction product or ligation product or recombination product, or a fraction of any one of these products or using partially purified linked products from any one of these reactions, e.g. by performing an agarose gel electrophoresis of the linked products, and excising the fragment corresponding to the expected size of the linked variable region encoding sequences.
  • the additional amplification is preferably performed directly on a fraction from the multiplex overlap-extension RT-PCR reaction, since this would assist linkage of the individual target sequences that were not linked in the first reaction.
  • the nucleotide sequences of interest of the present invention can be selected from sequences that encode different subunits or domains, which when expressed, form a protein or part of a protein.
  • proteins that are composed of at least two non-identical subunits are known as heteromeric proteins.
  • Heteromeric proteins are common in all kinds of species. Some of the classes to which such proteins belong are for example enzymes, inhibitors, structural proteins, toxins, channel proteins, G-proteins, receptor proteins, immunoglobulin superfamily proteins, transportation proteins etc.
  • the nucleotide sequences encoding such heteromeric proteins are non-contiguous, meaning for example that they originate from different genes, or different mRNA molecules. However, non-contiguous as used in the present invention may also mean nucleotide sequences encoding domains of the same protein, where the domains are separated by nucleotide sequences which are not of interest.
  • the nucleotide sequences of interest contain variable region encoding sequences from the immunoglobulin superfamily, such as immunoglobulins (antibodies), B cell receptors and T cell receptors (TcRs).
  • variable region encoding sequences from immunoglobulins are of interest.
  • Such variable region encoding sequences comprise full-length antibodies as well as Fab's, Fv's, scFv's and combinations of fragments of the variable region encoding sequences, e.g. complementarity determining regions (CDR's), joining genes or V-genes or combinations of these.
  • CDR's complementarity determining regions
  • the present application enables the linkage of only the variable domains of the heavy and light chains generating Fv or scFv encoding sequences, or the linkage of the entire light chain with the heavy chain variable region + constant region domain C H1 + parts of the hinge region, generating Fab, Fab' or F(ab) 2 . Further, it is possible to add any region of the heavy chain constant region domains to the variable heavy chain, thereby generating full-length antibody encoding sequences or truncated antibody encoding sequences.
  • the non-human variable sequences are linked to human constant regions to generate full chimeric human/non-human antibodies, preferably chimeric antibodies with human constant regions.
  • variable region encoding sequences comprise one type of immunoglobulin light chain (kappa or lambda) encoding sequence and one immunoglobulin heavy chain variable region encoding sequence. This is achieved by selecting primers amplifying only one isotype of light and heavy chain.
  • the isotype can also be determined by linking or splicing human constant regions from one or more particular isotypes of heavy and light chain.
  • TcR T cell receptors
  • Such TcR encoding sequences comprise encoding sequences for full-length alpha and beta chains or gamma and delta chains as well as soluble TcRs or only the variable domains of these chains or single chain fusion proteins thereof (e.g. single chain ⁇ or single chain ⁇ ).
  • One feature of the present invention is the ability to link nucleotide sequences derived from an isolated single cell, a population of isogenic cells, or a genetically diverse population of cells which have not been separated into single vessels.
  • a preferred feature of the present invention is the use of isolated single cells or a population of isogenic cells as template source, since scrambling of the nucleic acid sequences of interest, in particular variable region encoding sequences is avoided. This is of importance if one wishes to obtain an original pair of for example variable region encoding sequences.
  • Another preferred feature of the present invention is obtaining a single cell or population of single cells from a cell fraction comprising lymphocytes, such as B lymphocytes, T lymphocytes, plasma cells and/or various developmental stages of these cell lineages. Other populations of cells that express binding proteins from the immunoglobulin superfamily might also be used to obtain single cells.
  • Cell lines such as hybridoma cells, cell lines of B lymphocyte or T lymphocyte lineage or virus immortalized cell lines or donor derived cells participating in the immune response are also applicable in the present invention.
  • Donor derived lymphocyte-containing cell fractions may be obtained from natural tissue or fluid which is rich in such cells, e.g.
  • Donors can either be na ⁇ ve or hyperimmune with respect to a desired target.
  • hyperimmune donors are preferred.
  • Such hyperimmune donors can either be donors immunized with the target, or fragments of the target, or can be convalescent patients, or non-healthy individuals with a natural immune response towards the target, e.g. autoimmune patients, cancer patients, patients with infectious diseases e.g.
  • the donor is a non-human animal that has been immunised with a human self-antigen, such as a human protein implicated in cancer, such as EGFR.
  • cell donors may be of the same species as the species to be treated with the products obtainable from the linked nucleotide sequences of the present invention.
  • a cell donor is a domestic animal, a pet, or a human.
  • a special feature of the present invention is that it enables the generation of chimeric human/non-human antibody libraries for generation of chimeric antibodies for use in human therapy. Such an approach is preferred when the antibodies are directed against so-called self-antigens, i.e. human antigens.
  • the donor may also be a transgenic animal, in particular a transgenic mouse.
  • Transgenic animals carrying human immunoglobulin loci are described in U.S. Patent No. 6,111,166 and Kuroiwa, Y. et al. Nature Biotechnology; 2002; 20: 889-893 .
  • Such transgenic animals are capable of producing human immunoglobulins.
  • fully human antibodies against a specific target can be raised by usual immunization techniques of such transgenic animals. This allows for generation of libraries encoding binding proteins with specificities towards more difficult targets such as human antigens to which no or limited natural human antibody response exists.
  • Such transgenic animals can likewise be developed to produce human T cell receptors.
  • the lymphocyte-containing cell fraction is constituted of whole blood, bone marrow, mononuclear cells, or white blood cells obtained from a donor.
  • Mononuclear cells can be isolated from blood, bone marrow, lymph nodes, spleen, infiltrations around cancer cells and inflammatory infiltrations.
  • Mononuclear cells can be isolated by density centrifugation techniques, e.g. Ficoll gradients. If the mononuclear cells are isolated from samples composed of tissue, the tissue is disintegrated before the gradient centrifugation is performed. Disintegration can be performed, for example, by mechanical methods such as grinding, electroporation and/or by chemical methods such as enzymatic treatments.
  • the isolation of white blood cells can be performed directly from donors using leukopheresis.
  • Raw preparations of for example bone marrow or tissue which contains lymphocytes can also be used in the present invention. Such preparations will need to be disintegrated, for example as described above, in order to facilitate single cell distribution.
  • a further feature of the present invention is enrichment of the lymphocyte-containing cell fraction e.g. whole blood, mononuclear cells, white blood cells or bone marrow, with respect to a particular lymphocyte population, such as cells from the B lymphocyte or T lymphocyte lineage.
  • Enrichment of B lymphocytes can for example be performed using magnetic bead cell sorting (MACS) or fluorescence activated cell sorting (FACS) taking advantage of lineage-specific cell surface marker proteins such as CD19 or other B cell lineage-specific markers such as B220.
  • Enrichment of T lymphocytes can for example be performed utilizing a cell surface marker such as CD3 or other T cell lineage-specific markers.
  • a preferred feature of the present invention is to sort the enriched B lymphocytes further in order to acquire plasma cells, before distributing the cells individually among a plurality of vessels.
  • Isolation of plasma cells is generally performed by MACS sorting or FACS sorting, utilizing surface markers such as CD19.
  • surface markers such as CD19.
  • Other plasma cell-specific surface markers or combinations thereof can be utilized as well, for example CD138, CD43, CD19, MHC-II, the exact choice of marker depending on the plasma cell source, e.g. spleen, tonsils, blood or bone marrow. Of course, the exact choice of surface markers also depends on the species from which the cells are isolated.
  • the markers used for sorting and/or selection of cells are CD43 and CD138 or MHCII and B220 or orthologs.
  • the combination of markers is CD43 and CD138, and preferably the selected cells have an intermediate or high expression of these markers relatively to the lymphocyte-comprising cell population from which they are selected or isolated. More preferably the level of expression of CD43 and CD138 is high relative to the lymphocyte-comprising cell population from which they are selected or isolated.
  • Plasma cells can also be obtained from a non-enriched lymphocyte-containing cell population obtained from any of these sources.
  • the plasma cells isolated from blood are sometimes called early plasma cells or plasmablasts. In the present invention these cells are also termed plasma cells.
  • Plasma cells are desired for the isolation of cognate pairs of immunoglobulin encoding sequences because a higher frequency of these cells produces antigen-specific antibodies that reflect the acquired immunity toward the desired antigen and most of the cells have undergone somatic hypermutation and therefore encode high-affinity antibodies. Further, the mRNA levels in plasma cells are elevated compared to the remaining B lymphocyte population, thus the reverse transcription procedure is more efficient when using single plasma cells.
  • memory B cells may be isolated from a lymphocyte containing cell fraction utilizing a cell surface marker such as CD27 and IgG.
  • An alternative feature of the present invention is selecting the enriched B lymphocytes for antigen specificity before distributing the cells among a plurality of vessels.
  • Isolation of antigen-specific B lymphocytes is performed by contacting the enriched B lymphocytes with the desired antigen or antigens enabling binding of antigen to surface exposed immunoglobulin, followed by isolation of binders. This can be done, for example, by biotinylation of the desired antigen or antigens followed by suitable cell sorting techniques.
  • Plasma cells as well as B lymphocytes, non-enriched mononuclear cells, white blood cells, whole blood, bone marrow or tissue preparations can be subjected to isolation with respect to antigen specificity if this is desired.
  • T lymphocytes e.g. CD3 positive cells
  • surface markers such as e.g. CD27
  • T lymphocytes can also be selected for MHC-antigen specificity using MHC-peptide complexes (e.g. Callan, M.F. et al. 1998. J. Exp. Med. 187, 1395-1402 ; Novak, E.J. et al. 1999. J. Clin. Invest 104, R63-R67 ).
  • a further feature of the present invention is immortalization of any of the isolated cell fractions described in the above (e.g. B lymphocytes, plasma cells, memory cells or T lymphocytes). Immortalization may for example be performed with Epstein-Barr virus ( Traggiai, E., et al., 2004. Nat Med 10, 871-875 ) prior to cell distribution. Alternatively, isolated single cells may be immortalized and expanded prior to reverse transcription. Traggiai et al., Nat Med. 2004 Aug;10(8):871-5 .
  • a further feature of the present invention is the distribution of a population of desired cells (e.g. hybridoma cells, cell lines of B lymphocyte or T lymphocyte lineage, whole blood cells, bone marrow cells, mononuclear cells, white blood cells, B lymphocytes, plasma cells, antigen-specific B lymphocytes, memory B cells, T lymphocytes, peptide/MHC-specific T lymphocytes, or memory T cells) individually into a plurality of vessels, in order to obtain a population of isolated single cells.
  • This isolation of single cells refers to the physical separation of cells from a population of cells in such a way that a single vessel contains a single cell, or a micro array, chip or gel matrix is loaded in a manner that produces single cells.
  • the cells may be distributed directly into multitudes of vessels such as arrays of single vessels by limiting dilution.
  • the single vessels utilized in the present invention are preferably those applicable in PCR (e.g. PCR tubes and 96 well or 384 well PCR plates or larger arrays of vessels). However, other vessels may also be used.
  • PCR e.g. PCR tubes and 96 well or 384 well PCR plates or larger arrays of vessels.
  • other vessels may also be used.
  • a population of single cells is obtained. Such a distribution may be performed, for example, by dispensing a volume into a single vessel that on average encompasses a cell concentration of one, 0.5 or 0.3 cell, thereby obtaining vessels that on average contain a single cell or less.
  • variable region encoding sequences may occur among the cells present in the vessel.
  • it is a minor event it will not affect the overall utility of the present invention.
  • combinations of variable region encoding sequences which do not posses the desired binding affinity and specificity will most likely not be selected and hence will be eliminated during a screening process. Therefore, minor events of scrambling will not significantly affect the final library of the present invention.
  • the enrichment, sorting and isolation procedures described in the above are performed such that the majority of the cells are kept intact. Rupture of cells during enrichment and sorting might result in scrambling of the variable region encoding sequences. However, this is not expected to be a problem since the frequency of rupture is expected to be low. Washing and possible RNAse treatment of the cells prior to distribution into single vessels will remove any RNA that has leaked during the process.
  • every vessel must contain a single cell. Rather, it indicates that a majority of the vessels contain single cells, e.g. the number of vessels with two or more cells is below 25% of the total amount of cells distributed, or even better it is below 10%.
  • a further feature of the present invention is the performance of a reverse transcription using template derived from cells distributed individually among a plurality of vessels.
  • the nucleic acids within a single cell that is to serve as template source for the RT are considered to be derived from a single cell although they have not necessarily been separated from the remaining contents of that single cell.
  • the single cells may be expanded in order to obtain a population of isogenic cells prior to reverse transcription.
  • This process yields more mRNA to be used as template, which might be important if a rare target is to be amplified and linked.
  • the cells should remain genetically identical with respect to the target gene during the expansion.
  • the isolated cells or the population of isogenic cells can either be kept intact or lysed, as long as the template for the reverse transcription is not degraded. Preferentially, the cells are lysed in order to ease the following reverse transcription and PCR amplification.
  • the disclosed multiplex overlap-extension RT-PCR method or multiplex RT-PCR followed by linkage by ligation or recombination may also be utilized on template derived from a genetically diverse population of cells which have not been separated into single vessels, but all remain together as a pool of cells.
  • This method may be used for the generation of combinatorial libraries. Such an approach will not require the distribution of single cells.
  • the cells which may be used in this approach are the same as those described for the single cell approach, for example a population (pool) of sorted B lymphocytes or T lymphocytes.
  • RNA or mRNA may be isolated from the lysate.
  • the sensitivity of the single-step multiplex overlap-extension RT-PCR of the present invention enables the use of a very low amount of template, e.g. an amount of template corresponding to the lysate of a single cell.
  • the primer mixtures of the present invention comprise at least four primers that form primer sets two by two, which are capable of amplifying at least two different target sequences of interest. Mixtures of two or more of such primer sets constitute a multiplex primer mix.
  • a multiplex mix comprises at least 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20, 30, 40, 50, 60, 70, 80, 90 100, 110, 120, 130, 140 or 150 primer sets (primer pairs).
  • an individual primer set within the multiplex primer mix may constitute several more than two primers.
  • an individual primer set comprises at least 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 25, 30, 35, 45, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 220, 240, 260, 280 or 300 primers.
  • the total number of primers in a multiplex primer mix is at least 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 25, 30, 35, 45, 50, 60, 70, 80, 90, 100, 125, 150 or 200 and at the most 225, 250, 275, 300, 325, 350, 375 or 400 primers.
  • All the primers of the present invention comprise a gene-specific region, and preferably all primers are additionally equipped with a primer tail at the 5' end of the primer, i.e. 5' noncoding sequences which are fused to the 3' end of the gene-specific primer part.
  • a primer tail is approximately from 6 to 50 nucleotides long, but it may also be longer if desired.
  • the primer tails are added to the target sequences.
  • Primer tails of the present invention are for example, cloning tails and linkage tails such as, tails adapted for linkage by ligation, tails adapted for linkage by recombination or overlap-extension tails.
  • Cloning tails may be from 6 to 20 nucleotides long or longer and comprise restriction sites and/or recombination sites, which are useful for the insertion of the linked product into an appropriate vector.
  • the primer sets of the multiplex primer mix are designed such that one part (forward or reverse primer(s)) of the first primer set is equipped with a linkage-tail containing a restriction site that upon cleavage will be compatible with a restriction site located in the linkage tail of one part of the second primer set.
  • the second part of the second primer set is equipped with a restriction site that upon cleavage will be compatible with a restriction site located in one part of the third primer set.
  • This second restriction site located in the second primer set should be non-compatible with that of the first primer set. A considerable number of target sequences can be linked by designing primer sets this way.
  • Restriction sites with a low frequency or no occurrence in the target sequences should be chosen. Further, it is preferable that compatible restriction sites are not identical, such that the site of ligation becomes cleavage-resistant for the particular restriction enzymes used. This will drive the reaction towards linkage of target sequence one with target sequence two, since linkage between identical target sequences will be cleavable by the restriction enzymes. Suitable pairs of restriction sites are, for example, Spel with Xbal (alternatively Nhel or Avrll can substitute one or both of these), Ncol with BspHI, EcoRI with Mfel or Pstl with Nsil.
  • Spel can for example be located in target sequence one
  • Xbal can be located in target sequence two
  • Ncol can be located at the other end of target sequence two
  • BspHI in target sequence three and so forth.
  • the primer sets of the multiplex primer mix can for example be designed as exemplified in the article by Chapal (1997 BioTechniques 23, 518-524 ).
  • tails adapted for overlap-extension PCR are added to at least one primer of each primer set of the multiplex primer mix, thereby generating a multiplex overlap-extension primer mixture.
  • the overlap-extension tails are typically longer, ranging from 8 to 75 nucleotides in length, and may contain restriction sites or recombination sites which allow for subsequent insertion of regulatory elements such as promoters, ribosomal binding sites, termination sequences, or linker sequences such as in a scFv.
  • the overlap-extension tail may also contain a stop codon if that is desired.
  • the complementary nucleotides represent between 60 to 85% of the overlap-extension tail.
  • type II overlap-extension tails 4 to 6 of the 5' nucleotides are complementary to the gene-specific region of the adjacent target sequence.
  • type III overlap-extension tails the entire overlap is complementary to the adjacent target sequence.
  • the type I and II overlap-extension tails are preferred when regulatory elements and the like are later to be inserted between the linked target sequences.
  • Type II overlap-extension tails are preferred if the target sequences are to be linked by a defined linker as seen with scFv.
  • Type III overlap-extension tails are preferred if the target sequences are to be linked in-frame to each other.
  • overlap-extension tails are dependent on sequence features such as length, relative GC content (GC%), presence of restriction sites, palindromes, melting temperature, the gene-specific part to which they are coupled etc.
  • the length of the overlap-extension tails should be between 8 and 75 nucleotides long, preferably they are from 15 to 40 nucleotides long. Even more preferred they are from 22 to 28 nucleotides long.
  • the use of very long overlap-extension tails (50 to 75 nucleotides) could favor the linkage of the products produced by each primer set. However, the proportion between the length of the overlap-extension tail and the gene-specific region probably will need to be adjusted when using very long overlap-extension tails.
  • the GC% preference is dependent on the length of the overlap-extension tail. Since shorter tails have a shorter area where they are complementary they need a higher GC% to strengthen the interaction than longer tails. Other principles of primer design should likewise be observed, e.g. primer dimerization and hairpin formation should be minimized. Neither shall they engage in false priming. Further, it is known that Taq DNA polymerase often adds an adenosine (A) at the 3' end of the newly synthesized DNA strand, and this can be accommodated for in overlap-extension tail design by enabling overlap-extension tails to accommodate 3' non-template A addition.
  • A adenosine
  • primers that carry the linkage tail e.g. the overlap-extension tail, tail adapted for linkage by ligation or tail adapted for linkage by recombination, defines the order and direction of linkage of the target sequences. It is not essential to the invention whether it is the forward primer(s) or reverse primer(s) of a primer set or possibly both forward and reverse primers that are equipped with the linkage tail. However, some consideration should be given to this anyway since the order and direction of the target sequences in the final product might be of relevance e.g. for the insertion of regulatory elements such as promoters and termination sequences or for the in-frame linkage of the individual target sequences.
  • the linkage tail may be added either to the reverse primer(s) or forward primer(s) of each primer set used for the PCR amplification of each target sequence.
  • the present invention exemplifies addition of overlap-extension tails and tails adapted for linkage by ligation, to the mVH and mVK forward primers of each set. This results in a linking direction of the products that is 5' to 5' (head-to-head and bi-directional).
  • linkage tails might as well be added to the reverse primer(s) of each set. This results in a linking direction of the product that is 3' to 3' (tail-to-tail and bi-directional).
  • a third option is adding the linkage tails to the reverse primer(s) of the first primer set and the forward primer(s) of the second primer set or visa versa. This results in a 3' to 5' orientation (head-to-tail and unidirectional).
  • some of the primer sets need to have linkage tails on both the forward and reverse primers, such that one tail is complementary to a tail of the preceding primer set and the other tail is complementary to one of the primers of the subsequent primer set. This principle holds for all the primer sets that amplify target sequences that are to be linked between two other target sequences.
  • the design of the gene-specific primer part generally should observe known primer design rules such as minimizing primer dimerization, hairpin formation and non-specific annealing. Further, multiple G or C nucleotides as the 3' bases are to be avoided when possible.
  • the melting temperature (Tm) of the gene-specific regions in a primer set should preferably be equal to each other plus/minus 5°C. In the present invention Tm values between 45°C and 75°C are desirable and Tm values of about 60°C are optimal for most applications.
  • the initial primer design can be aided by computer programs developed for this task. However, primer designs generally need laboratory testing and routine optimization.
  • degenerate positions within primers is a useful approach when amplifying sequences with variable regions or when searching for new family members belonging to a specified class of proteins.
  • the numbers of degenerate positions may also require optimization.
  • primer mixes composed of at least two primer sets that are able to prime amplification and promote linkage of at least two nucleotide sequences of interest.
  • the primer mixes of the present invention are capable of priming the amplification of at least two subunits or domains from heteromeric proteins, e.g. belonging to the class of enzymes, inhibitors, structural proteins, toxins, channel proteins, G-proteins, receptor proteins, immunoglobulin superfamily proteins, transportation proteins etc, preferably immunoglobulins.
  • a further feature of the present invention is a multiplex overlap-extension primer mix comprising primer sets wherein at least one primer set member of each primer set comprises an overlap-extension tail capable of hybridizing to the overlap-extension tail of a primer set member of a second primer set.
  • the overlap-extension tails enables the immediate linkage of the nucleotides of interest during the multiplex overlap-extension PCR amplification by equipping each individual product arising from the primer sets with a tail that is complementary to an adjoining product. This however does not mean that the linkage necessarily occurs during this first PCR amplification. Depending on the reaction setup, the majority of the actual linkage may be performed during an additional amplification with the outer primers of the first PCR amplification (multiplex PCR amplification).
  • a further feature of the present invention is a primer set designed to amplify a family of nucleotide sequences containing variable region encoding sequences.
  • families are kappa light chains (e.g. VK1-19 in mice), lambda light chains (e.g. VL1-8 in mice) and variable heavy chains (e.g. VH1-15 in mice) from immunoglobulins, and ⁇ , ⁇ , ⁇ or ⁇ TcR variable regions.
  • a primer set for the amplification of a family of nucleotide sequences containing variable region encoding sequences often comprises a plurality of primers where several primers can be degenerate primers.
  • Amplification of families of immunoglobulin light chain variable region encoding sequences is for example performed using a primer set comprised of a plurality of primers complementary to the 5' end of the variable region of the kappa chain (mVK primer(s)) or the kappa leader sequence and/or the lambda chain or the lambda leader sequence (forward primers) together with constant region kappa (mKappar1 primer(s)) and/or lambda primers (reverse primers) or a plurality of such primers.
  • light chain joining region primers may be used as reverse primers instead of the constant region primers.
  • forward primers annealing in the UTR region preceding the leader sequence of the variable light chain may be used.
  • families of immunoglobulin heavy chain variable region encoding sequences can be amplified with one primer set utilizing various primer combinations, for example a plurality of primers complementary to the 5' end of the heavy chain variable region (mVH primer(s)) or the leader sequence of this region (forward primers) together with a plurality of heavy chain joining region primers or heavy chain constant region primer(s) (reverse primers).
  • the mCH primer may be isotype-specific and in principle any mCH primer can be utilized, also one that would result in a full-length heavy chain.
  • a mCH primer is used that does not amplify the full length heavy chain to enable addition of a human heavy chain constant region.
  • forward primers annealing in the UTR region preceding the leader sequence of the variable heavy chain may be used.
  • One feature of the present invention is primers which anneal in the 3' end of the leader encoding sequence preceding a variable region encoding sequence, and their use for amplification of variable region encoding sequences.
  • the multiplex overlap-extension primer mix utilized for the multiplex overlap-extension PCR and possibly for the reverse transcription step as well comprises:
  • the light chain primers are adapted for amplifying both kappa and lambda light chain variable region encoding sequences.
  • the immunoglobulin mVK primers carry linkage tails, preferably in the form of complementary overlap-extension tails.
  • This generates variable region encoding sequences that are linked in a head-to-head fashion.
  • either the mKappar1 and mVH primers contain linkage tails or the mVK and mCHrev1 primers contain linkage tails, preferably in the form of complementary overlap-extension tails.
  • mCH and mKappar1 primers contain linkage tails, preferably in the form of complementary overlap-extension tails.
  • the multiplex primer mixes, including multiplex overlap-extension primer mixes, of the present invention comprise two primer sets.
  • a multiplex primer mix comprises at least four different primers.
  • a multiplex primer mix comprises more than four different primers.
  • a multiplex primer mix of the present invention is used for the amplification of target sequences in a single vessel. For example kappa, lambda and heavy chain variable regions may be amplified in the same vessel.
  • the present invention also encompasses primers for an additional PCR amplification of the linked products obtained by multiplex RT-PCR followed by linkage by ligation or recombination or by multiplex overlap-extension RT-PCR.
  • This additional PCR amplification can be performed using a primer mix adapted for amplifying the linked target sequences.
  • a primer mix may comprise the outer primers of the multiplex primer mix or multiplex overlap-extension primer mix, meaning the primers that anneal to the outermost 5 end and 3' end of the sense strand of the linked nucleotide sequences, thereby selectively enabling the amplification of the entire linked product.
  • This process generally serves to increase the amount of linked product obtained from the multiplex RT-PCR followed by linkage by ligation or recombination or from the multiplex overlap-extension RT-PCR.
  • a primer set which is nested compared to the outer primers used in the primary multiplex RT-PCR or multiplex overlap-extension RT-PCR reaction can be used for the additional amplification of the linked nucleotide sequences.
  • a primer set is termed a nested primer set.
  • the design of nested primers generally observes the same design rules as for the gene-specific primers previously described, except that they prime partly or totally 3' to the annealing position of the outer primers used in the multiplex RT-PCR or multiplex overlap-extension RT-PCR.
  • the product resulting from a nested PCR may therefore be shorter than the linked product obtained by the multiplex RT-PCR followed by linkage by ligation or recombination or by multiplex overlap-extension RT-PCR.
  • the nested PCR further serves to increase the overall specificity, especially of the multiplex overlap-extension RT-PCR technology.
  • the outer primers of the multiplex primer mix/multiplex overlap-extension primer mix can be used for the additional amplification or a semi-nested PCR can be applied as described later.
  • a mixture of J L and J H primers is used as nested primers for the additional amplification of the linked immunoglobulin variable region encoding sequences.
  • Nested primer sets of the present invention can also be comprised of a reverse (or forward) outer primer(s) from the first multiplex primer mix/multiplex overlap-extension primer mix and a second nested primer that prime 3' to the annealing position of the forward (or reverse) outer primer(s) of the first multiplex primer mix/multiplex overlap-extension primer mix.
  • the use of such a primer set for an additional PCR amplification is generally known as a semi-nested PCR.
  • Semi-nested PCR can for example be applied if it is difficult to design a nested primer in one specific region e.g. for the variable region sequences, because such a primer would have to anneal in the complementarity determining regions (CDRs). Further, semi-nested PCR can be used when it is desirable to keep one end of the linked sequences intact e.g. for cloning purposes.
  • the parameters of the multiplex overlap-extension PCR step of both the two-step and the single-step procedure can be optimized for several parameters (see, for example, Henegariu, O. et al. 1997. BioTechniques 23, 504-511 ; Markoulatos, P. et al. 2002. J. Clin. Lab. Anal. 16, 47-51 ). Generally the same optimization parameters apply for multiplex RT-PCR, although the ratio between outer and inner primers is less important for such a reaction.
  • the concentration of the primers carrying the overlap-extension tail is preferably lower than the concentration of the outer primers without overlap-extension tail (for example J H and kappa primers).
  • one of the target sequences amplifies with a lower efficiency than the others, for example as a result of a higher GC%, it may be possible to equalize the amplification efficacy. This may be done by using a higher concentration of the primer set which mediates amplification with low efficiency, or lowering the concentration of the other primers. For example, sequences encoding heavy chain variable regions tend to have a higher GC% and hence lower amplification efficiency than light chain variable regions. This points towards using V L primers at a lower concentration than the V H primers.
  • the total primer concentration might be an issue.
  • the upper limit is determined experimentally by titration experiments. For the "AmpliTaq Gold' PCR system from Applied Biosystems the upper limit was found to be 1.1 ⁇ M total oligonucleotide concentration, for other systems it may however be as high as 2.4 ⁇ M. Such an upper limit of total oligonucleotide concentration influences the maximal concentration of individual primers. If the individual primer concentration is too low it is likely to cause a poor PCR sensitivity.
  • oligonucleotide primers The quality of the oligonucleotide primers has also been found to be important for the multiplex overlap-extension PCR. HPLC-purified oligonucleotides have produced the best results.
  • the cycling conditions are as follows: Denaturation: 10-30 s 94°C Annealing: Tm 30-60 s 50-70°C Approximately 5°C below of primers. Extension: 1 min x EPL 65-72°C EPL is Expected Product Length in kb. Cycle number: 30-80 Final extension: 10 min 65-72°C
  • annealing temperature is important.
  • all the individual primer sets that are to constitute the final primer mix should be tested separately in order to identify optimal annealing temperature and time, as well as elongation and denaturing times. This will give a good idea about the window within which these parameters can be optimized for the multiplex overlap-extension primer mix.
  • a high number of thermal cycles constitutes between 35 and 80 cycles, preferably around 40 cycles.
  • Long extension times can improve the multiplex overlap-extension PCR process. Long extension times constitute 1.5-5 min x EPL compared to the normal 1 min extension.
  • Multiplex PCR reactions can be significantly improved by using a PCR additive, such as DMSO, glycerol, formamide, or betaine, which relax DNA, thus making template denaturation easier.
  • a PCR additive such as DMSO, glycerol, formamide, or betaine, which relax DNA, thus making template denaturation easier.
  • Deoxynucleoside triphosphate (dNTP) quality and concentration is important for the multiplex overlap-extension PCR.
  • the best dNTP concentration is between 200 and 400 ⁇ M of each dNTP (dATP, dCTP, dGTP and dTTP), above which the amplification is rapidly inhibited.
  • Lower dNTP concentrations (100 ⁇ M of each dNTP) suffice to achieve PCR amplification.
  • dNTP stocks are sensitive to thawing/freezing cycles. After three to five such cycles, multiplex PCR often does not work well. To avoid such problems, small aliquots of dNTP can be made and kept frozen at -20°C.
  • Mg 2+ concentration is critical since most DNA polymerases are magnesium-dependent enzymes. In addition to the DNA polymerase, the template DNA primers and dNTP's bind Mg 2+ . Therefore, the optimal Mg 2+ concentration will depend on the dNTP concentration, template DNA, and sample buffer composition. If primers and/or template DNA buffers contain chelators such as EDTA or EGTA, the apparent Mg 2+ optimum may be altered. Excessive Mg 2+ concentration stabilizes the DNA double strand and prevents complete denaturation of DNA, which reduces yield. Excessive Mg 2+ can also stabilize spurious annealing of primer to incorrect template sites, thereby decreasing specificity. On the other hand, an inadequate Mg 2+ concentration reduces the amount of product.
  • a good balance between dNTP and MgCl 2 is approximately 200 to 400 ⁇ M dNTP (of each) to 1.5 to 3 mM MgCl 2 .
  • KCI based buffers suffice for multiplex overlap-extension PCR; however, buffers based on other components such as (NH 4 ) 2 SO 4 , MgSO 4 , Tris-HCl, or combinations thereof may also be optimized to function with the multiplex overlap-extension PCR.
  • Primer pairs involved in the amplification of longer products work better at lower salt concentrations (e.g. 20 to 50 mM KCI), whereas primer pairs involved in the amplification of short products work better at higher salt concentrations (e.g. 80 to 100 mM KCl). Raising the buffer concentration to 2X instead of 1X may improve the efficiency of the multiplex reaction.
  • the present invention is exemplified with Taq polymerase.
  • other types of heat-resistant DNA polymerases including, for example, Pfu, Phusion, Pwo, Tgo, Tth, Vent, or Deep-vent may be used.
  • Polymerases without or with 3' to 5' exonuclease activity may either be used alone or in combination with each other.
  • nucleotide sequences of interest produces a nucleotide segment comprising the linked nucleotide sequences coding for variable regions of immunoglobulins.
  • libraries of such linked nucleic acid sequences are produced by the methods of the present invention, in particular libraries of non-human variable region encoding sequences linked or spliced to human constant region (heavy and light chain) sequences.
  • One feature of the present invention is the insertion of a segment containing linked nucleotide sequences of interest or a library of linked nucleotide sequences of interest, generated by a method of the present invention, into suitable vectors.
  • the libraries may be combinatorial libraries or more preferably libraries of cognate pairs of variable region encoding sequences.
  • the restriction sites generated by the outer primers, nested primers or semi-nested primers are preferably designed to match appropriate restriction sites of the vector of choice.
  • the linked nucleic acid sequences of interest can also be inserted into vectors by recombination, if one of the semi-nested, nested primers or outer primers is equipped with a suitable recombination site and the vector of choice contains one as well.
  • Vectors that can be used as carriers of the products generated by one of the multiplex RT-PCR-linkage methods of the present invention.
  • Vectors of choice may be those suitable for amplification and expression in cells including, for example, bacteria, yeast, other fungi, insect cells, plant cells, or mammalian cells. Such vectors may be used to facilitate further cloning steps, shuttling between vector systems, display of the product inserted into the vector, expression of the inserted product and/or integration into the genome of a host cell.
  • Cloning and shuttle vectors are preferably bacterial vectors. However, the other types of vectors may also be applied in cloning and shuttle procedures.
  • Display vectors can for example be phage vectors or phagemid vectors originating from the class of fd, M13, or f1 filamentous bacteriophages. Such vectors are capable of facilitating the display of a protein including, for example, a binding protein or a fragment thereof, on the surface of a filamentous bacteriophage.
  • Display vectors suitable for display on ribosomes, DNA, yeast cells or mammalian cells are also known in the art. These comprise for example viral vectors or vectors encoding chimeric proteins.
  • Expression vectors exist for all the mentioned species and the one to be chosen completely depends on the protein to be expressed. Some expression vectors are additionally capable of integrating into the genome of a host cell either by random integration or by site-specific integration, utilizing appropriate recombination sites. Expression vectors may be designed to provide additional encoding sequences that, when the linked product is inserted in-frame to these sequences, enable the expression of a larger protein, e.g. a full-length monoclonal antibody, when introduced into an appropriate host cell. This in-frame insertion may also facilitate the expression of chimeric proteins that facilitate the display on the surface of a filamentous bacteriophage or cell.
  • the linked nucleotide sequences of interest may be inserted in-frame to a sequence encoding a coat protein such as pIII or pvIII ( Barbas, C.F. et al. 1991. Proc. Natl. Acad. Sci. USA 88, 7978-7982 ; Kang, A.S. et al. 1991. Proc. Natl. Acad. Sci. USA 88, 4363-4366 ).
  • pIII or pvIII Barbas, C.F. et al. 1991. Proc. Natl. Acad. Sci. USA 88, 7978-7982 ; Kang, A.S. et al. 1991. Proc. Natl. Acad. Sci. USA 88, 4363-4366 ).
  • the individual segments of linked nucleotide sequences of interest is comprised of an immunoglobulin heavy chain variable region encoding sequence associated with a light chain variable region encoding sequence from one species, inserted into a vector that contains sequence(s) encoding one or more human immunoglobulin constant domains, preferably both human light and heavy chain constant regions.
  • the insertion is engineered such that the linked heavy chain variable region and/or light chain variable region encoding sequences are inserted in-frame to the constant region encoding sequences.
  • Such an insertion can for example generate a Fab or F(ab') 2 expression vector, a full-length antibody expression vector or an expression vector encoding a fragment of a full-length antibody.
  • such a vector is an expression vector suitable for expression (e.g. E. coli, phagemid, or mammalian vectors) and the constant region heavy chain encoding sequences are chosen from the human immunoglobulin classes IgGI, IgG2, IgG3, IgG4, IgM, IgA1, IgA2, IgD, or IgE, thereby enabling the expression of a Fab or full-length recombinant antibody.
  • the vector may also contain a constant light chain encoding sequence chosen from human lambda or kappa chains. This is preferred in the generation of chimeric antibodies as the linked nucleotide sequences in these cases only encode the immunoglobulin variable region encoding sequences (Fv's) from the non-human species.
  • the human constant region(s) is/are spliced or linked to the non-human variable regions in a step of the molecular amplification procedure, by adding to the vessels a human constant region providing an overlap with the non-human sequence and appropriate primers ensuring the amplification of both variable and constant region(s) in frame.
  • the human constant kappa or lambda chain may be added and/or a human constant heavy chain may be added.
  • the individual segments of the linked nucleotide sequences are comprised of a TcR ⁇ chain variable region encoding sequence associated with a ⁇ chain variable region encoding sequence or a ⁇ chain variable region encoding sequence associated with a ⁇ chain variable region encoding sequence.
  • these linked sequences are inserted into a vector that contains sequences encoding one or more TcR constant domains. The insertion is engineered such that the inserted linked variable region encoding sequences are in-frame to the corresponding TcR constant region encoding sequences.
  • such a vector is a chimeric expression vector comprising sequences that encode a leucine zipper in-frame to the TcR constant regions. It has been shown that such constructs increase stability of soluble TcRs ( Willcox, B.E. et al. 1999. Protein Sci 8, 2418-2423 ).
  • Libraries of cognate pairs of the present invention may be introduced into vectors by two different approaches.
  • the single cognate pairs are inserted individually into a suitable vector. This library of vectors may then either be kept separate or be pooled.
  • the second approach all the cognate pairs are pooled prior to vector insertion, followed by in-mass insertion into suitable vectors generating a pooled library of vectors.
  • Such a library of vectors comprises a large diversity of pairs of variable region encoding sequences.
  • One aspect of the present invention is a library of antibodies with cognate pairs of linked variable region encoding sequences.
  • the individual antibodies of the library comprise an immunoglobulin light chain variable region encoding sequence associated with a heavy chain variable region encoding sequence from one species and human constant regions.
  • Another preferred library of cognate pairs comprises linked TcR region encoding sequences, where each individual TcR region encoding sequence comprises an alpha chain variable region encoding sequence associated with a beta chain variable region encoding sequence and/or a TcR gamma chain variable region encoding sequence associated with a delta chain variable region encoding sequence.
  • An embodiment of the present invention is a sub-library of cognate pairs of linked variable region encoding sequences which encode desired binding specificities directed against a particular target.
  • these cognate pairs comprise linked immunoglobulin light chain variable region and heavy chain variable region encoding sequences, TcR alpha chain variable region and beta chain variable region encoding sequences and/or TcR gamma chain variable region and delta chain variable region encoding sequences.
  • a further embodiment is a sub-library selected from a parent library of cognate pairs of variable region encoding sequences as described throughout the invention.
  • a preferred embodiment of the present invention is a library or sub-library encoding cognate pairs of full-length chimeric immunoglobulins selected from human immunoglobulin classes IgA1, IgA2, IgD, IgE, IgG1, IgG2, IgG3, IgG4, or IgM.
  • Another preferred feature of the present invention is a library or sub-library encoding soluble and stable cognate pairs of TcRs.
  • a feature of the present invention is the diversity of said libraries, which are comprised of at least 5, 10, 20, 50, 100, 1000, 10 4 , 10 5 or 10 6 different cognate pair antibodies.
  • said libraries of cognate pairs of linked variable region encoding sequences are obtainable by a method comprising the steps described herein.
  • This library is also termed the parent library.
  • the parent library of pairs of linked variable region encoding sequences isolated from a donor, utilizing one of the methods of the present invention, is expected to represent a diversity of binding proteins of which some will be irrelevant, i.e. not binding to a desired target, in particular for combinatorial libraries. Therefore, the present invention encompasses enrichment and screening for a sub-library encoding a subset of diversities of binding specificities directed against a particular target.
  • the diversity of the library is expected to represent the diversity present in the donor material, with only a minor number of randomly linked variable regions. Thus, an enrichment step may not be necessary prior to the screening for target-specific binding affinities in a library composed of cognate pairs.
  • the method of generating a library of pairs of linked variable region encoding sequences further comprises creating a sub-library by selecting a subset of pairs of linked variable region sequences that encode binding proteins with a desired target specificity. Such a selection of linked variable region encoding sequences is also termed a library of target-specific cognate pairs.
  • the library of target-specific cognate pairs of variable region encoding sequences is transferred to an expression vector.
  • the expression vector may be a mammalian expression vector, a yeast expression vector, a fungus expression vector, a plant expression vector, or a bacterial expression vector depending on the type of cell used for screening.
  • the expression vector is mammalian.
  • Immunological assays are generally suitable for the selection of target-specific immunoglobulin variable region encoding sequences.
  • Such assays are well know in the art and constitute for example FMAT, FLISA, ELISPOT, ELISA, membrane assays (e.g. Western blots), arrays on filters, or FACS.
  • the assays can either be performed in a direct manner, utilizing the polypeptides produced from the immunoglobulin variable region encoding sequences, or alternatively, the immunoassays can be performed in combination with or following enrichment methods such as phage display, ribosome display, bacterial surface display, yeast display, eukaryotic virus display, RNA display or covalent display (reviewed in FitzGerald, K., 2000.
  • a special feature of the invention is that it enables the use of various types of functional screening to select antibody secreting clones with desired properties.
  • Such screening assays include but are not limited to proliferation assays, virus inactivation assays, cell killing assays, etc.
  • the functional assays can be carried out in high-throughput format using supernatants from cells transfected with expression vectors of the invention.
  • the selection of a sub-library of target-specific cognate pairs or combinatorial pairs of variable region encoding sequences is performed by using a high-throughput screening assay.
  • High-throughput screening assays could be, but are not restricted to, ELISA assays performed with semi-automated or fully automated equipment. It could also be a membrane assay in which bacteria are robotically picked and gridded onto an appropriate membrane on top of agar plates generating arrays of colonies expressing antigen-binding molecules. The molecules are secreted through the membrane onto a second underlying antigen-coated membrane which can be developed separately and used to identify clones that secrete antigen binding molecules towards the desired target ( de Wildt, R.M., et al. 2000. Nat. Biotechnol. 18, 989-994 ).
  • DNA sequencing will provide information about the library diversity such as germline origin, family distribution and maturation within the CDR regions. Such an analysis will enable the selection of clones which represent a broad diversity, and leaving out repeated clones. Secondly, DNA sequencing will reveal mutations introduced during the isolation process.
  • variable region encoding sequences there are three types of mutations to consider when assessing whether a mutation is acceptable: i) The most frequent type of mutations result from cross-priming, where a V gene primer due to sequence similarities primes a germline sequence to which it is not totally homologous.
  • the changes introduced are mainly substitutions of naturally occurring codons at one particular position. Due to the high degree of sequence homology between V-gene sequences. Some of these changes may be significant, sometimes with no natural counterpart. Such changes could potentially affect the immunogenicity of the variable region by creating new epitopes.
  • the sub-library of target-specific and possibly sequence analysed pairs of linked immunoglobulin light chain variable region and heavy chain variable region encoding sequences are transferred to a mammalian expression vector.
  • a mammalian expression vector Such a transfer can be performed into any of the vectors described in the previous section, enabling the expression of a full-length recombinant antibody. If the screening is performed with a mammalian cognate full-length antibody expression library such a transfer may not be needed.
  • the parent library is generated from a lymphocyte-containing cell fraction which is enriched for T lymphocytes.
  • the pairs of linked variable region encoding sequences constituting the parent library may be selected for encoding a subset of pairs of linked variable region sequences composed of alpha and beta and/or gamma and delta chains that encode binding proteins with a desired target specificity, generating a sub-library of cognate pairs or combinatorial pairs.
  • Antigen-specific T cell receptors can subsequently be identified from a pool of transfected cells using standard methodology such as staining with tetrameric MHC-peptide complexes (e.g., Callan, M.F. et al. 1998. J. Exp. Med.
  • the libraries of the present invention can be transferred to vectors suitable for expression and production of proteins encoded from the linked nucleic acid sequences of interest, in particular variable region containing binding proteins or fragments thereof.
  • vectors suitable for expression and production of proteins encoded from the linked nucleic acid sequences of interest in particular variable region containing binding proteins or fragments thereof.
  • Such vectors are described in the Vectors and Libraries section, and provide for the expression of for example full-length antibodies, Fab fragments, Fv fragments, scFv, membrane bound or soluble TcRs or TcR fragments of a species of choice.
  • One feature of the present invention is the introduction into a host cell of a library or a sub-library of vectors of cognate pairs of linked variable region encoding sequences or a single clone encoding a cognate pair of linked variable region encoding sequences, for amplification and/or expression.
  • Host cells can be chosen from bacteria, yeast, other fungi, insect cells, plant cells, or mammalian cells.
  • mammalian cells such as Chinese hamster ovary (CHO) cells, COS cells, BHK cells, myeloma cells (e.g., Sp2/0 cells, NS0), NIH 3T3, fibroblast or immortalized human cells such as HeLa cells, HEK 293 cells, or PER.C6 are preferred.
  • vectors into host cells may be accomplished by a number of transformation or transfection methods known to those skilled in the art, including calcium phosphate precipitation, electroporation, microinjection, liposome fusion, RBC ghost fusion, protoplast fusion, viral infection and the like.
  • transformation or transfection methods including calcium phosphate precipitation, electroporation, microinjection, liposome fusion, RBC ghost fusion, protoplast fusion, viral infection and the like.
  • monoclonal full-length antibodies, Fab fragments, Fv fragments and scFv fragments is well known.
  • a recombinant polyclonal manufacturing technology has been described in PCT application WO 2004/061104 .
  • this technology involves the generation of a collection of cells, suitable as a manufacturing cell line.
  • the following description of the technique is made for a library of cognate pairs, it is however just as applicable for a combinatorial library.
  • the individual cells in the collection of cells are capable of expressing a distinct member of the recombinant polyclonal binding protein, for example from a library of cognate pairs.
  • the nucleic acid sequences encoding the cognate pairs are introduced into a single site-specific site in the genome of each individual cell. This is an important feature of the collection of cells, since this prevents scrambling of the heavy and light chains expressed from each cell, but also because it generates cells that are virtually identical to one another, except for the small differences in the variable regions of the individual cognate pairs. This trait will enable an unbiased growth of the collection of cells over the period of time necessary for the production.
  • a host cell line with only one integration site should be used, these are commercially available, e.g.
  • CHO Flp-In cells containing a single FRT site containing a single FRT site.
  • Appropriate vectors for this cell line contain a corresponding FRT site and are introduced into the genome using the Flp recombinase.
  • Flp recombinase There are several other known recombinases e.g. Cre, beta-recombinase, Gin, Pin, PinB, PinD, R/RS, lambda integrase, or phage ⁇ C31 integrase that can be used in combination with their corresponding recombination sites.
  • appropriate vectors contain a selection marker that enables the selection of site-specific integrants.
  • the generation of a polyclonal manufacturing cell line and the production of a recombinant polyclonal protein from such a cell line can be obtained by several different transfection and manufacturing strategies.
  • One way is to use a library of vectors mixed together into a single composition, for the transfection of a host cell line with a single integration site per cell. This method is termed bulk transfection or transfection in bulk.
  • This method is termed bulk transfection or transfection in bulk.
  • the vector and host cell design previously described will ensure that a polyclonal cell line capable of unbiased growth will be obtained upon appropriate selection.
  • a frozen stock of the polyclonal cell line will be generated before initiation of the recombinant polyclonal protein manufacturing.
  • Another way is to use a library of vectors split into fractions, containing approximately 5 to 50 individual vectors of the library in a composition, for transfection.
  • a fraction of the library constitutes 10 to 20 individual vectors.
  • Each composition is then transfected into an aliquot of host cells. This method is termed semi-bulk transfection.
  • the number of aliquots transfected will depend on the size of the library and the number of individual vectors in each fraction. If the library for example constitutes 100 distinct cognate pairs, which are split into fractions containing 20 distinct members in a composition, 5 aliquots of host cells would need to be transfected with a library composition constituting a distinct fraction of the original library. The aliquots of host cells are selected for site-specific integration.
  • the distinct aliquots are selected separately. However, they can also be pooled before selection.
  • the aliquots can be analyzed for their clonal diversity and only those with sufficient diversity will be used to generate a polyclonal cognate pair library stock.
  • the aliquots can be mixed before generating the freezing stock, immediately after they have been retrieved from the stock or after a short proliferation and adaptation time.
  • the aliquots of cells are kept separate throughout production, and the polyclonal protein composition is assembled by combining the products of each aliquot rather than the aliquots of cells before production.
  • a third way is a high throughput method in which host cells are transfected separately using the individual vectors constituting the library of cognate pairs.
  • This method is termed individual transfection.
  • the individually transfected host cells are preferably selected for site specific integration separately.
  • the individual cell clones generated upon selection may be analyzed with respect to proliferation time and preferably, those with similar growth rates are used to generate a polyclonal cognate pair library stock.
  • the individual cell clones can be mixed to obtain the desired polyclonal cell line before generating the stock, immediately after they have been retrieved from the stock, or after a short proliferation and adaptation time. This approach may eliminate any possible residual sequence bias during transfection, integration and selection.
  • the individually transfected host cells are mixed before selection is performed, this will enable control of sequence bias due to transfection.
  • a shared feature in the manufacturing strategies outlined in the above is that all the individual cognate pairs constituting the recombinant polyclonal protein can be produced in one, or a limited number of bioreactors. The only difference is the stage at which one chooses to generate the collection of cells that constitutes the polyclonal manufacturing cell line.
  • One embodiment of the present invention is a population of host cells comprising a cognate library or sub-library of linked pairs of variable region encoding sequences.
  • a population of host cells comprises a library obtained from a population of isolated single cells constituting lymphocytes, utilizing the multiplex RT-PCR amplification followed by linkage by ligation or recombination or the multiplex overlap-extension RT-PCR technology of the present invention, to link the cognate pairs.
  • Another embodiment of the present invention is a population of host cells comprising a combinatorial library or sub-library of linked pairs of variable region encoding sequences.
  • a population of host cells according to the present invention will encompass a diverse population of cells corresponding to the diversity of the library the cells have been transformed/transfected with.
  • each cell of the population of cells only constitutes one cognate pair of the entire library of cognate pairs, and no individual member of the library of cognate pairs exceeds more than 50%, more preferred 25%, or most preferred 10%, of the total number of individual members expressed from the population of host cells.
  • the population of host cells is mammalian cells.
  • a population of host cells as described in the above can be utilized for the expression of a recombinant polyclonal binding protein, since individual cells of the population constitute variable region encoding sequences of different diversity.
  • One embodiment of the present invention is a recombinant polyclonal protein expressed from a population of host cells comprising a library of vectors encoding diverse cognate pairs of linked variable region encoding sequences, where such a library is obtainable by the method of the present invention.
  • a recombinant polyclonal protein of the present invention is comprised of at least 2, 5, 10, 20, 50, 100, 1000, 10 4 , 10 5 or 10 6 proteins composed of different cognate pairs.
  • a preferred embodiment of the present invention is a recombinant polyclonal immunoglobulin expressed from a population of host cells comprising a library of vectors encoding diverse cognate pairs of heavy chain variable region and light chain variable region encoding sequences.
  • Another preferred embodiment of the present invention is a recombinant polyclonal TcR expressed from a population of host cells comprising a library of vectors encoding diverse cognate pairs of TcR alpha chain variable region linked with beta chain variable region encoding sequences and/or TcR gamma chain variable region linked with delta chain variable region encoding sequences.
  • Another embodiment of the present invention is a host cell suitable for production of a monoclonal protein.
  • a monoclonal antibody comprised of a cognate pair of a light chain variable region with a heavy chain variable region or a monoclonal TcR comprised of a cognate pair of an alpha variable region with a beta variable region or a delta variable region with a gamma variable region.
  • a monoclonal production cell line is not a hybridoma cell line.
  • Such a monoclonal antibody or TcR can be generated by adding the following steps to the method of linking a plurality of non-contiguous nucleotide sequences of interest a) inserting said linked nucleic acid sequences into a vector; b) introducing said vector into a host cell; c) cultivating said host cells under conditions suitable for expression; and d) obtaining the protein product expressed from the vector inserted into said host cell.
  • the vector introduced into the host cell encodes an individual cognate pair of variable region encoding sequences.
  • variable region encoding sequences especially immunoglobulin heavy and light chain variable region encoding sequences or TcR alpha and beta chain or gamma and delta chain variable region encoding sequences, by a high-throughput method for the generation of libraries of cognate pairs.
  • the multiplex RT-PCR followed by linkage by ligation or recombination or the multiplex overlap-extension RT-PCR techniques of the present invention may be utilized in the generation of combinatorial libraries of chimeric human/non-human antibodies by performing the technique on a population of genetically diverse cells, cell lysates from such a population of cells, or on RNA purified from such a population of cells.
  • the libraries, sub-libraries, or single clones from one of these libraries facilitate the expression of polyclonal or monoclonal proteins. Especially monoclonal or polyclonal antibodies may be obtained from the libraries of the present invention.
  • recombinant monoclonal antibodies in diagnostics, treatment, and prophylaxis
  • Recombinant monoclonal and polyclonal antibodies generated by the present invention will have the same applications as antibody products generated by existing technologies.
  • a pharmaceutical composition comprising a polyclonal recombinant immunoglobulin as active ingredient, combined with at least one pharmaceutically acceptable excipient, can be produced by means of the present invention. More preferred are pharmaceutical compositions where the polyclonal recombinant immunoglobulin is comprised of cognate pairs of variable region encoding sequences. Such pharmaceutical compositions of polyclonal recombinant immunoglobulins can be used as medicaments.
  • the polyclonal recombinant immunoglobulin of the composition can be specific for or reactive against a predetermined disease target, and the composition can thus be used for the treatment, amelioration or prevention of diseases such as cancer, infections, inflammatory diseases, allergy, asthma and other respiratory diseases, autoimmune diseases, immunological malfunctions, cardiovascular diseases, diseases in the central nervous system, metabolic and endocrine diseases, transplant rejection, or undesired pregnancy, in a mammal such as a human, a domestic animal, or a pet.
  • diseases such as cancer, infections, inflammatory diseases, allergy, asthma and other respiratory diseases, autoimmune diseases, immunological malfunctions, cardiovascular diseases, diseases in the central nervous system, metabolic and endocrine diseases, transplant rejection, or undesired pregnancy, in a mammal such as a human, a domestic animal, or a pet.
  • mice were immunised subcutaneously with 50 ⁇ g Tetanus toxoid (TT) in complete Freund's adjuvant. Mice were boosted on day 14 with 50 ⁇ g TT in Freund's incomplete adjuvant. After additional 30 days, the mice were boosted with 50 ⁇ g TT in Freund's incomplete adjuvant. Three days after the last boost, the mice were sacrificed and the spleen taken out and transferred to a tube containing 30 ml of RPMI 1640 w/10% FCS at 4 °C. The tissue was transferred to a 74 ⁇ m cell strainer (Corning,136350-3479) in a 10 cm dish.
  • TT Tetanus toxoid
  • the spleens were macerated through the filter.
  • the filter was rinsed 10 ml RPMI 1640, 10% FCS solution.
  • the filter was removed and the dish filled with 20 ml cold RPMI 1640, 10% FCS.
  • Cells were transferred to 50 ml tube and centrifuged at 300 x g at 2-8 °C for 5 minutes. Cells were re-suspended in 5-10 ml 4 °C RPMI 1640 w/1% FCS and filtered through a 50 ⁇ m syringe filter (Becton Dickinson, 340603). Cells were pelleted and re-suspended in FCS; 10% DMSO to obtain a cell density at 2x10 7 cells/ampoule and frozen.
  • Cells were pelleted and resuspended in 1 ml PBS, 2 % FCS (final volume) and subsequently stained with either anti CD43 FITC diluted 1:100 (BD cat no 553270) and anti CD138 PE diluted 1:40 (BD cat no 553714) in 1 ml PBS, 2 % FCS or with anti B220 APC (BD cat no 553092) diluted 1:40 and anti MHCII FITC (BD cat no 553547) diluted 1:200. Cells were incubated at 4 °C for 20 min in the dark.
  • Fig 3A PI positive (dead) cells were excluded in the bottom left panel (P1). Then plasma cells were gated as CD43 high, CD138 high in the bottom right panel (P2). Finally, doublets were excluded in the SSC-H, SSC-W plot, top right panel (P3). Cells positive for all three gates were sorted into ELISPOT plates according to table 1.
  • Nitrocellulose-bottomed 96 well plates (HA plates, Millipore, Bedford, MA) were pre-wet with PBS (also the blocking wells) before being coated with 100 ⁇ l 25 ⁇ g/ml tetanus toxoid (TT) or sheep anti-murine IgG (Jackson Immuno Research, cat no 515-005-062) in PBS. The same volume of PBS was present in the control wells. The plate was left at 4 °C. The day after, wells were washed three times in PBS before being blocked with 200 ⁇ l RPMI + 2 % skim milk powder and left at 4 °C. The plate was moved to the incubator (37 °C, 5 % CO 2 , 100% humidity) 1h before addition of cells.
  • PBS also the blocking wells
  • TT tetanus toxoid
  • sheep anti-murine IgG Jackson Immuno Research, cat no 515-005-062
  • Plasma cells (PC; CD43 high, CD138 high, Gate P3 in Figure 3A ) specific for TT are present at approximately 2 %, whereas TT specific plasma blasts (PB; B220 and MHCII positive, Gate P4 in Figure 3B ) are present at approximately 4 ⁇ This illustrates the superiority of the PC to PB in production of specific antibodies after TT immunization.
  • a frozen vial of splenocytes was stained as described in example 1.
  • P1 PI positive or dead cells were excluded in the bottom left panel (P1).
  • P2 is CD138 intermediate, CD43 high.
  • P3 is CD138 high, CD43 high.
  • P4 is CD138 high, Cd43 neg.
  • P5 is CD138 intermediate, CD43 low. 10,000 cells positive for P1 and each of the four gates were sorted into test tubes and frozen for evaluation by mouse Symplex.
  • Fractions P2, P3, P4 and P5 were bulk sorted into tubes each containing 10,000 cells.
  • the tubes were centrifuged and resuspended in Dulbecco's modified Eagle's medium containing 2 U/ ⁇ l RNase inhibitor (RNasin, Promoga cat. no. N2511) to a concentration of 250 cells/ ⁇ l, 10 ⁇ l per tube and frozen at -80°C.
  • RNase inhibitor Rosin, Promoga cat. no. N2511
  • a mouse Symplex amplification was performed on a dilution series of each of the 4 sets of sorted lymphocytes to compare the content of IgG-kappa antibody mRNA.
  • the basic principles of the reactions are:
  • the nested reaction was performed with the set of primers shown in Table 3 using FastStart polymerase (Roche) and supplied reagents essentially according to the manufacturer's instructions. 1 ⁇ l of the RT-PCR reaction product was used per nested reaction in a total volume of 20 ⁇ l. Reaction conditions were:
  • mice Female BALB/c, strain A, or C57B16 mice (8-10 weeks old) were used for immunizations by injections with different purified proteins in addition to EGFR overexpressing cells.
  • EGFR proteins R&D systems cat#1095-ER or Sigma # E3641
  • EGFRvIII human growth hormone
  • hGH human growth hormone
  • TEV Tobacco Etch Virus
  • the ECD of EGFR was isolated by TEV-protease cleavage and subsequent purification on a Nickel column.
  • the human head-and-neck cancer cell line, HN5 ( Easty DM, Easty GC, Carter RL, Monaghan P, Butler LJ. Br J Cancer. 1981 Jun;43(6):772-85 .
  • Cells were cultured in DMEM medium supplemented with 10% FBS (Fetal Bovine Serum), 3mM Glycerol, 5mM Sodium Pyruvate and 1% Penicillin Streptomycin. Before each immunization the cells were washed in PBS, trypsinized with TrypLE and resuspended in growth medium. Subsequently the cell suspensions was washed twice in PBS by centrifugation at 250Xg for 5 min, dislodging and resuspension in 15 ml sterile PBS.
  • CFA Complete Freund's Adjuvant
  • IFA Incomplete Freund's Adjuvant
  • IFA is an oil-in-water emulsion composed of mineral oils and CFA is IFA to which heat-killed, dried Mycobacterium species are added. Both adjuvants have a depot effect.
  • CFA gives rise to long-term persistence of the immune response and is used for the first immunizations to boost the immune response and IFA is used for subsequent immunizations.
  • the emulsions were tested by adding a drop on the surface of a glass with water. If the drop remains as one drop, the emulsion is stable and the injections can be performed. Only stable emulsions were administered to mice.
  • mice received 4 injections. All mice were injected with either 300 ⁇ l or 200 ⁇ l emulsion. Depending on the schedule, injections were performed subcutaneously (s.c.), intraperitoneally (i.p.) or intravenously (i.v.).
  • mice were sacrificed by cervical dislocation, and the spleens were removed and transferred to a 74 ⁇ m cell strainer (Corning#136350-3479).
  • the cells were macerated through the filter, resuspended in cold RPMI 1640 with 10% FBS and centrifuged at 300Xg for 5 minutes.
  • the cell pellet was resuspended in RPMI 1640 with 1% FBS, filtered through a 50 ⁇ m syringe filter (BD# 340603) and collected by centrifugation.
  • the cell pellet was cryopreserved after resuspension in FCS with 10% DMSO and frozen cells stored at -80°C until FACS sorting.
  • Vials with frozen splenocytes were thawed at 37°C and transferred to 15 ml tube with ice still present.
  • 10 ml Ice-cold RPMI, 10 % FBS (foetal bovine serum) was added drop-by-drop to the tube while swirling.
  • 10 ml FACS PBS 5 ml FCS PBS is added before filtering the cells through 50 ⁇ m Filcon.
  • Cells were then pelleted and resuspended in 1 ml PBS with 2% FBS (final volume) and stained with anti-CD43-FITC and anti-CD138-PE according to the specific dilution to a final concentration of app. 5 ⁇ g/ml.
  • V H and V L coding sequences were performed on the single cells gated as plasma cells, facilitating cognate pairing of the V H and V L coding sequences.
  • the procedure utilized a two step PCR procedure based on a one-step multiplex overlap-extension RT-PCR followed by a nested PCR.
  • the primer mixes used in the present example only amplify Kappa light chains. Primers capable of amplifying Lambda light chains could, however, be added to the multiplex primer mix and nested PCR primer mix if desired. If Lambda primers are added, the sorting procedure should be adapted such that Lambda positive cells are not excluded.
  • the principle for linkage of cognate V H and V L sequences is illustrated in Figure 1 .
  • the 96-well PCR plates produced were thawed and the sorted cells served as template for the multiplex overlap-extension RT-PCR.
  • the sorting buffer added to each well before the single-cell sorting contained reaction buffer (OneStep RT-PCR Buffer; Qiagen), primers for RT-PCR (see Table 2 above) and RNase inhibitor (RNasin, Promega). This was supplemented with OneStep RT-PCR Enzyme Mix (25x dilution; Qiagen) and dNTP mix (200 ⁇ M each) to obtain the given final concentration in a 20- ⁇ l reaction volume. The plates were incubated for 30 min at 55°C to allow for reverse transcription of the RNA from each cell. Following the RT, the plates were subjected to the following PCR cycle: 10 min at 94°C, 35x(40 sec at 94°C, 40 sec at 60°C, 5 min at 72°C), 10 min at 72°C.
  • the PCR reactions were performed in an H20BIT Thermal cycler with a Peel Seal Basket for 24 96-well plates (ABgene) to facilitate a high-throughput.
  • the PCR plates were stored at -20°C after cycling.
  • 96-well PCR plates were prepared with the following mixture in each well (20- ⁇ l reactions) to obtain the given final concentration: 1 ⁇ FastStart buffer (Roche), dNTP mix (200 ⁇ M each), nested primer mix (see Table 5 ), Phusion DNA Polymerase (0.08 U; Finnzymes) and FastStart High Fidelity Enzyme Blend (0.8 U; Roche).
  • 1 ⁇ l was transferred from the multiplex overlap-extension PCR reactions.
  • the nested PCR plates were subjected to the following thermocyling: 35x(30 sec at 95°C, 30 sec at 60°C, 90 sec at 72°C), 10 min at 72°C.
  • the plates were stored at -20°C until further processing of the PCR fragments.
  • the repertoires of linked V H and V L coding pairs from the nested PCR were pooled, without mixing pairs from different donors, and were purified by preparative 1% agarose gel electrophoresis.
  • the human kappa constant light chain encoding sequence was spliced by overlap extension to the V L coding region of the pooled PCR products of linked V H and V L coding pairs ( Figure 2 ).
  • the human kappa constant light chain encoding sequence was amplified from a plasmid containing the coding sequence of a human antibody with a kappa light chain in a reaction containing: Phusion Enzyme (2 U; Finnzymes), 1x Phusion buffer, dNTP mix (200 ⁇ M each), hKCforw-v2 primer and Kappa3' primer ( Table 6 ), and plasmid template pLL138 (10 ng/ ⁇ l) in a total volume of 50 ⁇ l.
  • the reaction was subjected to the following thermocycling: 25x(30 sec at 95°C, 30 sec at 55°C, 45 sec at 72°C), 10 min at 72°C.
  • the resulting PCR fragment was purified by preparative 1% agarose gel electrophoresis.
  • the purified pooled PCR fragments of each repertoire were spliced to the amplified and purified PCR fragment of the human kappa constant encoding region (Appendix 1) by the following splicing by overlap extension PCR (50 ⁇ l total volume) containing: human kappa constant encoding region fragment (1.4 ng/ ⁇ l), purified pooled PCR fragment (1.4 ng/ ⁇ l), Phusion DNA Polymerase (0.5 U; Finnzymes) and FastStart High Fidelity Enzyme Blend (0.2 U; Roche), 1x FastStart buffer (Roche), dNTP mix (200 ⁇ M each), mhKCrev primer and mJH set primers (see Table 6 ).
  • the reaction was subjected to the following thermocycling: 2 min at 95°C, 25x(30 sec at 95°C, 30 sec at 55°C, 1 min at 72°C), 10 min at 72°C.
  • the resulting PCR fragment (approx. 1070 bp) was purified by preparative 1% agarose gel electrophoresis.
  • V H and V L coding sequences obtained were expressed as full-length antibodies. This involved insertion of the repertoire of V H and V L coding pairs into an expression vector and transfection into a host cell.
  • a two-step cloning procedure was employed for generation of a repertoire of expression vectors containing the linked V H and V L coding pairs.
  • the repertoire of expression vectors contains ten times as many recombinant plasmids as the number of cognate paired V H and V L PCR products used for generation of the screening repertoire, there is 99% likelihood that all unique gene pairs are represented.
  • a repertoire of at least 4000 clones was generated for screening.
  • the purified PCR product of the repertoires of linked V H and V L coding pairs, spliced to the human kappa constant coding region was cleaved with Xhol and Notl DNA endonucleases at the recognition sites introduced into the termini of PCR products.
  • the cleaved and purified fragments were ligated into an Xhol/Notl digested mammalian IgG expression vector, OO-VP-002 ( Figure 6 ) by standard ligation procedures.
  • the ligation mix was electroporated into E. coli and added to 2xYT plates containing the appropriate antibiotic and incubated at 37°C over night.
  • the amplified repertoire of vectors was purified from cells recovered from the plates using standard DNA purification methods (Qiagen).
  • the plasmids were prepared for insertion of promoter-leader fragments by cleavage using Ascl and Nhel endonucleases. The restriction sites for these enzymes were located between the V H and V L coding gene pairs.
  • an Asc l- Nhe l digested bi-directional mammalian promoter-leader fragment was inserted into the Asc l and Nhe l restriction sites by standard ligation procedures.
  • the ligated vector was amplified in E. coli and the plasmid was purified using standard methods.
  • the generated repertoire of screening vectors was transformed into E. coli by conventional procedures. Colonies obtained were consolidated into 384-well master plates and stored. The number of arrayed colonies exceeded the number of input PCR products by at least 3-fold, thus giving 95% percent likelihood for the presence of all unique V-gene pairs obtained.
  • DNA plasmid was prepared from selected clones above and FreeStyle CHO-S cells (Invitrogen) were transfected in 2-ml scale for expression of antibodies (according to the manufacturer's instructions). The supernatants were harvested 96 hours after transfection.
  • the screening was made as a two step procedure.
  • the antibody libraries were screened for reactivity to recombinant EGFR protein in ELISA after which FMAT (FLISA) was used as a cell based approach, with the NR6wtEGFR cell line ( Batra et al, 1995, Cell Growth Differ, 6(10):1251-9 ), for detection of EGFR-antibodies binding to cell-surface expressed EGFR.
  • FMAT FLISA
  • NR6wtEGFR cell line Batra et al, 1995, Cell Growth Differ, 6(10):1251-9
  • the ELISA was performed with recombinant EGFR representing the extracellular domain of the EGFR.
  • Nunc maxisorb plates (cat no 464718) were coated with 1 ⁇ g/ml protein (in house produced), diluted in PBS at 4C over night. Prior to blocking in 50ul 2%-Milk-PBS-T the plates were washed once with PBS + 0.05 % Tween 20 (PBS-T). The plates were washed once with PBS-T, 20 ⁇ l of 2%- milk-PBS-T and 5 ⁇ l supernatants from the FreeStyle CHO-S transfectants (see above) were added and incubated for 11 ⁇ 2 hours R.T after which the plates were washed once with PBS-T 20 ⁇ l per well.
  • SKBR-3 ATCC #HTB-30
  • NR6wtEGFR cells were kept in growth medium. The cells were counted and diluted to 125,000 cells/ml with the Alexa-647 conjugated goat-anti-human IgG (H-L) antibody (Molecular probes No. A21445, lot no. 34686A) diluted 1:40,000. A total of 20 ⁇ l of this suspension was transferred to 384 well clear bottom Nunc plates. Subsequently 10 ⁇ l transfection supernatant was added to the cells. The FMAT signal from the reaction was measured after 6-10 hours of incubation.
  • H-L Alexa-647 conjugated goat-anti-human IgG
  • the data from the screening indicates that 221 (4.8%) of the total clones were positive in the ELISA. 93 (2.0%) of those clones were also positive in FMAT. In total 220 (4.8%) of the clones were positive in the FMAT and among those 127 (220-93) uniquely positive for the cell surface antigen.
  • the 111 library was screened in a similar fashion, but since the immunization procedure was made to generate antibodies specific for the deletion mutant EGFR receptor EGFRvIII, the ELISA screenings included assays to detect both wild-type EGFR and EGFRvIII.
  • the clones identified as EGFR-specific in ELISA were retrieved from the original master plates (384-well format) and consolidated into new plates. DNA was isolated from the clones and submitted for DNA sequencing of the V-genes. The sequences were aligned and all the unique clones were selected. Multiple alignments of obtained sequences revealed the uniqueness of each particular clone and allowed for identification of unique antibodies. Following sequence analysis of 220 clones, 70 genetically distinct antibody sequence clusters were identified. Each cluster of related sequences have probably been derived through somatic hypermutations of a common precursor clone. Overall, one to two clones from each cluster were chosen for validation of sequence and specificity.
  • DNA plasmid was prepared and transfection of FreeStyle CHO-S cells (Invitrogen) in 2-ml scale was performed for expression. The supernatant were harvested 96 hours after transfection. Expression levels were estimated with standard anti-IgG ELISA, and the specificity was determined by EGFR- and EGFRvIII-specific ELISA. 85% of the clones were shown to have the correct specificity and sequence.
  • the Cell Proliferation Reagent WST-1 (Roche Cat. No. 11 644 807 001) is a ready-to-use substrate which measures the metabolic activity of viable cells. It is then assumed that the metabolic activity correlates with the number of viable cells. In this example the WST-1 assay was used to measure the number of metabolically active cells after treatment with cell culture supernatants containing different anti-EGFR antibodies.
  • HN5 cells were then washed with 1xPBS and detached by trypsination with 3 ml trypsin solution. 17 ml of complete media was then added and the cells were spun down at 300xg (1200 rcf) for 5 min. The supernatant was removed and cells were re-suspended in DMEM + 0.5% FBS. Cells were counted and their concentration was adjusted and 1500 cells were added to the wells with supernatants so that each well contained 200 ⁇ l media in total. The plates were incubated for 4 days in a humidified incubator at 37°C. Then 20 ⁇ l WST-1 reagent was added pr. well and the plates were incubated for one hour at 37°C.
  • MAC metabolically active cells
  • cross-priming introduces amino acids that are not naturally occurring in the immunoglobulin framework with several potential consequences, e.g. structural changes and increased immunogenicity, all resulting in a decreased therapeutic activity.
  • the V H sequence was PCR amplified with a primer set containing the sequence corresponding to the V H -gene the clone of interest originated from, thereby correcting any mutations introduced by cross-priming.
  • the PCR fragment was digested with Xho l and Asc l and ligated back into the Xho l/ Asc l digested mammalian expression vector ( Figure 6 ) using conventional ligation procedures.
  • the ligated vector was amplified in E. coli and the plasmid was purified by standard methods.
  • the V H sequence was sequenced to verify the correction and the vector was digested with Nhe l/ Not l to prepare it for insertion of the light chain.
  • the complete light chain was PCR amplified with a primer set containing the sequence corresponding to the V L -gene the clone of interest originated from, thereby correcting any mutations introduced by cross-priming.
  • the PCR fragment was digested with Nhe l/ Not l and ligated into the V H containing vector prepared above.
  • the ligation product was amplified in E. coli and the plasmid was purified by standard methods. Subsequently, the light chain was sequenced to verify the correction.
  • the Kappa constant region of a selected clone contains mutations, introduced during the amplification of the genes, it is replaced by an unmutated constant region. This is done in an overlap PCR where the repaired V L -gene (amplified without the constant region) is fused to a constant region with correct sequence (obtained in a separate PCR). The whole sequence is amplified and cloned into the V H containing vector as described above and the repaired light chain is sequenced to verify the correction.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Organic Chemistry (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Biophysics (AREA)
  • Immunology (AREA)
  • General Health & Medical Sciences (AREA)
  • Biochemistry (AREA)
  • Molecular Biology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Engineering & Computer Science (AREA)
  • Medicinal Chemistry (AREA)
  • Wood Science & Technology (AREA)
  • Zoology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • Biotechnology (AREA)
  • Biomedical Technology (AREA)
  • Physics & Mathematics (AREA)
  • Microbiology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Virology (AREA)
  • Plant Pathology (AREA)
  • Analytical Chemistry (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Peptides Or Proteins (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)
  • Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)

Abstract

The invention relates to a procedure for linking cognate pairs of VH and VL encoding sequences from a population of cells enriched in particular surface antigen markers. The linking procedure involves a multiplex molecular amplification procedure capable of linking nucleotide sequences of interest in connection with the amplification, in particular polymerase chain reaction (multiplex PCR). The method is particularly advantageous for the generation of cognate pair libraries as well as combinatorial libraries of variable region encoding sequences from immunoglobulins. The invention also relates to methods for generation of chimeric human/non-human antibodies and expression libraries generated by such methods.

Description

    Technical Field of the Invention
  • The present invention relates to a procedure for linking cognate pairs of VH and VL encoding sequences from a population of cells enriched in particular surface antigen markers. The linking procedure involves a multiplex molecular amplification procedure capable of linking nucleotide sequences of interest in connection with the amplification, in particular polymerase chain reaction (multiplex PCR). The method is particularly advantageous for the generation of cognate pair libraries as well as combinatorial libraries of variable region encoding sequences from immunoglobulin. The invention also relates to methods for generation of chimeric human/non-human antibodies and expression libraries generated by such methods.
  • Background of the Invention
  • WO 2005/042774 (Symphogen) discloses a method for linking nucleotide sequences of interest, in particular cognate pairs of VH and VL encoding sequences using a multiplex molecular procedure. The sequences of interest are preferably amplified and linked from isolated single cells following limited dilution or other cell separation techniques. The reference discloses various ways of enriching a lymphocyte containing cell population to obtain a population of plasma cells that are particularly suitable for the multiplex molecular amplification procedure.
  • WO 2007/003041 discloses a method of isolating an antibody-secreting cell that has a high likelihood of secreting an antibody specific for a desired antigen, comprising the steps: a) providing a sample from an animal that was recently immunized with the desired antigen, wherein the sample comprises a population of antibody-secreting cells that are secreting antibodies specific for the desired antigen; b) increasing the concentration of antibody-secreting cells in the sample; and c) isolating an antibody-secreting cell. The concentration of antibody-secreting cells in the sample may be increased by enriching for cells or depleting cells in the sample with surface markers.
  • Summary of the invention
  • The present invention focuses on methods for generating libraries of immunoglobulin encoding sequences from non-human animals and methods for generating in a few steps adapted for high throughput cloning and screening, libraries of vectors coding for chimeric antibodies comprising human constant regions and non-human variable regions.
  • In a first aspect the invention relates to a method of producing a library of cognate pairs comprising linked variable region encoding sequences, said method comprising:
    1. a) providing a lymphocyte-comprising cell fraction from a rodent donor;
    2. b) obtaining a population of isolated single cells by distributing cells from said cell fraction individually into a plurality of vessels, wherein at least a subpopulation of the cells express are enriched in and express CD43 and CD138 antigen or MHCII and B220 antigen; and
    3. c) amplifying and effecting linkage of the variable region encoding sequences contained in said population of isolated single cells by amplifying, in a multiplex molecular amplification procedure, nucleotide sequences of interest using a template derived from an isolated single cell or a population of isogenic cells; and effecting linkage of the amplified nucleotide sequences of interest.
  • This method provides a library of cognate pair antibodies or antibody fragments.
  • In another aspect the invention relates to a method of randomly linking a plurality non-contiguous nucleotide sequences of interest, wherein said nucleotide sequences of interest comprise variable region encoding sequences and the linkage generates a combinatorial library of pairs of variable region encoding sequences, said method comprising:
    1. a) amplifying, in a multiplex molecular amplification procedure, nucleotide sequences of interest using a template derived from a population of genetically diverse cells, wherein the genetically diverse cells are derived from a lymphocyte-comprising cell fraction from a rodent donor, ane wherein at least a subpopulation of the cells are enriched in and express CD43 and CD138 antigen or MHCII and B220 antigen; and
    2. b) effecting linkage of the nucleotide sequences of interest amplified in step a).
  • This method preferably provides a combinatorial library of randomly combined heavy and light chain immunoglobulin variable domains.
  • Experimental evidence provided in the present application establishes that cell populations isolated from mouse splenocytes and being positive for the listed surface antigens provide a good starting material for cloning of antibody encoding sequences using a multiplex molecular amplification method. The methods of the invention can easily be applied to other species expressing the orthologs of CD43 and CD138 or MHCII and B220, in particular the methods can be applied to other rodents such as rats.
  • The method provides several advantages over the alternative, viz generation of hybridoma. When preparing hybridoma from an immunised mouse, the established cell lines will encode a repertoire of different antibody isotypes. The hybridoma cell lines will subsequently have to be screened for both function (e.g. binding to a specific antigen, efficacy in neutralising a pathogen) and for the antibody isotype. Thus it requires a two step screening procedure to select a hybridoma with a particular antibody isotype and with a particular effect in a functional assay. Finally, in order to generate a producer cell line, the antibody secreted by the selected hybridoma will need to be cloned and sequenced before it can be transferred to a producer cell line.
  • Using the method of the present invention, the antibody isotype is determined by the primers used for the multiplex molecular amplification, and therefore this will not need to be determined. The antibody isotype may be also be determined (and can be changed) by the subsequent linking or splicing of constant domain(s) to cloned variable sequences. Furthermore, the method of the invention provides a library of polynucleotides that can be easily sequenced and/or inserted into vectors, such as expression, transfer, display or shuttle vectors, so that once a particular antibody has been selected, it is already cloned, its sequence is already known and it can be transferred easily to an appropriate expression vector for production of antibody.
  • It is expected that the cells sorted according to the provided protocol provide a source of high affinity antibodies, potentially with affinities in the picomolar range. Monoclonal antibodies from hybridoma do not possess affinities in the picomolar range, and will need to be synthetically affinity matured to reach these affinities.
  • According to one embodiment these methods further comprise assessing prior to multiplex molecular amplification that the population of lymphocyte-comprising cells comprises cells that express CD43 and CD138 antigen or MHCII and B220 antigen, preferably CD43 and CD138. Also the methods may include enriching said lymphocyte-comprising cell fraction for a lymphocyte population expressing CD43 and CD138 antigen or MHCII and B220 prior to multiplex molecular amplification.
  • Preferably the methods further comprise isolating from said lymphocyte-comprising population cells expressing CD43 and CD138 antigen or MHCII and B220 antigen prior to multiplex molecular amplification. In a preferred embodiment the isolated cells or subpopulation of cells are CD138 High/CD43 High or CD138 Intermediate/CD43 High relative to the lymphocyte-comprising cell fraction. More preferably the isolated cells or subpopulation of cells are CD138 High/CD43 High relative to the lymphocyte-comprising cell fraction.
  • Enrichment or isolation preferably comprises an automated sorting procedure, such as MACS or FACS.
  • Also described herein is a method for generating a vector encoding a chimeric antibody with human constant regions and non-human variable regions, said method comprising:
    1. a) providing a lymphocyte-comprising cell fraction from a non-human animal;
    2. b) obtaining a population of isolated single cells, comprising distributing cells from said cell fraction individually into a plurality of vessels;
    3. c) amplifying and effecting linkage of the variable region encoding nucleic acids contained in said population of isolated single cells by amplifying, in a multiplex molecular amplification procedure, said nucleic acids using a template derived from an isolated single cell or a population of isogenic cells; and effecting linkage of the amplified nucleic acids encoding variable regions of heavy and light chains;
    4. d) effecting linkage of the amplified variable regions to human constant regions; and
    5. e) inserting the obtained nucleic acid into a vector.
  • Preferably the non-human animal is a mouse. To the extent that the methods of the invention are applied to mouse cells, the methods are named: Mouse-Symplex™ or mSymplex™.
  • This method provides a novel method for generation of libraries of chimeric human/non-human antibodies. This is made possible by combining the multiplex molecular amplification and subsequent cloning into a vector backbone with ligation and/or splicing of human heavy and light chain constant domains. Traditionally, in a method for generating chimeric human/non-human antibodies, the chimerisation has been the last step after hybridoma have been established and screened and the encoded antibody has been cloned. Chimerisation may affect the binding specificity and/or affinity of an antibody, and thus there is a risk that a good monoclonal mouse antibody loses its efficacy when it is chimerised into a human/mouse antibody.
  • By the provision of a method that directly generates an antibody of chimeric antibodies, the screening can be carried out on products that do not need to be modified further prior to preclinical and clinical development.
  • The constant human regions can be provided in a molecular amplification step or they can be provided as part of a vector-backbone, into which the variable regions are cloned following molecular amplification. In a preferred embodiment the method comprises a further amplification step, wherein a polynucleotide encoding a human constant light chain or a fragment thereof with an overlap capable of providing linkage to the variable light chain, is added to the PCR mixture together with a primer set capable of amplification of a construct comprising in order: a murine VH chain, a linker, a murine VL chain, and a human constant light chain.
  • In another embodiment the method comprises a further amplification step, wherein a polynucleotide encoding human constant heavy chain or a fragment thereof with an overlap capable of providing linkage to the variable heavy chain, is added to the PCR mixture together with a primer set capable of amplification of a construct comprising in order: a human constant heavy chain, a murine VH chain, a linker, and a murine VL chain.
  • Consequently there is also provided a library of vectors encoding chimeric antibodies, each antibody member consisting of non-human immunoglobulin variable region encoding sequences, and human immunoglobulin heavy and light chain constant regions.
  • Preferably, the vectors are expression vectors enabling the expression of the antibody members of the library for subsequent screening. More preferably the expression vector is for mammalian expression.
  • The vectors of the library may be obtained by a method of the invention.
  • In one embodiment the light chain constant region is a kappa constant region.
  • The non-human sequences may be from rat, sheep, goat, rabbit, guinea-pig or other suitable animal for which immunisation protocols have been described, for which sequence information is available to allow the design of suitable primers, and for which suitable cell sorting techniques enable sorting of plasma cells for single cell mulitiplex molecular amplification to link cognate pairs of variable region sequences. In one embodiment, the non-human sequences are of non-human primate origin, such as cynomolgous monkey, Rhesus monkey, chimpanzee, or maqaque. Preferably the non-human sequences are rodent, such as murine or rat. In another embodiment the non-human sequences are rabbit sequences.
  • Preferably the variable regions of the antibodies are cognate pairs.
  • Also described herein is a sub-library which codes for antibodies exhibiting desired binding specificities directed against a particular target, selected from a library prepared according to the invention.
  • Description of the Figures
    • Figure 1. Murine - mSymplex™ PCR: Multiplex overlap extension RT-PCR for the amplification and cognate linkage of heavy and light chain antibody genes from a single cell. Exemplary primer mixes used for the RT-PCR and the nested PCR are described in detail in Table 2 and Table 3 (or Table 5).
    • Figure 2. Murine repertoire cloning: a pool of mSymplex™ PCR products encoding VHNL gene pairs from single plasma cells were spliced to the gene encoding human kappa constant light chain by splicing by overlap extension. The pool of genes, encoding complete human-mouse chimeric antibodies, was inserted into an expression vector followed by an insertion of a bi-directional promoter cassette (2xCMV). The primer mixes used for the human kappa splicing are described in detail in Table 6.
    • Figure 3A. Sorting of mouse splenocytes. (A) For isolating plasma cells defined as CD43 high, CD138 high, PI (propidium iodide) positive or dead cells were excluded in the bottom left panel (Not P1). Then plasma cells were gated as CD43 high, CD138 high in the bottom right panel (P2). Finally, doublets were excluded in the SSC-H, SSC-W plot, top right panel (P3). Cells positive for all three gates were sorted into ELISPOT plates. (B) For isolating plasmablasts defined as MHCII intermediate, B220 intermediate, PI (propidium iodide) positive or dead cells were excluded in the bottom left panel (Not P1). Then plasmablasts were gated as MHCII intermediate, B220 intermediate, bottom right panel (P2). Doublets were excluded in the SSC-H, SSC-W plot, top right panel (P3), and finally cells were gated for size in the top left panel (P4). Cells positive for all four gates were sorted into ELISPOT plates.
    • Figure 4. Sorting of mouse splenocytes. Firstly, PI positive or dead cells were excluded in the bottom left panel (P1). In the top right dot plot, CD138 PE and CD43 FITC are depicted. Four gates were set on different phenotypic cell populations: P2 is CD138 intermediate, CD43 high. P3 is CD138 high, CD43 high. P4 is CD138 high, Cd43 neg. P5 is CD138 intermediate, CD43 low. 10,000 cells positive for P1 and each of the four gates were sorted into test tubes and frozen for evaluation by Symplex.
    • Figure 5. Gel electrophoresis of PCR products from Symplex PCR titration of cellular lysate from the 4 sorted fractions. P2, P3, P4 and P5 are the gates sorted according to Figure 4. M are molecular weight markers.
    • Figure 4. A schematic representation of the mammalian full-length antibody expression vector 00-VP-002. Amp and Amp pro, ampicillin resistance gene and its promoter; pUC origin, pUC origin of replication; CMV, mammalian promoter driving the expression of the light chain and the heavy chain; IGHV Leader, genomic human heavy chain leader; H stuffer, insert that is exchanged for the heavy chain variable region encoding sequence; IGHG1, sequence encoding genomic immunoglobulin isotype G1 heavy chain constant region (sequence is shown in Appendix 1); Rabbit B-globin A, rabbit beta-globin polyA sequence; IGKV Leader, murine kappa leader; L Stuffer, insert that is exchanged for the light chain encoding sequence; SV40 term, simian virus 40 terminator sequence; FRT, Flp recognition target site; Neo, neomycin resistance gene; SV40 poly A, simian virus 40 poly A signal sequence.
    • Figure 5. Analysis of a repertoire of chimeric anti-hEGFR antibodies. Cluster analysis of the absorbance difference at 450-620 nm. Supernatants are clustered by reactivity as indicated by the number (1 to 4) following the clone no. Dark grey indicates a decrease in the number of metabolically active cells, whereas light grey indicates an increase in the number of metabolically active cells. Black indicates supernatants with no effect on the number of metabolically active cells.
    Description of the Invention
  • The present invention sets out to provide further possibilities for using the amplification and linkage method disclosed in WO 2005/042774 for providing collections of antibody vectors from non-human animals. These improvements enable the cloning of human/non-human chimeric antibody encoding sequences with cognate pairs of variable regions to be fitted to a high-throughput format. This is basically achieved by providing a new starting material for the amplification and linkage processes and by providing methods for generation of libraries of chimeric human/non-human antibodies with cognate pairs of variable regions.
  • One aspect of the invention is a method of linking heavy and light chain variable sequences comprising amplifying, in a multiplex molecular amplification procedure, the relevant nucleotide sequences using a template derived from an isolated single cell, a population of isogenic cells or a population of genetically diverse cells and effecting a subsequent linkage of the amplified sequences.
  • Definitions
  • The term "cognate pair" describes an original pair of non-contiguous nucleic acids of interest that are contained within or derived from a single cell. In preferred embodiments, a cognate pair comprises two variable region encoding sequences which together encode a binding protein variable domain and which gene sequences are derived from the same cell. Thus, when expressed either as a complete binding protein or as a stable fragment thereof, they preserve the binding affinity and specificity of the binding protein originally expressed from this cell. A cognate pair can for example be comprised of an antibody variable heavy chain encoding sequence associated with a variable light chain encoding sequence from the same cell, or a T cell receptor α chain encoding sequence associated with a β chain encoding sequence from the same cell. A library of cognate pairs is a collection of such cognate pairs.
  • The term "hot-start polymerase" describes polymerases that are inactive or have very low activity at temperatures used for reverse transcription. Such polymerases need to be activated by high temperatures (90 to 95°C) to become functional. This is for example an advantage in single-step RT-PCR procedures, since this prohibits interference of the polymerase with the reverse transcriptase reaction.
  • The term "isogenic population of cells" describes a population of genetically identical cells. In particular, an isogenic population of cells derived by clonal expansion of an isolated single cell is of interest in the present invention.
  • The term "isolated single cell' describes a cell that has been physically separated from a population of cells corresponding to "a single cell in a single vessel'. When distributing a population of cells individually among a plurality of vessels, a population of isolated single cells is obtained. As specified in the section entitled "Template sources" the proportion of vessels with a single cell is not necessarily 100% in order to call it a population of single cells.
  • Terms derived from "link" or "linkage" in relation to amplification describe the association of the amplified nucleic acid sequences encoding the nucleic acid sequences of interest into a single segment. In relation to cognate pairs a segment comprises nucleic acid sequences encoding a variable domain, e.g. an antibody heavy chain variable region associated with an antibody light chain variable region encoding sequence, derived from the same cell. The linkage can either be achieved simultaneously with the amplification or as an immediate step following the amplification. There are no requirements to the form or functionality of the segment, it may be linear, circular, single stranded or double stranded. Nor is the linkage necessarily permanent, one of the nucleic acid sequences of interest may be isolated from the segment if desired, one of the variable region encoding sequence may for example be isolated from a cognate pair segment. However, as long as the original variable regions constituting the cognate pair are not scrambled with other variable regions, they are still considered a cognate pair, although not linked together into a single segment. The linkage is preferably a nucleotide phosphodiester linkage. However, linkage can also be obtained by different chemical cross linking procedures.
  • The term "multiplex molecular amplification" describes the simultaneous amplification of two or more target sequences in the same reaction. Suitable amplification methods include the polymerase chain reaction (PCR) ( U.S. 4,683,202 ), ligase chain reaction (LCR), (Wu and Wallace, 1989, ), strand displacement amplification (SDA) technique (Walker et al., 1992, Nucl. Acids Res. 20, 1691-6), self-sustained sequence replication (Guatelli et al., 1990, Proc. Nat. Acad. Sci. U.S.A., 87, 1874-8) and nucleic acid based sequence amplification (NASBA) (Compton J., 1991, Nature 350, 91-2). The latter two amplification methods involve isothermal reactions based on isothermal transcription, which produce both single stranded RNA (ssRNA) and double stranded DNA (dsDNA).
  • The term "multiplex PCR' describes a variant of PCR in which two or more target sequences are amplified simultaneously, by including more than one set of primers in the same reaction, e.g. one primer set adapted for amplification of the heavy chain variable region and one primer set adapted for amplification of the kappa chain variable region in the same PCR reaction. Additionally or alternatively a primer set adapted for amplification of the lambda chain variable region may be combined with these primer sets.
  • The term "multiplex RT-PCR'describes a multiplex PCR reaction, which is preceded by a reverse transcription (RT) step. The multiplex RT-PCR, can either be performed as a two-step process with a separate RT step prior to the multiplex PCR, or as a single-step process where all components for both RT and multiplex PCR are combined in a single tube.
  • The terms "multiplex overlap-extension PCR" and "multiplex overlap-extension RT-PCR" imply that the multiplex PCR or multiplex RT-PCR is performed utilizing a multiplex overlap-extension primer mix to amplify the target sequences, thereby enabling simultaneous amplification and linkage of the target sequences.
  • The term "a plurality of vessels" describes any object (or collection of objects) which enables the physical separation of a single cell from a population of cells. This may be tubes, multiwell plates (e.g. 96-well, 384-well, microtiter plates or other multiwell plates), arrays, microarrays, microchips, gels, or a gel matrix. Preferably the object is applicable for PCR amplification.
  • The term "polyclonal protein" or "polyclonality" as used herein refers to a protein composition comprising different, but homologous protein molecules, preferably selected from the immunoglobulin superfamily. Thus, each protein molecule is homologous to the other molecules of the composition, but also contains one or more stretches of variable polypeptide sequence, which is/are characterized by differences in the amino acid sequence between the individual members of the polyclonal protein. Known examples of such polyclonal proteins include antibody or immunoglobulin molecules, T cell receptors and B cell receptors. A polyclonal protein may consist of a defined subset of protein molecules, which has been defined by a common feature such as the shared binding activity towards a desired target, e.g. a polyclonal antibody exhibiting binding specificity towards a desired target antigen.
  • The term "a population of genetically diverse cells" as used herein refers to a cell population where the individual cells in the population differ among each other on the genomic level. Such a population of genetically diverse cell is for example a population of cells derived from a donor, or a fraction of such cells, e.g. a B lymphocyte or a T lymphocyte containing cell fraction.
  • The term "primer set' is used interchangeably with the term "primer pair" and describes two or more primers which together are capable of priming the amplification of a nucleotide sequence of interest (i.e., one member of a cognate pair). A primer set of the present invention may be designed to prime a family of nucleotide sequences containing variable region encoding sequences. Examples of different families are antibody kappa light chains, lambda light chains, heavy chain variable regions, and α, β, γ, or δ T cell receptor variable regions. A primer set for the amplification of a family of nucleotide sequences containing variable region encoding sequences often constitutes a plurality of primers where several primers can be degenerate primers.
  • The term "sequence identity" is expressed as a percentage which indicates the degree of identity between to nucleic acid sequences over the length of the shortest of the two sequences. It can be calculated as (Nref - Ndif)×100%/Nref, wherein Nref is the number of residues in the shorter of the sequences, and wherein Ndif is the total number of non-identical residues in an Nref long optimally aligned match between the two sequences. Hence, the DNA sequence AGTCAGTC (Seq. no. 32) will have a sequence identity of 75% with the sequence TAATCAATCGG (Seq. no. 33) (Ndif=2 and Nref=8) (underlining shows the optimal alignment, and bold indicates the two non-identical residues out of 8).
  • The terms "randomly" or "random" with respect to linkage refer to linkage of nucleotide sequences which are not derived from the same cell but are linked transversely among a population of genetically diverse cells. If the nucleotide sequences of interest are variable region encoding sequences, this will result in a combinatorial library of linked sequences. If, on the other hand, the nucleotide sequences of interest encode a non-diverse heteromeric protein the randomly linked sequences will appear similar to sequences linked from a single cell.
  • The term "template derived from an isolated single cell", with regard to reverse transcription, relates to the nucleic acids within such an isolated cell. The nucleic acids can for example be in the form of RNA, mRNA, DNA or genomic DNA. The nucleic acids can either be isolated from the cell or still be with the remaining contents of the cell, where the cell is in an intact form or a lysed form.
  • The term "CD43" refers to a mouse surface antigen known under numerous synonyms including 3E8 antigen, A630014B01Rik, B cell differentiation antigen LP-3, Cd43, CD43 antigen, Galgp, Leucocyte sialoglycoprotein, leukosialin, Leukosialin precursor, Ly48, Ly-48, Sialophorin, as well as to orthologous surface markers from other animals.
  • The term "CD138" refers to a mouse surface antigen known under numerous synonyms including Syndecan-1, AA408134, AA409076, CD138, syn-1, Synd, Synd1, SYND1, Synd-1, Syndecan-1 precursor as well as to orthologous surface markers from other animals.
  • The term "MHCII" refers to a mouse surface antigen known under numerous synonyms including CD74 antigen, CLIP, DHLAG, H-2 class II histocompatibility antigen gamma chain, HLADG, HLA-DR-GAMMA, la antigen-associated invariant chain, la-GAMMA, Ii, MHC class II-associated invariant chain as well as to orthologous surface markers from other animals.
  • The term "B220" refers a mouse surface antigen known under numerous synonyms including B220, Cd45, CD45, CD45 antigen, CD45R, L-CA, Leukocyte common antigen precursor, loc, Ly-5, Lymphocyte common antigen Ly-5, Lyt-4, T200 as well as to orthologous surface markers from other animals.
  • Detailed description of the invention The Amplification and Linkage Process
  • One feature of the present invention reduces the number of tubes necessary to amplify the nucleotide sequences of interest, utilizing a variant of PCR in which two or more target sequences are amplified simultaneously in the same tube, by including more than one set of primers, for example all the primers necessary to amplify variable region encoding sequences, in the same reaction. Generally this approach is known as multiplex polymerase chain reaction (multiplex PCR).
  • A further feature of the present invention is that two or more target sequences amplified by multiplex PCR are linked in close proximity to the amplification process. In particular cognate pairs of variable region encoding sequences are linked by this process.
  • One embodiment of the present invention exploits that a multiplex primer mix can be designed to work in an overlap-extension PCR procedure, resulting in a simultaneous amplification and linkage of nucleotide sequences of interest. This multiplex overlap-extension PCR technique serves to reduce the number of reactions necessary to isolate and link nucleotide sequences of interest, in particular cognate pairs of linked variable regions.
  • Other embodiments of the present invention apply linkage by ligation or by recombination as an alternative to linkage by multiplex overlap-extension PCR. In these procedures, the linkage is not performed simultaneously with the multiplex PCR amplification, but as an immediate step following the amplification. However, linkage can still be performed in the same tube as the multiplex PCR was performed in.
  • A multiplex overlap-extension PCR requires the presence of two or more primer sets (a multiplex primer mix), where at least one primer of each set is equipped with an overlap-extension tail. The overlap-extension tails enable the linkage of the products generated by each of the primer sets during amplification. Such a primer mix is called a multiplex overlap-extension primer mix. The multiplex overlap-extension PCR differs from conventional overlap-extension PCR in that the sequences to be linked are generated simultaneously in the same tube, thereby providing immediate linkage of the target sequences during amplification, without any intermediate purification.
  • A further feature of the present invention is a reverse transcription (RT) step preceding the multiplex PCR or multiplex overlap-extension PCR amplification, utilizing a template derived from an isolated single cell or a population of isogenic cells.
  • A further feature of the present invention is the use of nucleotide sequences derived from an isolated single cell or a population of isogenic cells as template for the multiplex PCR amplification. Preferably, RNA from a single cell is reverse transcribed into cDNA prior to the multiplex PCR. For the amplification of some nucleic acid sequences of interest genomic DNA may be used as an alternative to mRNA. By using isolated single cells or a population of isogenic cells derived by clonal expansion of an isolated single cell as template source, it is possible to avoid scrambling of nucleotide sequences encoding a heteromeric protein of interest, with nucleotide sequences derived from different cells within a population of cells. This is of importance if one wishes to obtain the original composition of the sequences of interest. Especially for the generation of a cognate pair of variable region encoding sequences, the use of an isolated single cell or a population of isogenic cells as template source is an important feature.
  • Additionally, the present invention facilitates the generation of libraries of linked nucleic acid sequences of interest, in particular combinatorial libraries and libraries of cognate pairs of variable regions. Further, the present invention utilizes nucleic acids derived from single cells, preferably in the form of RNA that does not need to be isolated from the remaining cell contents before it can be utilized as template.
  • One embodiment of the present invention encompasses the linkage of a plurality of non-contiguous nucleotide sequences of interest. The method comprises amplifying, in a multiplex PCR or multiplex RT-PCR amplification procedure, nucleotide sequences of interest using a template derived from an isolated single cell or a population of isogenic cells and effecting linkage of the amplified nucleotide sequences of interest. Further, the method comprises an optional step of performing an additional amplification of the linked products. A further embodiment of the present invention encompasses a method of producing a library of cognate pairs comprising linked variable region encoding sequences. The method comprises providing a lymphocyte-containing cell fraction from a donor, which is optionally enriched for a particular lymphocyte population from said cell fraction, or wherein a particular lymphocyte population has been isolated from said cell fraction. Further, a population of isolated single cells is obtained by distributing cells from the lymphocyte-containing cell fraction, or the enriched cell fraction, individually among a plurality of vessels. Multiplex molecular amplification (multiplex RT-PCR amplification) of the variable region encoding sequences contained in the population of isolated single cells is performed and linkage of pairs of variable region encoding sequences is effected, wherein an individual pair of variable region sequences is derived from a single cell, within the population of isolated single cells. Further, the technique comprises two optional steps: in the first step the individual isolated single cell in the population of single cells is expanded to a population of isogenic cells prior to performing multiplex RT-PCR amplification, thereby obtaining a plurality of vessels with a diverse population of isogenic cells (one population of isogenic cells in one vessel). The second optional step encompasses performing an additional amplification of the linked variable region encoding sequences.
  • In preferred embodiments of the present invention, an individual member of said library of cognate pairs comprised of an immunoglobulin light chain variable region encoding sequence is associated with an immunoglobulin heavy chain variable region encoding sequence, originating from the same cell or of sequences encoding a T cell receptor binding domain, constituted of an alpha chain variable region associated with a beta chain variable region or a gamma chain variable region associated with a delta chain variable region, where the associated variable regions originate from the same cell.
  • The multiplex RT-PCR amplification of the present invention can be performed either as a two-step process, where reverse transcription (RT) is performed separate from the multiplex PCR amplification (or alternative multiplex molecular amplification), or as a single-step process, where the RT and multiplex PCR amplification steps are performed with the same primers in one tube.
  • The reverse transcription (RT) is performed with an enzyme containing reverse transcriptase activity resulting in the generation of cDNA from total RNA, mRNA or target specific RNA from an isolated single cell. Primers which can be utilized for the reverse transcription are for example oligo-dT primers, random hexamers, random decamers, other random primers, or primers that are specific for the nucleotide sequences of interest.
  • The two-step multiplex RT-PCR amplification procedure allows for the cDNA generated in the RT step to be distributed to more than one vessel, permitting the storage of a template fraction before proceeding with the amplification. Additionally, the distribution of cDNA to more than one tube allows for the performance of more than one multiplex PCR amplification of nucleic acid derived from the same template. Although this results in an increased number of separate reactions, it opens for the possibility to decrease the complexity of the multiplex primer mix if this should be desired. This two-step approach can for example be applied to amplify and link heavy chain variable region and kappa light chain variable region encoding sequences in one tube and heavy chain variable region and lambda light chain variable region encoding sequences in a different tube utilizing the same template. A single cell usually only expresses one of the light chains. However, it will often be easier to perform the reactions simultaneously instead of awaiting the result of one of the reactions before performing the other. Further, the amplification of both kappa and lambda serves as an internal negative control, since it would be expected that only kappa or lambda amplify from a single cell.
  • In the single-step multiplex RT-PCR procedure, reverse transcription and multiplex PCR amplification is carried out in the same vessel. All the components necessary to perform both the reverse transcription and the multiplex PCR in a single step are initially added into the vessels and the reaction is performed. Generally, there is no need to add additional components once the reaction has been started. The advantage of single-step multiplex RT-PCR amplification is that it reduces the number of steps necessary to generate the linked nucleotide sequences of the present invention even further. This is particularly useful when performing multiplex RT-PCR on an array of single cells, where the same reaction needs to be carried out in a plurality of vessels. Single-step multiplex RT-PCR is performed by utilizing the reverse primers present in the multiplex primer mix needed for the multiplex PCR amplification as primers for the reverse transcription as well. Generally, the composition needed for the single-step multiplex RT-PCR comprises a nucleic acid template, an enzyme with reverse transcriptase activity, an enzyme with DNA polymerase activity, deoxynucleoside triphosphate mix (dNTP mix comprising dATP, dCTP, dGTP and dTTP) and a multiplex primer mix. The nucleic acid template is preferably total RNA or mRNA derived from an isolated single cell either in a purified form, as a lysate of the cell or still within the intact cell. Generally, the exact composition of the reaction mixture requires some optimization for each multiplex primer mixture to be used with the present invention. This applies both for the two-step and the single-step multiplex RT-PCR procedures.
  • For some single-step multiplex RT-PCR reactions it may be an advantage to add additional components during the reaction, for example, addition of the polymerase following the RT step. Other components could for example be a dNTP mixture or a multiplex primer mix possibly with a different primer composition. This can then be considered as a one-tube multiplex RT-PCR, which generally has the same advantages as the single-step multiplex RT-PCR, since it also limits the number of tubes necessary to obtain the desired linked products.
  • The nucleotide sequences of interest, amplified by the multiplex RT-PCR, can be linked to one another by several methods, such as multiplex overlap-extension RT-PCR, ligation or recombination, using different multiplex primer mixes. Preferably the multiplex RT-PCR amplification and linkage process is a single step or a two step process. However, the linkage process may also be performed as a multi step process, using for example a stuffer fragment to link the nucleic acid sequences of interest, either with PCR, ligation or recombination. Such a stuffer fragment may contain cis-elements, promoter elements or a relevant coding sequence or recognition sequence. In a preferred embodiment the linkage process is performed in the same vessel as the multiplex RT-PCR amplification.
  • In one embodiment of the present invention the linkage of a plurality of non-contiguous nucleotide sequences of interest is performed in association with the multiplex PCR amplification, utilizing a multiplex overlap-extension primer mix. This results in the combined amplification and linkage of the target sequences. Generally, the composition needed for the multiplex overlap-extension PCR comprises a nucleic acid template, an enzyme with DNA polymerase activity, deoxynucleoside triphospate mix (dNTP mix comprising dATP, dCTP, dGTP and dTTP) and a multiplex overlap-extension primer mix.
  • In a particular embodiment of the present invention, the linkage of a plurality of non-contiguous nucleotide sequences of interest is performed by multiplex overlap-extension RT-PCR using a template derived from an isolated single cell or a population of isogenic cells. Further, the method comprises an optional step of performing an additional molecular amplification of linked products. Preferably, the multiplex overlap-extension RT-PCR is performed as a single-step/one-tube reaction.
  • A multiplex overlap-extension primer mix of the present invention comprises at least two primer sets capable of priming the amplification and linkage of at least two variable region encoding sequences, for example, amplification and linkage of sequences from immunoglobulin heavy chain variable region families with kappa or lambda light chain variable region families, or amplification and linkage of sequences from T cell receptor families α, β, γ, or δ.
  • In another embodiment of the present invention the plurality of nucleotide sequences of interest, amplified by multiplex RT-PCR, are linked by ligation. To achieve this, the multiplex primer mix used for the multiplex RT-PCR is designed such that the amplified target sequences can be cleaved with appropriate restriction enzymes, and covalent linkage by DNA ligation can be performed (the primer design is described in the section "Primer Mixtures and Design"). Following multiplex RT-PCR amplification with such a multiplex primer mix, the restriction enzymes needed to form compatible ends of the target sequences are added to the mixture together with the ligase. No purification of the PCR products is needed prior to this step, although purification may be performed. The reaction temperature for the combined restriction cleavage and ligation is approximately between 0 and 40°C. However, if the polymerase from the multiplex PCR reaction is still present in the mixture, an incubation temperature below room temperature is preferred, most preferred are temperatures between 4 and 16°C.
  • In yet another embodiment of the present invention, the plurality of nucleotide sequences of interest, amplified by multiplex RT-PCR, are linked by recombination. In this approach, the target sequences amplified can be joined using identical recombination sites. Linkage is then performed by adding the recombinases facilitating recombination. Some suitable recombinase systems are Flp recombinase with a variety of FRT sites, Cre recombinase with a variety of lox sites, integrase ΦC31 which carries out recombination between the attP site and the attB site, the β-recombinase-six system as well as the Gin-gix system. Linkage by recombination has been exemplified for two nucleotide sequences (VH linked with VL) (Chapal, N. et al. 1997 BioTechniques 23, 518-524).
  • In a preferred embodiment of the present invention, the nucleotide sequences of interest comprise variable region encoding sequences and the linkage generates a cognate pair of variable region encoding sequences. Such a cognate pair may comprise one or more constant region encoding sequences in addition to the variable regions. Preferably the constant regions are of human origin and the variable region cognate pair is of different origin, such as mouse, rat, or rabbit.
  • In an even more preferred embodiment of the present invention, the nucleotide sequences of interest comprise immunoglobulin variable region encoding sequences and the linkage generates a cognate pair of light chain variable region and heavy chain variable region encoding sequences. Such a cognate pair may comprise one or more constant region encoding sequences in addition to the variable regions. Further, such a cognate pair may be isolated from template derived from cells of the B-lymphocyte lineage enriched from a lymphocyte-containing cell fraction, such as whole blood, mononuclear cells or white blood cells.
  • In another embodiment of the present invention, the nucleotide sequences of interest comprise TcR variable region encoding sequences and the linkage generates a cognate pair of α chain variable region and β chain variable region encoding sequences or γ chain variable region and δ chain variable region encoding sequences. Such a cognate pair may comprise one or more constant region encoding sequences in addition to the variable regions. Further, such a cognate pair may be isolated from template derived from cells of the T-lymphocyte lineage enriched from a lymphocyte-containing cell fraction, such as whole blood, mononuclear cells or white blood cells.
  • Another aspect of the present invention is to utilize the multiplex RT-PCR with a population of genetically diverse cells as template source. The majority of heteromeric protein encoding sequences do not vary from cell to cell as is the case with variable region encoding sequences from binding proteins. Thus, when utilizing the present invention for the cloning of such non-variable heteromeric protein encoding sequences there is no need to perform an initial isolation of single cells.
  • In this embodiment of the present invention, a plurality of non-contiguous nucleotide sequences of interest are linked randomly by a method comprising performing multiplex RT-PCR amplification of nucleotide sequences of interest using a template derived from a population of genetically diverse cells and effecting linkage of the amplified nucleotide sequences of interest. Further, the method comprises an optional step of performing an additional amplification of the linked products. As with the single cell approach the linkage can either be performed utilizing a multiplex overlap-extension primer mix for the amplification or alternatively by ligation or recombination. Preferably the template derived from the population of cells is not strictly contained within the cells. The population of cells may for example be lysed.
  • Application of the process of random linkage on a population of cells expressing variant binding proteins allows for a simplified generation of combinatorial libraries of variable region encoding sequences. Preferably, the population of cells constitutes cells that express variable region binding proteins, such as B lymphocytes, T lymphocytes, hybridoma cells, plasma cells, plasmablasts, or a mixture of these cells.
  • The population of cells in the above mentioned embodiment can for example be permeabilized or lysed, without additional purification, or the template nucleic acids can be isolated from the cells by standard procedures. The single-step multiplex RT-PCR procedure is preferred. However, the two-step procedure may also be used in the embodiment.
  • An efficient way to increase the specificity, sensitivity, and yield of the multiplex RT-PCR-linkage process is by performing an additional molecular amplification of the linked nucleotide sequences obtained from the multiplex RT-PCR followed by linkage by ligation or recombination or linkage using the multiplex overlap-extension RT-PCR. This additional amplification is preferably performed with PCR amplification, utilizing a primer mix adapted for amplifying the linked nucleic acid sequences of interest. The primer mix utilized may be the outer primers of the multiplex primer mix or multiplex overlap-extension primer mix, meaning the primers which anneal to the outermost 5' end and 3' end of the sense strand of the linked variable region encoding sequences, thereby enabling the amplification of the entire linked product. The outer primers can also be described as the primers of the multiplex overlap-extension primer mixture that do not contain overlap extension tails. Alternatively, a nested or semi-nested primer set can be used for the additional amplification of the linked nucleotide sequences. Such a nested PCR especially serves to increase the specificity of the method as well as to increase the amount of linked product. For the present invention, semi-nested PCR (as described in the section entitled Primer Mixtures and Design) is considered to function as well as the nested PCR. Thus, it is desired although not necessary for the present invention to perform an additional PCR amplification of the linked products from the multiplex overlap-extension RT-PCR or of the products linked by ligation or recombination, preferably using nested PCR or semi-nested PCR.
  • The additional amplification can either be performed directly using a fraction or the entire multiplex overlap-extension RT-PCR reaction product or ligation product or recombination product, or a fraction of any one of these products or using partially purified linked products from any one of these reactions, e.g. by performing an agarose gel electrophoresis of the linked products, and excising the fragment corresponding to the expected size of the linked variable region encoding sequences. For products linked by multiplex overlap-extension RT-PCR, the additional amplification is preferably performed directly on a fraction from the multiplex overlap-extension RT-PCR reaction, since this would assist linkage of the individual target sequences that were not linked in the first reaction.
  • Sequences of interest
  • The nucleotide sequences of interest of the present invention can be selected from sequences that encode different subunits or domains, which when expressed, form a protein or part of a protein. Such proteins that are composed of at least two non-identical subunits are known as heteromeric proteins. Heteromeric proteins are common in all kinds of species. Some of the classes to which such proteins belong are for example enzymes, inhibitors, structural proteins, toxins, channel proteins, G-proteins, receptor proteins, immunoglobulin superfamily proteins, transportation proteins etc. The nucleotide sequences encoding such heteromeric proteins are non-contiguous, meaning for example that they originate from different genes, or different mRNA molecules. However, non-contiguous as used in the present invention may also mean nucleotide sequences encoding domains of the same protein, where the domains are separated by nucleotide sequences which are not of interest.
  • In one embodiment of the present invention the nucleotide sequences of interest contain variable region encoding sequences from the immunoglobulin superfamily, such as immunoglobulins (antibodies), B cell receptors and T cell receptors (TcRs). Especially variable region encoding sequences from immunoglobulins are of interest. Such variable region encoding sequences comprise full-length antibodies as well as Fab's, Fv's, scFv's and combinations of fragments of the variable region encoding sequences, e.g. complementarity determining regions (CDR's), joining genes or V-genes or combinations of these. Generally the present invention can be applied to any combinations of variable region encoding sequences and fragments thereof. The present application enables the linkage of only the variable domains of the heavy and light chains generating Fv or scFv encoding sequences, or the linkage of the entire light chain with the heavy chain variable region + constant region domain CH1 + parts of the hinge region, generating Fab, Fab' or F(ab)2. Further, it is possible to add any region of the heavy chain constant region domains to the variable heavy chain, thereby generating full-length antibody encoding sequences or truncated antibody encoding sequences. In one aspect of the invention the non-human variable sequences are linked to human constant regions to generate full chimeric human/non-human antibodies, preferably chimeric antibodies with human constant regions.
  • In a further embodiment of the present invention variable region encoding sequences comprise one type of immunoglobulin light chain (kappa or lambda) encoding sequence and one immunoglobulin heavy chain variable region encoding sequence. This is achieved by selecting primers amplifying only one isotype of light and heavy chain. The isotype can also be determined by linking or splicing human constant regions from one or more particular isotypes of heavy and light chain.
  • Variable region encoding sequences derived from T cell receptors (TcR's) are also of interest. Such TcR encoding sequences comprise encoding sequences for full-length alpha and beta chains or gamma and delta chains as well as soluble TcRs or only the variable domains of these chains or single chain fusion proteins thereof (e.g. single chain αβ or single chain γδ).
  • Template sources
  • One feature of the present invention is the ability to link nucleotide sequences derived from an isolated single cell, a population of isogenic cells, or a genetically diverse population of cells which have not been separated into single vessels.
  • A preferred feature of the present invention is the use of isolated single cells or a population of isogenic cells as template source, since scrambling of the nucleic acid sequences of interest, in particular variable region encoding sequences is avoided. This is of importance if one wishes to obtain an original pair of for example variable region encoding sequences.
  • Another preferred feature of the present invention is obtaining a single cell or population of single cells from a cell fraction comprising lymphocytes, such as B lymphocytes, T lymphocytes, plasma cells and/or various developmental stages of these cell lineages. Other populations of cells that express binding proteins from the immunoglobulin superfamily might also be used to obtain single cells. Cell lines such as hybridoma cells, cell lines of B lymphocyte or T lymphocyte lineage or virus immortalized cell lines or donor derived cells participating in the immune response are also applicable in the present invention. Donor derived lymphocyte-containing cell fractions may be obtained from natural tissue or fluid which is rich in such cells, e.g. blood, bone marrow, lymph nodes, spleen tissue, tonsil tissue or from infiltrations in and around tumors or inflammatory tissue infiltrations. Preferably, in the case of non-human animals, spleen tissue or bone marrow is used. Donors can either be naïve or hyperimmune with respect to a desired target. For the isolation of antigen binding proteins with binding specificities toward a desired target, hyperimmune donors are preferred. Such hyperimmune donors can either be donors immunized with the target, or fragments of the target, or can be convalescent patients, or non-healthy individuals with a natural immune response towards the target, e.g. autoimmune patients, cancer patients, patients with infectious diseases e.g. HIV patients, Hepatitis A, B or C patients, SARS patients etc., or patients with chronic diseases. However, in a particularly preferred embodiment, the donor is a non-human animal that has been immunised with a human self-antigen, such as a human protein implicated in cancer, such as EGFR.
  • For use in the present invention, cell donors may be of the same species as the species to be treated with the products obtainable from the linked nucleotide sequences of the present invention. Preferably, a cell donor is a domestic animal, a pet, or a human. A special feature of the present invention is that it enables the generation of chimeric human/non-human antibody libraries for generation of chimeric antibodies for use in human therapy. Such an approach is preferred when the antibodies are directed against so-called self-antigens, i.e. human antigens.
  • The donor may also be a transgenic animal, in particular a transgenic mouse. Transgenic animals carrying human immunoglobulin loci are described in U.S. Patent No. 6,111,166 and Kuroiwa, Y. et al. Nature Biotechnology; 2002; 20: 889-893. Such transgenic animals are capable of producing human immunoglobulins. Thus, fully human antibodies against a specific target can be raised by usual immunization techniques of such transgenic animals. This allows for generation of libraries encoding binding proteins with specificities towards more difficult targets such as human antigens to which no or limited natural human antibody response exists. Such transgenic animals can likewise be developed to produce human T cell receptors.
  • In a further embodiment of the present invention, the lymphocyte-containing cell fraction is constituted of whole blood, bone marrow, mononuclear cells, or white blood cells obtained from a donor. Mononuclear cells can be isolated from blood, bone marrow, lymph nodes, spleen, infiltrations around cancer cells and inflammatory infiltrations. Mononuclear cells can be isolated by density centrifugation techniques, e.g. Ficoll gradients. If the mononuclear cells are isolated from samples composed of tissue, the tissue is disintegrated before the gradient centrifugation is performed. Disintegration can be performed, for example, by mechanical methods such as grinding, electroporation and/or by chemical methods such as enzymatic treatments. The isolation of white blood cells can be performed directly from donors using leukopheresis. Raw preparations of for example bone marrow or tissue which contains lymphocytes can also be used in the present invention. Such preparations will need to be disintegrated, for example as described above, in order to facilitate single cell distribution.
  • A further feature of the present invention is enrichment of the lymphocyte-containing cell fraction e.g. whole blood, mononuclear cells, white blood cells or bone marrow, with respect to a particular lymphocyte population, such as cells from the B lymphocyte or T lymphocyte lineage. Enrichment of B lymphocytes can for example be performed using magnetic bead cell sorting (MACS) or fluorescence activated cell sorting (FACS) taking advantage of lineage-specific cell surface marker proteins such as CD19 or other B cell lineage-specific markers such as B220. Enrichment of T lymphocytes can for example be performed utilizing a cell surface marker such as CD3 or other T cell lineage-specific markers.
  • A preferred feature of the present invention is to sort the enriched B lymphocytes further in order to acquire plasma cells, before distributing the cells individually among a plurality of vessels. Isolation of plasma cells is generally performed by MACS sorting or FACS sorting, utilizing surface markers such as CD19. Other plasma cell-specific surface markers or combinations thereof can be utilized as well, for example CD138, CD43, CD19, MHC-II, the exact choice of marker depending on the plasma cell source, e.g. spleen, tonsils, blood or bone marrow. Of course, the exact choice of surface markers also depends on the species from which the cells are isolated.
  • In one aspect of the invention, the markers used for sorting and/or selection of cells are CD43 and CD138 or MHCII and B220 or orthologs. Preferably, the combination of markers is CD43 and CD138, and preferably the selected cells have an intermediate or high expression of these markers relatively to the lymphocyte-comprising cell population from which they are selected or isolated. More preferably the level of expression of CD43 and CD138 is high relative to the lymphocyte-comprising cell population from which they are selected or isolated.
  • Plasma cells can also be obtained from a non-enriched lymphocyte-containing cell population obtained from any of these sources. The plasma cells isolated from blood are sometimes called early plasma cells or plasmablasts. In the present invention these cells are also termed plasma cells. Plasma cells are desired for the isolation of cognate pairs of immunoglobulin encoding sequences because a higher frequency of these cells produces antigen-specific antibodies that reflect the acquired immunity toward the desired antigen and most of the cells have undergone somatic hypermutation and therefore encode high-affinity antibodies. Further, the mRNA levels in plasma cells are elevated compared to the remaining B lymphocyte population, thus the reverse transcription procedure is more efficient when using single plasma cells. As an alternative to plasma cell isolation, memory B cells may be isolated from a lymphocyte containing cell fraction utilizing a cell surface marker such as CD27 and IgG.
  • An alternative feature of the present invention is selecting the enriched B lymphocytes for antigen specificity before distributing the cells among a plurality of vessels. Isolation of antigen-specific B lymphocytes is performed by contacting the enriched B lymphocytes with the desired antigen or antigens enabling binding of antigen to surface exposed immunoglobulin, followed by isolation of binders. This can be done, for example, by biotinylation of the desired antigen or antigens followed by suitable cell sorting techniques. Plasma cells as well as B lymphocytes, non-enriched mononuclear cells, white blood cells, whole blood, bone marrow or tissue preparations can be subjected to isolation with respect to antigen specificity if this is desired.
  • Another feature of the present invention is to sort enriched T lymphocytes (e.g. CD3 positive cells) using surface markers such as e.g. CD27 to obtain a fraction of memory T cells. T lymphocytes can also be selected for MHC-antigen specificity using MHC-peptide complexes (e.g. Callan, M.F. et al. 1998. J. Exp. Med. 187, 1395-1402; Novak, E.J. et al. 1999. J. Clin. Invest 104, R63-R67).
  • As an alternative to sorting cells expressing certain surface markers, i.e. a positive selection, it is conceivable that cells NOT expressing the markers are depleted from the composition of cells, leaving the cells behind that actually express the markers.
  • A further feature of the present invention is immortalization of any of the isolated cell fractions described in the above (e.g. B lymphocytes, plasma cells, memory cells or T lymphocytes). Immortalization may for example be performed with Epstein-Barr virus (Traggiai, E., et al., 2004. ) prior to cell distribution. Alternatively, isolated single cells may be immortalized and expanded prior to reverse transcription. Traggiai et al., Nat Med. 2004 Aug;10(8):871-5.
  • A further feature of the present invention is the distribution of a population of desired cells (e.g. hybridoma cells, cell lines of B lymphocyte or T lymphocyte lineage, whole blood cells, bone marrow cells, mononuclear cells, white blood cells, B lymphocytes, plasma cells, antigen-specific B lymphocytes, memory B cells, T lymphocytes, peptide/MHC-specific T lymphocytes, or memory T cells) individually into a plurality of vessels, in order to obtain a population of isolated single cells. This isolation of single cells refers to the physical separation of cells from a population of cells in such a way that a single vessel contains a single cell, or a micro array, chip or gel matrix is loaded in a manner that produces single cells. The cells may be distributed directly into multitudes of vessels such as arrays of single vessels by limiting dilution. The single vessels utilized in the present invention are preferably those applicable in PCR (e.g. PCR tubes and 96 well or 384 well PCR plates or larger arrays of vessels). However, other vessels may also be used. When distributing single cells into a large number of single vessels (e.g. 384 well plates), a population of single cells is obtained. Such a distribution may be performed, for example, by dispensing a volume into a single vessel that on average encompasses a cell concentration of one, 0.5 or 0.3 cell, thereby obtaining vessels that on average contain a single cell or less. Since distribution of cells by limiting dilution is a statistical event, a fraction of the vessels will be empty, a major fraction will contain a single cell, and a minor fraction will contain two or more cells. Where two or more cells are present in a vessel some scrambling of the variable region encoding sequences may occur among the cells present in the vessel. However, since it is a minor event it will not affect the overall utility of the present invention. Additionally, combinations of variable region encoding sequences which do not posses the desired binding affinity and specificity will most likely not be selected and hence will be eliminated during a screening process. Therefore, minor events of scrambling will not significantly affect the final library of the present invention.
  • There are alternatives to cell distribution by limiting dilution using, for example, cell sorters such as FACS machines or robots that can be programmed to accurately dispense single cells into single vessels. These alternatives are preferable, since they are less laborious and are more efficient in uniformly obtaining a distribution of single cells into single vessels.
  • The enrichment, sorting and isolation procedures described in the above are performed such that the majority of the cells are kept intact. Rupture of cells during enrichment and sorting might result in scrambling of the variable region encoding sequences. However, this is not expected to be a problem since the frequency of rupture is expected to be low. Washing and possible RNAse treatment of the cells prior to distribution into single vessels will remove any RNA that has leaked during the process.
  • Further, when considering the above descriptions of how to distribute cells in order to obtain a population of single cells in a population of single vessels, it is not to be interpreted as an absolutely required feature that every vessel must contain a single cell. Rather, it indicates that a majority of the vessels contain single cells, e.g. the number of vessels with two or more cells is below 25% of the total amount of cells distributed, or even better it is below 10%.
  • A further feature of the present invention is the performance of a reverse transcription using template derived from cells distributed individually among a plurality of vessels.
  • For the purpose of reverse transcription (RT), in accordance with the present invention, the nucleic acids within a single cell that is to serve as template source for the RT are considered to be derived from a single cell although they have not necessarily been separated from the remaining contents of that single cell.
  • When the final distribution of the single cells to their single vessels has been performed, the single cells may be expanded in order to obtain a population of isogenic cells prior to reverse transcription. This process yields more mRNA to be used as template, which might be important if a rare target is to be amplified and linked. However, the cells should remain genetically identical with respect to the target gene during the expansion. The isolated cells or the population of isogenic cells can either be kept intact or lysed, as long as the template for the reverse transcription is not degraded. Preferentially, the cells are lysed in order to ease the following reverse transcription and PCR amplification.
  • In a different embodiment of the present invention, the disclosed multiplex overlap-extension RT-PCR method or multiplex RT-PCR followed by linkage by ligation or recombination may also be utilized on template derived from a genetically diverse population of cells which have not been separated into single vessels, but all remain together as a pool of cells. This method may be used for the generation of combinatorial libraries. Such an approach will not require the distribution of single cells. However, the cells which may be used in this approach are the same as those described for the single cell approach, for example a population (pool) of sorted B lymphocytes or T lymphocytes. When performing the single-step multiplex overlap-extension RT-PCR or single-step multiplex RT-PCR followed by linkage by ligation or recombination on such a population of cells, it is preferable to lyse the cells prior to the reaction and if desired total RNA or mRNA may be isolated from the lysate. The sensitivity of the single-step multiplex overlap-extension RT-PCR of the present invention enables the use of a very low amount of template, e.g. an amount of template corresponding to the lysate of a single cell.
  • Primer Mixtures and Design
  • The primer mixtures of the present invention comprise at least four primers that form primer sets two by two, which are capable of amplifying at least two different target sequences of interest. Mixtures of two or more of such primer sets constitute a multiplex primer mix. Preferably, a multiplex mix comprises at least 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20, 30, 40, 50, 60, 70, 80, 90 100, 110, 120, 130, 140 or 150 primer sets (primer pairs). In particular for the amplification of variable region encoding sequences, an individual primer set within the multiplex primer mix may constitute several more than two primers. Preferably, an individual primer set comprises at least 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 25, 30, 35, 45, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 220, 240, 260, 280 or 300 primers. Preferably the total number of primers in a multiplex primer mix is at least 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 25, 30, 35, 45, 50, 60, 70, 80, 90, 100, 125, 150 or 200 and at the most 225, 250, 275, 300, 325, 350, 375 or 400 primers.
  • All the primers of the present invention comprise a gene-specific region, and preferably all primers are additionally equipped with a primer tail at the 5' end of the primer, i.e. 5' noncoding sequences which are fused to the 3' end of the gene-specific primer part. Such a primer tail is approximately from 6 to 50 nucleotides long, but it may also be longer if desired. Upon amplification the primer tails are added to the target sequences.
  • Primer tails of the present invention are for example, cloning tails and linkage tails such as, tails adapted for linkage by ligation, tails adapted for linkage by recombination or overlap-extension tails.
  • Cloning tails may be from 6 to 20 nucleotides long or longer and comprise restriction sites and/or recombination sites, which are useful for the insertion of the linked product into an appropriate vector.
  • To enable linkage by ligation, the primer sets of the multiplex primer mix are designed such that one part (forward or reverse primer(s)) of the first primer set is equipped with a linkage-tail containing a restriction site that upon cleavage will be compatible with a restriction site located in the linkage tail of one part of the second primer set. For linkage of more than two target sequences, the second part of the second primer set is equipped with a restriction site that upon cleavage will be compatible with a restriction site located in one part of the third primer set. This second restriction site located in the second primer set should be non-compatible with that of the first primer set. A considerable number of target sequences can be linked by designing primer sets this way. Restriction sites with a low frequency or no occurrence in the target sequences should be chosen. Further, it is preferable that compatible restriction sites are not identical, such that the site of ligation becomes cleavage-resistant for the particular restriction enzymes used. This will drive the reaction towards linkage of target sequence one with target sequence two, since linkage between identical target sequences will be cleavable by the restriction enzymes. Suitable pairs of restriction sites are, for example, Spel with Xbal (alternatively Nhel or Avrll can substitute one or both of these), Ncol with BspHI, EcoRI with Mfel or Pstl with Nsil. For linkage, Spel can for example be located in target sequence one, Xbal can be located in target sequence two, Ncol can be located at the other end of target sequence two and BspHI in target sequence three and so forth. To simplify the process further, it is an advantage if the restriction enzymes function in the same buffer.
  • To enable linkage by recombination the primer sets of the multiplex primer mix can for example be designed as exemplified in the article by Chapal (1997 BioTechniques 23, 518-524).
  • To enable the linkage of the nucleotide sequences of interest in the same step as the multiplex PCR amplification, tails adapted for overlap-extension PCR are added to at least one primer of each primer set of the multiplex primer mix, thereby generating a multiplex overlap-extension primer mixture.
  • The overlap-extension tails are typically longer, ranging from 8 to 75 nucleotides in length, and may contain restriction sites or recombination sites which allow for subsequent insertion of regulatory elements such as promoters, ribosomal binding sites, termination sequences, or linker sequences such as in a scFv. The overlap-extension tail may also contain a stop codon if that is desired. Generally there are three types of overlap-extension tails, as illustrated in Figure 1 of WO 2005/042774 . In type I the overlap-extension tails of two primer sets solely overlap with each other. Not necessarily all of the nucleotides of two overlap-extension tails are complementary to each other. In one aspect of the present invention the complementary nucleotides represent between 60 to 85% of the overlap-extension tail. In type II overlap-extension tails, 4 to 6 of the 5' nucleotides are complementary to the gene-specific region of the adjacent target sequence. In type III overlap-extension tails, the entire overlap is complementary to the adjacent target sequence. The type I and II overlap-extension tails are preferred when regulatory elements and the like are later to be inserted between the linked target sequences. Type II overlap-extension tails are preferred if the target sequences are to be linked by a defined linker as seen with scFv. Type III overlap-extension tails are preferred if the target sequences are to be linked in-frame to each other. Design of overlap-extension tails is dependent on sequence features such as length, relative GC content (GC%), presence of restriction sites, palindromes, melting temperature, the gene-specific part to which they are coupled etc. The length of the overlap-extension tails should be between 8 and 75 nucleotides long, preferably they are from 15 to 40 nucleotides long. Even more preferred they are from 22 to 28 nucleotides long. The use of very long overlap-extension tails (50 to 75 nucleotides) could favor the linkage of the products produced by each primer set. However, the proportion between the length of the overlap-extension tail and the gene-specific region probably will need to be adjusted when using very long overlap-extension tails. The GC% preference is dependent on the length of the overlap-extension tail. Since shorter tails have a shorter area where they are complementary they need a higher GC% to strengthen the interaction than longer tails. Other principles of primer design should likewise be observed, e.g. primer dimerization and hairpin formation should be minimized. Neither shall they engage in false priming. Further, it is known that Taq DNA polymerase often adds an adenosine (A) at the 3' end of the newly synthesized DNA strand, and this can be accommodated for in overlap-extension tail design by enabling overlap-extension tails to accommodate 3' non-template A addition.
  • The choice of primers that carry the linkage tail, e.g. the overlap-extension tail, tail adapted for linkage by ligation or tail adapted for linkage by recombination, defines the order and direction of linkage of the target sequences. It is not essential to the invention whether it is the forward primer(s) or reverse primer(s) of a primer set or possibly both forward and reverse primers that are equipped with the linkage tail. However, some consideration should be given to this anyway since the order and direction of the target sequences in the final product might be of relevance e.g. for the insertion of regulatory elements such as promoters and termination sequences or for the in-frame linkage of the individual target sequences.
  • For the linkage of two nucleotide sequences of interest the linkage tail may be added either to the reverse primer(s) or forward primer(s) of each primer set used for the PCR amplification of each target sequence.
  • The present invention exemplifies addition of overlap-extension tails and tails adapted for linkage by ligation, to the mVH and mVK forward primers of each set. This results in a linking direction of the products that is 5' to 5' (head-to-head and bi-directional). However, linkage tails might as well be added to the reverse primer(s) of each set. This results in a linking direction of the product that is 3' to 3' (tail-to-tail and bi-directional). A third option is adding the linkage tails to the reverse primer(s) of the first primer set and the forward primer(s) of the second primer set or visa versa. This results in a 3' to 5' orientation (head-to-tail and unidirectional).
  • When linking more than two nucleotide sequences of interest some of the primer sets need to have linkage tails on both the forward and reverse primers, such that one tail is complementary to a tail of the preceding primer set and the other tail is complementary to one of the primers of the subsequent primer set. This principle holds for all the primer sets that amplify target sequences that are to be linked between two other target sequences.
  • The design of the gene-specific primer part generally should observe known primer design rules such as minimizing primer dimerization, hairpin formation and non-specific annealing. Further, multiple G or C nucleotides as the 3' bases are to be avoided when possible. The melting temperature (Tm) of the gene-specific regions in a primer set should preferably be equal to each other plus/minus 5°C. In the present invention Tm values between 45°C and 75°C are desirable and Tm values of about 60°C are optimal for most applications. Advantageously, the initial primer design can be aided by computer programs developed for this task. However, primer designs generally need laboratory testing and routine optimization. This may be done, for example, by analyzing size, restriction fragment length polymorphism (RFLP) and sequencing of the amplification products obtained using the primer sets. The use of degenerate positions within primers is a useful approach when amplifying sequences with variable regions or when searching for new family members belonging to a specified class of proteins. The numbers of degenerate positions may also require optimization.
  • One feature of the present invention is primer mixes composed of at least two primer sets that are able to prime amplification and promote linkage of at least two nucleotide sequences of interest. The primer mixes of the present invention are capable of priming the amplification of at least two subunits or domains from heteromeric proteins, e.g. belonging to the class of enzymes, inhibitors, structural proteins, toxins, channel proteins, G-proteins, receptor proteins, immunoglobulin superfamily proteins, transportation proteins etc, preferably immunoglobulins.
  • A further feature of the present invention is a multiplex overlap-extension primer mix comprising primer sets wherein at least one primer set member of each primer set comprises an overlap-extension tail capable of hybridizing to the overlap-extension tail of a primer set member of a second primer set.
  • The overlap-extension tails enables the immediate linkage of the nucleotides of interest during the multiplex overlap-extension PCR amplification by equipping each individual product arising from the primer sets with a tail that is complementary to an adjoining product. This however does not mean that the linkage necessarily occurs during this first PCR amplification. Depending on the reaction setup, the majority of the actual linkage may be performed during an additional amplification with the outer primers of the first PCR amplification (multiplex PCR amplification).
  • A further feature of the present invention is a primer set designed to amplify a family of nucleotide sequences containing variable region encoding sequences. Examples of such families are kappa light chains (e.g. VK1-19 in mice), lambda light chains (e.g. VL1-8 in mice) and variable heavy chains (e.g. VH1-15 in mice) from immunoglobulins, and α, β, γ or δ TcR variable regions. A primer set for the amplification of a family of nucleotide sequences containing variable region encoding sequences often comprises a plurality of primers where several primers can be degenerate primers. Amplification of families of immunoglobulin light chain variable region encoding sequences is for example performed using a primer set comprised of a plurality of primers complementary to the 5' end of the variable region of the kappa chain (mVK primer(s)) or the kappa leader sequence and/or the lambda chain or the lambda leader sequence (forward primers) together with constant region kappa (mKappar1 primer(s)) and/or lambda primers (reverse primers) or a plurality of such primers. Alternatively, light chain joining region primers may be used as reverse primers instead of the constant region primers. Alternatively, forward primers annealing in the UTR region preceding the leader sequence of the variable light chain may be used. Equally, families of immunoglobulin heavy chain variable region encoding sequences can be amplified with one primer set utilizing various primer combinations, for example a plurality of primers complementary to the 5' end of the heavy chain variable region (mVH primer(s)) or the leader sequence of this region (forward primers) together with a plurality of heavy chain joining region primers or heavy chain constant region primer(s) (reverse primers). The mCH primer may be isotype-specific and in principle any mCH primer can be utilized, also one that would result in a full-length heavy chain. Preferably a mCH primer is used that does not amplify the full length heavy chain to enable addition of a human heavy chain constant region. Alternatively, forward primers annealing in the UTR region preceding the leader sequence of the variable heavy chain may be used.
  • The use of forward primers annealing in the leader sequence instead of the 5' end of the variable region is particularly useful if cross-hybridization due to high degree of sequence similarity is observed for the variable region primers. Mutations due to cross-hybridization using leader-primers will be eliminated from the final protein because leader sequences are cleaved off during protein processing within the cell.
  • One feature of the present invention is primers which anneal in the 3' end of the leader encoding sequence preceding a variable region encoding sequence, and their use for amplification of variable region encoding sequences.
  • In one embodiment of the present invention, the multiplex overlap-extension primer mix utilized for the multiplex overlap-extension PCR and possibly for the reverse transcription step as well comprises:
    1. a) at least one mKappar1 or hmJK primer complementary to the sense strand of an immunoglobulin light chain region encoding sequence;
    2. b) at least one mVK primer complementary to the antisense strand of an immunoglobulin light chain variable region encoding sequence or light chain variable region leader sequence, and capable of forming a primer set with the primer(s) in step a);
    3. c) at least one mCHrev1, mHCrev1-ext, or mJH primer complementary to the sense strand of an immunoglobulin heavy chain domain encoding sequence; and
    4. d) at least one mVH primer complementary to the antisense strand of an immunoglobulin heavy chain variable region encoding sequence or heavy chain variable region leader sequence, and capable of forming a primer set with the primer(s) in step c).
  • In one embodiment of the present invention the light chain primers are adapted for amplifying both kappa and lambda light chain variable region encoding sequences.
  • In a further embodiment of the present invention, the immunoglobulin mVK primers carry linkage tails, preferably in the form of complementary overlap-extension tails. This generates variable region encoding sequences that are linked in a head-to-head fashion. For the linkage of variable region encoding sequences in a head-to-tail fashion, either the mKappar1 and mVH primers contain linkage tails or the mVK and mCHrev1 primers contain linkage tails, preferably in the form of complementary overlap-extension tails. For the linkage of variable region encoding sequences in a tail-to-tail fashion, mCH and mKappar1 primers contain linkage tails, preferably in the form of complementary overlap-extension tails.
  • Preferentially, the multiplex primer mixes, including multiplex overlap-extension primer mixes, of the present invention comprise two primer sets. Thus, a multiplex primer mix comprises at least four different primers. In a further aspect of the present invention a multiplex primer mix comprises more than four different primers. A multiplex primer mix of the present invention is used for the amplification of target sequences in a single vessel. For example kappa, lambda and heavy chain variable regions may be amplified in the same vessel.
  • The present invention also encompasses primers for an additional PCR amplification of the linked products obtained by multiplex RT-PCR followed by linkage by ligation or recombination or by multiplex overlap-extension RT-PCR. This additional PCR amplification can be performed using a primer mix adapted for amplifying the linked target sequences. Such a primer mix may comprise the outer primers of the multiplex primer mix or multiplex overlap-extension primer mix, meaning the primers that anneal to the outermost 5 end and 3' end of the sense strand of the linked nucleotide sequences, thereby selectively enabling the amplification of the entire linked product. This process generally serves to increase the amount of linked product obtained from the multiplex RT-PCR followed by linkage by ligation or recombination or from the multiplex overlap-extension RT-PCR.
  • Alternatively, a primer set which is nested compared to the outer primers used in the primary multiplex RT-PCR or multiplex overlap-extension RT-PCR reaction can be used for the additional amplification of the linked nucleotide sequences. In the present invention such a primer set is termed a nested primer set. The design of nested primers generally observes the same design rules as for the gene-specific primers previously described, except that they prime partly or totally 3' to the annealing position of the outer primers used in the multiplex RT-PCR or multiplex overlap-extension RT-PCR. The product resulting from a nested PCR may therefore be shorter than the linked product obtained by the multiplex RT-PCR followed by linkage by ligation or recombination or by multiplex overlap-extension RT-PCR. In addition to increasing the amount of linked product, the nested PCR further serves to increase the overall specificity, especially of the multiplex overlap-extension RT-PCR technology. However, it should be noted that not all multiplex primer mixes/multiplex overlap-extension primer mixes that have been described previously are suitable for combination with a nested primer set when performing the additional amplification. In such cases the outer primers of the multiplex primer mix/multiplex overlap-extension primer mix can be used for the additional amplification or a semi-nested PCR can be applied as described later.
  • In one embodiment of the present invention, a mixture of JL and JH primers is used as nested primers for the additional amplification of the linked immunoglobulin variable region encoding sequences.
  • Nested primer sets of the present invention can also be comprised of a reverse (or forward) outer primer(s) from the first multiplex primer mix/multiplex overlap-extension primer mix and a second nested primer that prime 3' to the annealing position of the forward (or reverse) outer primer(s) of the first multiplex primer mix/multiplex overlap-extension primer mix. The use of such a primer set for an additional PCR amplification is generally known as a semi-nested PCR. Semi-nested PCR can for example be applied if it is difficult to design a nested primer in one specific region e.g. for the variable region sequences, because such a primer would have to anneal in the complementarity determining regions (CDRs). Further, semi-nested PCR can be used when it is desirable to keep one end of the linked sequences intact e.g. for cloning purposes.
  • Optimization of Multiplex Overlap-extension PCR
  • The parameters of the multiplex overlap-extension PCR step of both the two-step and the single-step procedure can be optimized for several parameters (see, for example, Henegariu, O. et al. 1997. BioTechniques 23, 504-511; Markoulatos, P. et al. 2002. J. Clin. Lab. Anal. 16, 47-51). Generally the same optimization parameters apply for multiplex RT-PCR, although the ratio between outer and inner primers is less important for such a reaction.
  • a. Primer Concentration
  • The concentration of the primers carrying the overlap-extension tail (for example the VH and VL primers) is preferably lower than the concentration of the outer primers without overlap-extension tail (for example JH and kappa primers).
  • If one of the target sequences amplifies with a lower efficiency than the others, for example as a result of a higher GC%, it may be possible to equalize the amplification efficacy. This may be done by using a higher concentration of the primer set which mediates amplification with low efficiency, or lowering the concentration of the other primers. For example, sequences encoding heavy chain variable regions tend to have a higher GC% and hence lower amplification efficiency than light chain variable regions. This points towards using VL primers at a lower concentration than the VH primers.
  • Further, when using a large number of primers the total primer concentration might be an issue. The upper limit is determined experimentally by titration experiments. For the "AmpliTaq Gold' PCR system from Applied Biosystems the upper limit was found to be 1.1 µM total oligonucleotide concentration, for other systems it may however be as high as 2.4 µM. Such an upper limit of total oligonucleotide concentration influences the maximal concentration of individual primers. If the individual primer concentration is too low it is likely to cause a poor PCR sensitivity.
  • The quality of the oligonucleotide primers has also been found to be important for the multiplex overlap-extension PCR. HPLC-purified oligonucleotides have produced the best results.
  • b. PCR Cycling Conditions:
  • Preferentially the cycling conditions are as follows:
    Denaturation: 10-30 s 94°C
    Annealing:
    Tm
    30-60 s 50-70°C Approximately 5°C below of primers.
    Extension: 1 min x EPL 65-72°C EPL is Expected Product Length in kb.
    Cycle number: 30-80
    Final extension: 10 min 65-72°C
  • For the single-step multiplex overlap-extension RT-PCR the following steps were built into the cycling program prior to the amplification cycling outlined above:
    Reverse transcription: 30 min 42-60°C These conditions are also used where separate reverse transcription is performed.
    Polymerase activation: 10-15 min 95°C Hot-start polymerases are favorable in single-step RT-PCR. Activation according to manufacturer.
  • It is possible to optimize all these parameters. Especially the annealing temperature is important. Thus, initially all the individual primer sets that are to constitute the final primer mix should be tested separately in order to identify optimal annealing temperature and time, as well as elongation and denaturing times. This will give a good idea about the window within which these parameters can be optimized for the multiplex overlap-extension primer mix.
  • Problems with poor PCR sensitivity, for example due to low primer concentration or template concentration, can be overcome by using a high number of thermal cycles. A high number of thermal cycles constitutes between 35 and 80 cycles, preferably around 40 cycles.
  • Further, longer extension times can improve the multiplex overlap-extension PCR process. Long extension times constitute 1.5-5 min x EPL compared to the normal 1 min extension.
  • c. Use of Adjuvants
  • Multiplex PCR reactions can be significantly improved by using a PCR additive, such as DMSO, glycerol, formamide, or betaine, which relax DNA, thus making template denaturation easier.
  • d. dNTP and MgCl2
  • Deoxynucleoside triphosphate (dNTP) quality and concentration is important for the multiplex overlap-extension PCR. The best dNTP concentration is between 200 and 400 µM of each dNTP (dATP, dCTP, dGTP and dTTP), above which the amplification is rapidly inhibited. Lower dNTP concentrations (100 µM of each dNTP) suffice to achieve PCR amplification. dNTP stocks are sensitive to thawing/freezing cycles. After three to five such cycles, multiplex PCR often does not work well. To avoid such problems, small aliquots of dNTP can be made and kept frozen at -20°C.
  • Optimization of Mg2+ concentration is critical since most DNA polymerases are magnesium-dependent enzymes. In addition to the DNA polymerase, the template DNA primers and dNTP's bind Mg2+. Therefore, the optimal Mg2+ concentration will depend on the dNTP concentration, template DNA, and sample buffer composition. If primers and/or template DNA buffers contain chelators such as EDTA or EGTA, the apparent Mg2+ optimum may be altered. Excessive Mg2+ concentration stabilizes the DNA double strand and prevents complete denaturation of DNA, which reduces yield. Excessive Mg2+ can also stabilize spurious annealing of primer to incorrect template sites, thereby decreasing specificity. On the other hand, an inadequate Mg2+ concentration reduces the amount of product.
  • A good balance between dNTP and MgCl2 is approximately 200 to 400 µM dNTP (of each) to 1.5 to 3 mM MgCl2.
  • e. PCR Buffer Concentration
  • Generally KCI based buffers suffice for multiplex overlap-extension PCR; however, buffers based on other components such as (NH4)2SO4, MgSO4, Tris-HCl, or combinations thereof may also be optimized to function with the multiplex overlap-extension PCR. Primer pairs involved in the amplification of longer products work better at lower salt concentrations (e.g. 20 to 50 mM KCI), whereas primer pairs involved in the amplification of short products work better at higher salt concentrations (e.g. 80 to 100 mM KCl). Raising the buffer concentration to 2X instead of 1X may improve the efficiency of the multiplex reaction.
  • f. DNA Polymerase
  • The present invention is exemplified with Taq polymerase. Alternatively, other types of heat-resistant DNA polymerases including, for example, Pfu, Phusion, Pwo, Tgo, Tth, Vent, or Deep-vent may be used. Polymerases without or with 3' to 5' exonuclease activity may either be used alone or in combination with each other.
  • Vectors and Libraries
  • The linkage of nucleotide sequences of interest according to the present invention produces a nucleotide segment comprising the linked nucleotide sequences coding for variable regions of immunoglobulins. Further, libraries of such linked nucleic acid sequences are produced by the methods of the present invention, in particular libraries of non-human variable region encoding sequences linked or spliced to human constant region (heavy and light chain) sequences.
  • One feature of the present invention is the insertion of a segment containing linked nucleotide sequences of interest or a library of linked nucleotide sequences of interest, generated by a method of the present invention, into suitable vectors. The libraries may be combinatorial libraries or more preferably libraries of cognate pairs of variable region encoding sequences. The restriction sites generated by the outer primers, nested primers or semi-nested primers are preferably designed to match appropriate restriction sites of the vector of choice. The linked nucleic acid sequences of interest can also be inserted into vectors by recombination, if one of the semi-nested, nested primers or outer primers is equipped with a suitable recombination site and the vector of choice contains one as well.
  • Basically there are no limitations to the vectors that can be used as carriers of the products generated by one of the multiplex RT-PCR-linkage methods of the present invention. Vectors of choice may be those suitable for amplification and expression in cells including, for example, bacteria, yeast, other fungi, insect cells, plant cells, or mammalian cells. Such vectors may be used to facilitate further cloning steps, shuttling between vector systems, display of the product inserted into the vector, expression of the inserted product and/or integration into the genome of a host cell.
  • Cloning and shuttle vectors are preferably bacterial vectors. However, the other types of vectors may also be applied in cloning and shuttle procedures.
  • Display vectors can for example be phage vectors or phagemid vectors originating from the class of fd, M13, or f1 filamentous bacteriophages. Such vectors are capable of facilitating the display of a protein including, for example, a binding protein or a fragment thereof, on the surface of a filamentous bacteriophage. Display vectors suitable for display on ribosomes, DNA, yeast cells or mammalian cells are also known in the art. These comprise for example viral vectors or vectors encoding chimeric proteins.
  • Expression vectors exist for all the mentioned species and the one to be chosen completely depends on the protein to be expressed. Some expression vectors are additionally capable of integrating into the genome of a host cell either by random integration or by site-specific integration, utilizing appropriate recombination sites. Expression vectors may be designed to provide additional encoding sequences that, when the linked product is inserted in-frame to these sequences, enable the expression of a larger protein, e.g. a full-length monoclonal antibody, when introduced into an appropriate host cell. This in-frame insertion may also facilitate the expression of chimeric proteins that facilitate the display on the surface of a filamentous bacteriophage or cell. In a bacteriophage display system, the linked nucleotide sequences of interest may be inserted in-frame to a sequence encoding a coat protein such as pIII or pvIII (Barbas, C.F. et al. 1991. Proc. Natl. Acad. Sci. USA 88, 7978-7982; Kang, A.S. et al. 1991. Proc. Natl. Acad. Sci. USA 88, 4363-4366).
  • In one embodiment of the present invention, the individual segments of linked nucleotide sequences of interest is comprised of an immunoglobulin heavy chain variable region encoding sequence associated with a light chain variable region encoding sequence from one species, inserted into a vector that contains sequence(s) encoding one or more human immunoglobulin constant domains, preferably both human light and heavy chain constant regions. The insertion is engineered such that the linked heavy chain variable region and/or light chain variable region encoding sequences are inserted in-frame to the constant region encoding sequences. Such an insertion can for example generate a Fab or F(ab')2 expression vector, a full-length antibody expression vector or an expression vector encoding a fragment of a full-length antibody. Preferentially such a vector is an expression vector suitable for expression (e.g. E. coli, phagemid, or mammalian vectors) and the constant region heavy chain encoding sequences are chosen from the human immunoglobulin classes IgGI, IgG2, IgG3, IgG4, IgM, IgA1, IgA2, IgD, or IgE, thereby enabling the expression of a Fab or full-length recombinant antibody. In addition to the constant heavy chain encoding sequences the vector may also contain a constant light chain encoding sequence chosen from human lambda or kappa chains. This is preferred in the generation of chimeric antibodies as the linked nucleotide sequences in these cases only encode the immunoglobulin variable region encoding sequences (Fv's) from the non-human species.
  • In an alternative embodiment, the human constant region(s) is/are spliced or linked to the non-human variable regions in a step of the molecular amplification procedure, by adding to the vessels a human constant region providing an overlap with the non-human sequence and appropriate primers ensuring the amplification of both variable and constant region(s) in frame. In this way the human constant kappa or lambda chain may be added and/or a human constant heavy chain may be added. By using this procedure there is no need for providing a restriction site within the coding sequence, which is an advantage In another embodiment of the present invention, the individual segments of the linked nucleotide sequences are comprised of a TcR α chain variable region encoding sequence associated with a β chain variable region encoding sequence or a γ chain variable region encoding sequence associated with a δ chain variable region encoding sequence. Preferably, these linked sequences are inserted into a vector that contains sequences encoding one or more TcR constant domains. The insertion is engineered such that the inserted linked variable region encoding sequences are in-frame to the corresponding TcR constant region encoding sequences. In a further embodiment, such a vector is a chimeric expression vector comprising sequences that encode a leucine zipper in-frame to the TcR constant regions. It has been shown that such constructs increase stability of soluble TcRs (Willcox, B.E. et al. 1999. Protein Sci 8, 2418-2423).
  • Libraries of cognate pairs of the present invention may be introduced into vectors by two different approaches. In the first approach, the single cognate pairs are inserted individually into a suitable vector. This library of vectors may then either be kept separate or be pooled. In the second approach, all the cognate pairs are pooled prior to vector insertion, followed by in-mass insertion into suitable vectors generating a pooled library of vectors. Such a library of vectors comprises a large diversity of pairs of variable region encoding sequences.
  • One aspect of the present invention is a library of antibodies with cognate pairs of linked variable region encoding sequences. Preferably the individual antibodies of the library comprise an immunoglobulin light chain variable region encoding sequence associated with a heavy chain variable region encoding sequence from one species and human constant regions.
  • Another preferred library of cognate pairs comprises linked TcR region encoding sequences, where each individual TcR region encoding sequence comprises an alpha chain variable region encoding sequence associated with a beta chain variable region encoding sequence and/or a TcR gamma chain variable region encoding sequence associated with a delta chain variable region encoding sequence.
  • An embodiment of the present invention is a sub-library of cognate pairs of linked variable region encoding sequences which encode desired binding specificities directed against a particular target. Preferably these cognate pairs comprise linked immunoglobulin light chain variable region and heavy chain variable region encoding sequences, TcR alpha chain variable region and beta chain variable region encoding sequences and/or TcR gamma chain variable region and delta chain variable region encoding sequences.
  • A further embodiment is a sub-library selected from a parent library of cognate pairs of variable region encoding sequences as described throughout the invention.
  • A preferred embodiment of the present invention is a library or sub-library encoding cognate pairs of full-length chimeric immunoglobulins selected from human immunoglobulin classes IgA1, IgA2, IgD, IgE, IgG1, IgG2, IgG3, IgG4, or IgM.
  • Another preferred feature of the present invention is a library or sub-library encoding soluble and stable cognate pairs of TcRs.
  • A feature of the present invention is the diversity of said libraries, which are comprised of at least 5, 10, 20, 50, 100, 1000, 104, 105 or 106 different cognate pair antibodies.
  • In a further embodiment of the present invention, said libraries of cognate pairs of linked variable region encoding sequences are obtainable by a method comprising the steps described herein. This library is also termed the parent library.
  • Screening and Selection
  • The parent library of pairs of linked variable region encoding sequences isolated from a donor, utilizing one of the methods of the present invention, is expected to represent a diversity of binding proteins of which some will be irrelevant, i.e. not binding to a desired target, in particular for combinatorial libraries. Therefore, the present invention encompasses enrichment and screening for a sub-library encoding a subset of diversities of binding specificities directed against a particular target.
  • For libraries of cognate pairs the diversity of the library is expected to represent the diversity present in the donor material, with only a minor number of randomly linked variable regions. Thus, an enrichment step may not be necessary prior to the screening for target-specific binding affinities in a library composed of cognate pairs.
  • In a further embodiment of the present invention, the method of generating a library of pairs of linked variable region encoding sequences further comprises creating a sub-library by selecting a subset of pairs of linked variable region sequences that encode binding proteins with a desired target specificity. Such a selection of linked variable region encoding sequences is also termed a library of target-specific cognate pairs.
  • In a preferred embodiment of the present invention, the library of target-specific cognate pairs of variable region encoding sequences is transferred to an expression vector. The expression vector may be a mammalian expression vector, a yeast expression vector, a fungus expression vector, a plant expression vector, or a bacterial expression vector depending on the type of cell used for screening. Preferably the expression vector is mammalian.
  • Immunological assays are generally suitable for the selection of target-specific immunoglobulin variable region encoding sequences. Such assays are well know in the art and constitute for example FMAT, FLISA, ELISPOT, ELISA, membrane assays (e.g. Western blots), arrays on filters, or FACS. The assays can either be performed in a direct manner, utilizing the polypeptides produced from the immunoglobulin variable region encoding sequences, or alternatively, the immunoassays can be performed in combination with or following enrichment methods such as phage display, ribosome display, bacterial surface display, yeast display, eukaryotic virus display, RNA display or covalent display (reviewed in FitzGerald, K., 2000. Drug Discov. Today 5, 253-258). Both cognate Fab expression libraries and cognate full-length antibody expression libraries can be subjected to screening, thereby generating a sub-library of positive clones. Such screening assays and enrichment procedures are also suitable for Fv or scFv fragments or combinatorial libraries of linked variable regions.
  • In addition to immunological screening, a special feature of the invention is that it enables the use of various types of functional screening to select antibody secreting clones with desired properties. Such screening assays include but are not limited to proliferation assays, virus inactivation assays, cell killing assays, etc. Preferably the functional assays can be carried out in high-throughput format using supernatants from cells transfected with expression vectors of the invention.
  • In a preferred embodiment of the present invention, the selection of a sub-library of target-specific cognate pairs or combinatorial pairs of variable region encoding sequences is performed by using a high-throughput screening assay. High-throughput screening assays could be, but are not restricted to, ELISA assays performed with semi-automated or fully automated equipment. It could also be a membrane assay in which bacteria are robotically picked and gridded onto an appropriate membrane on top of agar plates generating arrays of colonies expressing antigen-binding molecules. The molecules are secreted through the membrane onto a second underlying antigen-coated membrane which can be developed separately and used to identify clones that secrete antigen binding molecules towards the desired target (de Wildt, R.M., et al. 2000. Nat. Biotechnol. 18, 989-994).
  • When a sub-library of cognate pairs or combinatorial pairs of antigen-binding clones has been selected by an appropriate technology it is possible to perform an additional analysis by DNA sequencing of the linked immunoglobulin light chain variable region and heavy chain variable region encoding sequences. First of all such a DNA sequencing will provide information about the library diversity such as germline origin, family distribution and maturation within the CDR regions. Such an analysis will enable the selection of clones which represent a broad diversity, and leaving out repeated clones. Secondly, DNA sequencing will reveal mutations introduced during the isolation process.
  • When analyzing variable region encoding sequences there are three types of mutations to consider when assessing whether a mutation is acceptable: i) The most frequent type of mutations result from cross-priming, where a V gene primer due to sequence similarities primes a germline sequence to which it is not totally homologous. The changes introduced are mainly substitutions of naturally occurring codons at one particular position. Due to the high degree of sequence homology between V-gene sequences. Some of these changes may be significant, sometimes with no natural counterpart. Such changes could potentially affect the immunogenicity of the variable region by creating new epitopes. Such changes can easily be identified and subsequently repaired using standard molecular biological techniques or the clones can be excluded from the library; ii) Errors created by the Taq DNA polymerase are most easily identified in the constant region encoding sequences and can easily be eliminated. However, Taq induced mutations will of course also be present in the variable region encoding sequences where they are indistinguishable from the naturally occurring somatic mutations, which are also the result of random mutations in the variable region encoding sequences. Considering that the mutations are non-systematic and only affect particular pairs in distinct ways, it appears reasonable to disregard such changes.
  • In a further embodiment of the present invention, the sub-library of target-specific and possibly sequence analysed pairs of linked immunoglobulin light chain variable region and heavy chain variable region encoding sequences are transferred to a mammalian expression vector. Such a transfer can be performed into any of the vectors described in the previous section, enabling the expression of a full-length recombinant antibody. If the screening is performed with a mammalian cognate full-length antibody expression library such a transfer may not be needed.
  • In another embodiment of the present invention, the parent library is generated from a lymphocyte-containing cell fraction which is enriched for T lymphocytes. The pairs of linked variable region encoding sequences constituting the parent library may be selected for encoding a subset of pairs of linked variable region sequences composed of alpha and beta and/or gamma and delta chains that encode binding proteins with a desired target specificity, generating a sub-library of cognate pairs or combinatorial pairs. Antigen-specific T cell receptors can subsequently be identified from a pool of transfected cells using standard methodology such as staining with tetrameric MHC-peptide complexes (e.g., Callan, M.F. et al. 1998. J. Exp. Med. 187, 1395-1402; Novak, E.J. et al. 1999. J. Clin. Invest 104, R63-R67), by measuring cellular responses in the form of IL-2 release or by more sophisticated means such as yeast or retroviral display techniques.
  • Host cells and Expression
  • The libraries of the present invention can be transferred to vectors suitable for expression and production of proteins encoded from the linked nucleic acid sequences of interest, in particular variable region containing binding proteins or fragments thereof. Such vectors are described in the Vectors and Libraries section, and provide for the expression of for example full-length antibodies, Fab fragments, Fv fragments, scFv, membrane bound or soluble TcRs or TcR fragments of a species of choice.
  • One feature of the present invention is the introduction into a host cell of a library or a sub-library of vectors of cognate pairs of linked variable region encoding sequences or a single clone encoding a cognate pair of linked variable region encoding sequences, for amplification and/or expression. Host cells can be chosen from bacteria, yeast, other fungi, insect cells, plant cells, or mammalian cells. For expression purposes mammalian cells, such as Chinese hamster ovary (CHO) cells, COS cells, BHK cells, myeloma cells (e.g., Sp2/0 cells, NS0), NIH 3T3, fibroblast or immortalized human cells such as HeLa cells, HEK 293 cells, or PER.C6 are preferred.
  • The introduction of vectors into host cells may be accomplished by a number of transformation or transfection methods known to those skilled in the art, including calcium phosphate precipitation, electroporation, microinjection, liposome fusion, RBC ghost fusion, protoplast fusion, viral infection and the like. The production of monoclonal full-length antibodies, Fab fragments, Fv fragments and scFv fragments is well known.
  • The production of recombinant polyclonal antibodies to be used for treatment is a quite new area. A recombinant polyclonal manufacturing technology has been described in PCT application WO 2004/061104 . In brief, this technology involves the generation of a collection of cells, suitable as a manufacturing cell line. The following description of the technique is made for a library of cognate pairs, it is however just as applicable for a combinatorial library. The individual cells in the collection of cells are capable of expressing a distinct member of the recombinant polyclonal binding protein, for example from a library of cognate pairs. In order to ensure that the individual cells express a single cognate pair and not several cognate pairs of the polyclonal binding protein, the nucleic acid sequences encoding the cognate pairs are introduced into a single site-specific site in the genome of each individual cell. This is an important feature of the collection of cells, since this prevents scrambling of the heavy and light chains expressed from each cell, but also because it generates cells that are virtually identical to one another, except for the small differences in the variable regions of the individual cognate pairs. This trait will enable an unbiased growth of the collection of cells over the period of time necessary for the production. To ensure single site-specific integration, a host cell line with only one integration site should be used, these are commercially available, e.g. Invitrogen's CHO Flp-In cells containing a single FRT site. Appropriate vectors for this cell line contain a corresponding FRT site and are introduced into the genome using the Flp recombinase. There are several other known recombinases e.g. Cre, beta-recombinase, Gin, Pin, PinB, PinD, R/RS, lambda integrase, or phage ΦC31 integrase that can be used in combination with their corresponding recombination sites. Further, appropriate vectors contain a selection marker that enables the selection of site-specific integrants.
  • The generation of a polyclonal manufacturing cell line and the production of a recombinant polyclonal protein from such a cell line can be obtained by several different transfection and manufacturing strategies.
  • One way is to use a library of vectors mixed together into a single composition, for the transfection of a host cell line with a single integration site per cell. This method is termed bulk transfection or transfection in bulk. Generally, the vector and host cell design previously described will ensure that a polyclonal cell line capable of unbiased growth will be obtained upon appropriate selection. A frozen stock of the polyclonal cell line will be generated before initiation of the recombinant polyclonal protein manufacturing.
  • Another way is to use a library of vectors split into fractions, containing approximately 5 to 50 individual vectors of the library in a composition, for transfection. Preferably, a fraction of the library constitutes 10 to 20 individual vectors. Each composition is then transfected into an aliquot of host cells. This method is termed semi-bulk transfection. The number of aliquots transfected will depend on the size of the library and the number of individual vectors in each fraction. If the library for example constitutes 100 distinct cognate pairs, which are split into fractions containing 20 distinct members in a composition, 5 aliquots of host cells would need to be transfected with a library composition constituting a distinct fraction of the original library. The aliquots of host cells are selected for site-specific integration. Preferably, the distinct aliquots are selected separately. However, they can also be pooled before selection. The aliquots can be analyzed for their clonal diversity and only those with sufficient diversity will be used to generate a polyclonal cognate pair library stock. To obtain the desired polyclonal cell line for manufacturing, the aliquots can be mixed before generating the freezing stock, immediately after they have been retrieved from the stock or after a short proliferation and adaptation time. Optionally, the aliquots of cells are kept separate throughout production, and the polyclonal protein composition is assembled by combining the products of each aliquot rather than the aliquots of cells before production.
  • A third way is a high throughput method in which host cells are transfected separately using the individual vectors constituting the library of cognate pairs. This method is termed individual transfection. The individually transfected host cells are preferably selected for site specific integration separately. The individual cell clones generated upon selection may be analyzed with respect to proliferation time and preferably, those with similar growth rates are used to generate a polyclonal cognate pair library stock. The individual cell clones can be mixed to obtain the desired polyclonal cell line before generating the stock, immediately after they have been retrieved from the stock, or after a short proliferation and adaptation time. This approach may eliminate any possible residual sequence bias during transfection, integration and selection. Alternatively the individually transfected host cells are mixed before selection is performed, this will enable control of sequence bias due to transfection.
  • A shared feature in the manufacturing strategies outlined in the above is that all the individual cognate pairs constituting the recombinant polyclonal protein can be produced in one, or a limited number of bioreactors. The only difference is the stage at which one chooses to generate the collection of cells that constitutes the polyclonal manufacturing cell line.
  • One embodiment of the present invention is a population of host cells comprising a cognate library or sub-library of linked pairs of variable region encoding sequences.
  • In a further embodiment, a population of host cells comprises a library obtained from a population of isolated single cells constituting lymphocytes, utilizing the multiplex RT-PCR amplification followed by linkage by ligation or recombination or the multiplex overlap-extension RT-PCR technology of the present invention, to link the cognate pairs.
  • Another embodiment of the present invention is a population of host cells comprising a combinatorial library or sub-library of linked pairs of variable region encoding sequences.
  • A population of host cells according to the present invention will encompass a diverse population of cells corresponding to the diversity of the library the cells have been transformed/transfected with. Preferably, each cell of the population of cells only constitutes one cognate pair of the entire library of cognate pairs, and no individual member of the library of cognate pairs exceeds more than 50%, more preferred 25%, or most preferred 10%, of the total number of individual members expressed from the population of host cells.
  • In a preferred embodiment of the present invention, the population of host cells is mammalian cells.
  • A population of host cells as described in the above can be utilized for the expression of a recombinant polyclonal binding protein, since individual cells of the population constitute variable region encoding sequences of different diversity.
  • One embodiment of the present invention is a recombinant polyclonal protein expressed from a population of host cells comprising a library of vectors encoding diverse cognate pairs of linked variable region encoding sequences, where such a library is obtainable by the method of the present invention. Typically, a recombinant polyclonal protein of the present invention is comprised of at least 2, 5, 10, 20, 50, 100, 1000, 104, 105 or 106 proteins composed of different cognate pairs.
  • A preferred embodiment of the present invention is a recombinant polyclonal immunoglobulin expressed from a population of host cells comprising a library of vectors encoding diverse cognate pairs of heavy chain variable region and light chain variable region encoding sequences.
  • Another preferred embodiment of the present invention is a recombinant polyclonal TcR expressed from a population of host cells comprising a library of vectors encoding diverse cognate pairs of TcR alpha chain variable region linked with beta chain variable region encoding sequences and/or TcR gamma chain variable region linked with delta chain variable region encoding sequences.
  • Another embodiment of the present invention is a host cell suitable for production of a monoclonal protein. In particular a monoclonal antibody comprised of a cognate pair of a light chain variable region with a heavy chain variable region or a monoclonal TcR comprised of a cognate pair of an alpha variable region with a beta variable region or a delta variable region with a gamma variable region. Preferably such a monoclonal production cell line is not a hybridoma cell line.
  • Such a monoclonal antibody or TcR can be generated by adding the following steps to the method of linking a plurality of non-contiguous nucleotide sequences of interest a) inserting said linked nucleic acid sequences into a vector; b) introducing said vector into a host cell; c) cultivating said host cells under conditions suitable for expression; and d) obtaining the protein product expressed from the vector inserted into said host cell. Preferably, the vector introduced into the host cell encodes an individual cognate pair of variable region encoding sequences.
  • Applications of the Invention
  • One of the major applications of the present invention is the linkage of cognate pairs of variable region encoding sequences, especially immunoglobulin heavy and light chain variable region encoding sequences or TcR alpha and beta chain or gamma and delta chain variable region encoding sequences, by a high-throughput method for the generation of libraries of cognate pairs. In addition to the generation of cognate pair libraries, the multiplex RT-PCR followed by linkage by ligation or recombination or the multiplex overlap-extension RT-PCR techniques of the present invention may be utilized in the generation of combinatorial libraries of chimeric human/non-human antibodies by performing the technique on a population of genetically diverse cells, cell lysates from such a population of cells, or on RNA purified from such a population of cells. The libraries, sub-libraries, or single clones from one of these libraries facilitate the expression of polyclonal or monoclonal proteins. Especially monoclonal or polyclonal antibodies may be obtained from the libraries of the present invention.
  • The use of recombinant monoclonal antibodies in diagnostics, treatment, and prophylaxis is well known. Recombinant monoclonal and polyclonal antibodies generated by the present invention will have the same applications as antibody products generated by existing technologies. In particular, a pharmaceutical composition comprising a polyclonal recombinant immunoglobulin as active ingredient, combined with at least one pharmaceutically acceptable excipient, can be produced by means of the present invention. More preferred are pharmaceutical compositions where the polyclonal recombinant immunoglobulin is comprised of cognate pairs of variable region encoding sequences. Such pharmaceutical compositions of polyclonal recombinant immunoglobulins can be used as medicaments. The polyclonal recombinant immunoglobulin of the composition can be specific for or reactive against a predetermined disease target, and the composition can thus be used for the treatment, amelioration or prevention of diseases such as cancer, infections, inflammatory diseases, allergy, asthma and other respiratory diseases, autoimmune diseases, immunological malfunctions, cardiovascular diseases, diseases in the central nervous system, metabolic and endocrine diseases, transplant rejection, or undesired pregnancy, in a mammal such as a human, a domestic animal, or a pet.
  • Examples Example 1:
  • Balb/c mice were immunised subcutaneously with 50 µg Tetanus toxoid (TT) in complete Freund's adjuvant. Mice were boosted on day 14 with 50 µg TT in Freund's incomplete adjuvant. After additional 30 days, the mice were boosted with 50 µg TT in Freund's incomplete adjuvant. Three days after the last boost, the mice were sacrificed and the spleen taken out and transferred to a tube containing 30 ml of RPMI 1640 w/10% FCS at 4 °C. The tissue was transferred to a 74 µm cell strainer (Corning,136350-3479) in a 10 cm dish. With the back of a syringe plunger, the spleens were macerated through the filter. The filter was rinsed 10 ml RPMI 1640, 10% FCS solution. The filter was removed and the dish filled with 20 ml cold RPMI 1640, 10% FCS. Cells were transferred to 50 ml tube and centrifuged at 300 x g at 2-8 °C for 5 minutes. Cells were re-suspended in 5-10 ml 4 °C RPMI 1640 w/1% FCS and filtered through a 50 µm syringe filter (Becton Dickinson, 340603). Cells were pelleted and re-suspended in FCS; 10% DMSO to obtain a cell density at 2x107 cells/ampoule and frozen.
  • Frozen vials with splenocytes in single cell suspension from Balb/c mice immunized with Tetanus toxoid were thawed at 37 °C and transferred to 15 ml tube with ice still present. 10 ml ice-cold RPMI, 10 % FCS was added drop-by-drop to the tube while swirling. After one wash in 10 ml FACS PBS, 5 ml PBS, 2 % FCS was added before filtering the cells through 50 µm Filcon (Becton Dickinson cat no 340603). Cells were pelleted and resuspended in 1 ml PBS, 2 % FCS (final volume) and subsequently stained with either anti CD43 FITC diluted 1:100 (BD cat no 553270) and anti CD138 PE diluted 1:40 (BD cat no 553714) in 1 ml PBS, 2 % FCS or with anti B220 APC (BD cat no 553092) diluted 1:40 and anti MHCII FITC (BD cat no 553547) diluted 1:200. Cells were incubated at 4 °C for 20 min in the dark. Finally, cells were washed 2 times with 2 ml PBS, 2 % FCS and added up to 15 ml PBS, 2 % FCS (foetal calf serum). Just before sorting, PI was added at 1:100, and cells were sorted in an event count of app. 1000-2000 cells/sec. Gatings for both stainings are depicted in Figure 3.
  • Fig 3A: PI positive (dead) cells were excluded in the bottom left panel (P1). Then plasma cells were gated as CD43 high, CD138 high in the bottom right panel (P2). Finally, doublets were excluded in the SSC-H, SSC-W plot, top right panel (P3). Cells positive for all three gates were sorted into ELISPOT plates according to table 1.
  • ELISPOT:
  • Nitrocellulose-bottomed 96 well plates (HA plates, Millipore, Bedford, MA) were pre-wet with PBS (also the blocking wells) before being coated with 100 µl 25µg/ml tetanus toxoid (TT) or sheep anti-murine IgG (Jackson Immuno Research, cat no 515-005-062) in PBS. The same volume of PBS was present in the control wells. The plate was left at 4 °C. The day after, wells were washed three times in PBS before being blocked with 200 µl RPMI + 2 % skim milk powder and left at 4 °C. The plate was moved to the incubator (37 °C, 5 % CO2, 100% humidity) 1h before addition of cells. 100 µl cells in complete RPMI were added to TT-, anti-IgG coated wells and blocked-only-wells. Medium without cells was included as control. The plate was moved to an incubator (37 °C, 5 % CO2, 100% humidity). The day after, the plate was washed six times to remove the cells, 3x in buffer: PBS+0.01% Tween20 and 3x in PBS. Subsequently, the wells were added HRP-conjugated goat anti-mouse IgG (Caltag M30007) diluted 1:3000 in RPMI + 2% Skim milk powder (100µl/well). After 2 hrs incubation 37 °C, wells were washed three times in PBS+0.01% Tween 20 followed by three times with only PBS. Spots were then developed with 100 µl chromogenic substrate consisting of 0.015% H2O2 and 0.3 mg/ml of 3-amino-9-ethylcarbazole in 0.1M sodium acetate, pH 5.1. The color development was stopped by washing with tap water after 5 min.
  • Judging the results in table 1, Plasma cells (PC; CD43 high, CD138 high, Gate P3 in Figure 3A) specific for TT are present at approximately 2 %, whereas TT specific plasma blasts (PB; B220 and MHCII positive, Gate P4 in Figure 3B) are present at approximately 4‰ This illustrates the superiority of the PC to PB in production of specific antibodies after TT immunization. Table 1
    Number of spots (number of sorted cells)
    Coat: PBS PC 0 (1,001) ND ND
    PB 0 (13,976) 0 (1500) 0 (150)
    Coat: TT PC 22 (1,002) 2 (250) 0 (50)
    PB 54 (13,257) 0 (1500) 0 (150)
    Coat: α-IgG PC 55 (1,000) 6 (250) 1 (50)
    PB 48 (5000) 8 (1500) 0 (150)
  • Example 2
  • A frozen vial of splenocytes was stained as described in example 1.
  • Four different phenotypes were four-way sorted. The sorting gates are depicted in figure 4. Firstly, PI positive or dead cells were excluded in the bottom left panel (P1). P2 is CD138 intermediate, CD43 high. P3 is CD138 high, CD43 high. P4 is CD138 high, Cd43 neg. P5 is CD138 intermediate, CD43 low. 10,000 cells positive for P1 and each of the four gates were sorted into test tubes and frozen for evaluation by mouse Symplex.
  • Fractions P2, P3, P4 and P5 were bulk sorted into tubes each containing 10,000 cells. The tubes were centrifuged and resuspended in Dulbecco's modified Eagle's medium containing 2 U/µl RNase inhibitor (RNasin, Promoga cat. no. N2511) to a concentration of 250 cells/µl, 10 µl per tube and frozen at -80°C.
  • A mouse Symplex amplification was performed on a dilution series of each of the 4 sets of sorted lymphocytes to compare the content of IgG-kappa antibody mRNA. The basic principles of the reactions are:
    • First an RT reaction is performed where heavy and light chain are primed by specific constant region primers
    • Secondly a multiplex reaction is performed using VH and VK 5' region primers covering all variable regions and equipped with complementary overhangs facilitating the formation of overlap bands. 3' primers are located in the constant region of heavy and light chain
    • Last a nested reaction is performed amplifying only conjugated VH and VK using JH and JK primers
    • The final reaction product consists of VH and VK conjugated 5' end to 5' end and connected with a linker. The size should be approximately 700 bp
  • For the combined multiplex RT-PCR reaction the set of primers shown in Table 2 was used, employing the Qiagen OneStep RT-PCR kit essentially according to the manufacturer's instructions. Frozen cells were thawed on ice, resuspended and centrifuged. In each dilution series cell lysate corresponding to 100, 32, 10, 3.2, 1, 0.32, 0.1 and 0 cells was used. Total reaction volumes were 20 µl. Cycling conditions were:
    • 55°C, 30 min.
    • 95°C, 15 min.
      Figure imgb0001
    • 72°C, 5 min.
    • 72°C, 10 min.
  • The nested reaction was performed with the set of primers shown in Table 3 using FastStart polymerase (Roche) and supplied reagents essentially according to the manufacturer's instructions. 1 µl of the RT-PCR reaction product was used per nested reaction in a total volume of 20 µl. Reaction conditions were:
    Figure imgb0002
    • 72°C, 90 sec.
    • 72°C, 10 min.
  • 10 µl of each final reaction product was finally analyzed on a 1% agarose gel.
  • From the result of the Symplex procedure on the titrated cell lysates (Figure 5), it is clear that we can link heavy and light chain variable regions with cells from P3 down to approximately 0.1 cell, and from P2 from approx. 3.2 cells. The other gates were less efficient with linkage only being possible from approximately 32 cells and more. In conclusion the CD43 high CD138 high (P3) is most useful for the Symplex procedure at single cell level, while the CD43high CD138 intermediate could be used but is less efficient. Table 2: Mouse Symplex primer set used for the combined RT and multiplex reaction
    Primer Conc (nM) Sequence Seq.no.
    mHCrev1 0.2 GACSGATGGGCCCTTGGTGG 1
    mKappar 0.2 GCTGTAGGTGCTGTCTTTGC 2
    mVH set
    mVH A 0.04 TATTCCCATGGCGCGCCSAGGTCCARCTGCARCAGYC 3
    mVH B 0.04 TATTCCCATGGCGCGCCGARGTGMAGCTKGTKGAGTC 4
    mVH C 0.04 TATTCCCATGGCGCGCCSAGGTGCAGCTKMAGGAGTC 5
    mVH 8 0.04 TATTCCCATGGCGCGCCCAGGTTACTCTGAAAGAGTC 6
    mVH 9 0.04 TATTCCCATGGCGCGCCCAGATCCAGTTGGTGCAGTC 7
    mVK set
    mVK D 0.04 GGCGCGCCATGGGAATAGCTAGCCGAYATCCAGATGA 8
    mVK E 0.04 GGCGCGCCATGGGAATAGCTAGCCRACATTGTGMTGA 9
    mVK F 0.04 GGCGCGCCATGGGAATAGCTAGCCSAMATTGTKCTSA 10
    mVK 1-2 0.04 GGCGCGCCATGGGAATAGCTAGCCGATRTTGTGATGA 11
    W=A/T, R=A/G, S=G/C, Y=C/T, K=G/T, M=A/C, H=ACT, B=GCT; Conc. - final concentration.
    Table 3: Primers for mouse Symplex nested PCR
    Primer Conc. nM Sequence Seq. no
    mJH set
    mJH1 200 GGAGGCGCTCGAGACGGTGACCGTGGTCCC 12
    mJH2 200 GGAGGCGCTCGAGACTGTGAGAGTGGTGCC 13
    mJH3 200 GGAGGCGCTCGAGACAGTGACCAGAGTCCC 14
    mJH4 200 GGAGGCGCTCGAGACGGTGACTGAGGTTCC 15
    mJK set
    mJK1 200 GATGGTGCAGCCACAGTTCGTTTGATTTCCAGCTTGGTG 16
    mJK2 200 GATGGTGCAGCCACAGTTCGTTTTATTTCCAGCTTGGTC 30
    mJK4 200 GATGGTGCAGCCACAGTTCGTTTTATTTCCAACTTTGTC 31
    mJK5 200 GATGGTGCAGCCACAGTTCGTTTCAGCTCCAGCTTGGTC 17
  • EXAMPLE 3. Cloning of anti-EGFR antibodies Immunizations
  • Female BALB/c, strain A, or C57B16 mice (8-10 weeks old) were used for immunizations by injections with different purified proteins in addition to EGFR overexpressing cells.
  • Commercially available EGFR proteins (R&D systems cat#1095-ER or Sigma # E3641) were used for some of the immunizations. For other of the immunizations recombinant human EGFR and EGFRvIII produced as fusion proteins were used consisting of the ECD of EGFR or EGFRvIII and human growth hormone (hGH), also including a Tobacco Etch Virus (TEV)-cleavage site in addition to a His-tag. In some cases the ECD of EGFR was isolated by TEV-protease cleavage and subsequent purification on a Nickel column.
  • The human head-and-neck cancer cell line, HN5 (Easty DM, Easty GC, Carter RL, Monaghan P, Butler LJ. Br J Cancer. 1981 Jun;43(6):772-85. Ten human carcinoma cell lines derived from squamous carcinomas of the head and neck.) expressing approximately 107 receptors/cell was used for cell based immunizations. Cells were cultured in DMEM medium supplemented with 10% FBS (Fetal Bovine Serum), 3mM Glycerol, 5mM Sodium Pyruvate and 1% Penicillin Streptomycin. Before each immunization the cells were washed in PBS, trypsinized with TrypLE and resuspended in growth medium. Subsequently the cell suspensions was washed twice in PBS by centrifugation at 250Xg for 5 min, dislodging and resuspension in 15 ml sterile PBS.
  • Cells or antigen were diluted in PBS and then mixed 1:1 with Freund's Adjuvant. Adjuvant is used to enhance and modulate the immune response. For the first immunizations Complete Freund's Adjuvant (CFA) was used whereas Incomplete Freund's Adjuvant (IFA) was used for the subsequent immunizations. IFA is an oil-in-water emulsion composed of mineral oils and CFA is IFA to which heat-killed, dried Mycobacterium species are added. Both adjuvants have a depot effect. CFA gives rise to long-term persistence of the immune response and is used for the first immunizations to boost the immune response and IFA is used for subsequent immunizations. The emulsions were tested by adding a drop on the surface of a glass with water. If the drop remains as one drop, the emulsion is stable and the injections can be performed. Only stable emulsions were administered to mice.
  • Depending on the schedule (see Table 4), 25-100 µg antigen or 107 cells were used for each injection. In total, mice received 4 injections. All mice were injected with either 300 µl or 200 µl emulsion. Depending on the schedule, injections were performed subcutaneously (s.c.), intraperitoneally (i.p.) or intravenously (i.v.).
  • At termination, the mice were sacrificed by cervical dislocation, and the spleens were removed and transferred to a 74 µm cell strainer (Corning#136350-3479). The cells were macerated through the filter, resuspended in cold RPMI 1640 with 10% FBS and centrifuged at 300Xg for 5 minutes. The cell pellet was resuspended in RPMI 1640 with 1% FBS, filtered through a 50 µm syringe filter (BD# 340603) and collected by centrifugation. The cell pellet was cryopreserved after resuspension in FCS with 10% DMSO and frozen cells stored at -80°C until FACS sorting.
  • FACS sorting of murine plasma cells
  • Vials with frozen splenocytes were thawed at 37°C and transferred to 15 ml tube with ice still present. 10 ml Ice-cold RPMI, 10 % FBS (foetal bovine serum) was added drop-by-drop to the tube while swirling. After one wash in 10 ml FACS PBS, 5 ml FCS PBS is added before filtering the cells through 50 µm Filcon. Cells were then pelleted and resuspended in 1 ml PBS with 2% FBS (final volume) and stained with anti-CD43-FITC and anti-CD138-PE according to the specific dilution to a final concentration of app. 5 µg/ml. Cells were incubated at 4°C for 20 min in the dark. Subsequently, cells were washed 2 times with 2 ml FACS buffer. Up to 15 ml FACS PBS was added. Propidium Iodide (PI) was added at 1:100, and cells were subsequently sorted into 96 well PCR-plates, containing PCR reaction buffer (see below), and spun down for 2 min 400Xg before the plates were frozen at -80°C. Plasma cells were gated as CD43-positive/CD-138 positive.
  • Linkage of cognate VH and VL pairs
  • The linkage of VH and VL coding sequences was performed on the single cells gated as plasma cells, facilitating cognate pairing of the VH and VL coding sequences. The procedure utilized a two step PCR procedure based on a one-step multiplex overlap-extension RT-PCR followed by a nested PCR. The primer mixes used in the present example only amplify Kappa light chains. Primers capable of amplifying Lambda light chains could, however, be added to the multiplex primer mix and nested PCR primer mix if desired. If Lambda primers are added, the sorting procedure should be adapted such that Lambda positive cells are not excluded. The principle for linkage of cognate VH and VL sequences is illustrated in Figure 1 .
  • The 96-well PCR plates produced were thawed and the sorted cells served as template for the multiplex overlap-extension RT-PCR. The sorting buffer added to each well before the single-cell sorting contained reaction buffer (OneStep RT-PCR Buffer; Qiagen), primers for RT-PCR (see Table 2 above) and RNase inhibitor (RNasin, Promega). This was supplemented with OneStep RT-PCR Enzyme Mix (25x dilution; Qiagen) and dNTP mix (200 µM each) to obtain the given final concentration in a 20-µl reaction volume. The plates were incubated for 30 min at 55°C to allow for reverse transcription of the RNA from each cell. Following the RT, the plates were subjected to the following PCR cycle: 10 min at 94°C, 35x(40 sec at 94°C, 40 sec at 60°C, 5 min at 72°C), 10 min at 72°C.
  • The PCR reactions were performed in an H20BIT Thermal cycler with a Peel Seal Basket for 24 96-well plates (ABgene) to facilitate a high-throughput. The PCR plates were stored at -20°C after cycling.
  • For the nested PCR step, 96-well PCR plates were prepared with the following mixture in each well (20-µl reactions) to obtain the given final concentration: 1× FastStart buffer (Roche), dNTP mix (200 µM each), nested primer mix (see Table 5), Phusion DNA Polymerase (0.08 U; Finnzymes) and FastStart High Fidelity Enzyme Blend (0.8 U; Roche). As template for the nested PCR, 1 µl was transferred from the multiplex overlap-extension PCR reactions. The nested PCR plates were subjected to the following thermocyling: 35x(30 sec at 95°C, 30 sec at 60°C, 90 sec at 72°C), 10 min at 72°C.
  • Randomly selected reactions were analyzed on a 1% agarose gel to verify the presence of an overlap-extension fragment of approximately 890 basepairs (bp).
  • The plates were stored at -20°C until further processing of the PCR fragments.
  • The repertoires of linked VH and VL coding pairs from the nested PCR were pooled, without mixing pairs from different donors, and were purified by preparative 1% agarose gel electrophoresis. The human kappa constant light chain encoding sequence was spliced by overlap extension to the VL coding region of the pooled PCR products of linked VH and VL coding pairs ( Figure 2 ). The human kappa constant light chain encoding sequence was amplified from a plasmid containing the coding sequence of a human antibody with a kappa light chain in a reaction containing: Phusion Enzyme (2 U; Finnzymes), 1x Phusion buffer, dNTP mix (200 µM each), hKCforw-v2 primer and Kappa3' primer (Table 6), and plasmid template pLL138 (10 ng/µl) in a total volume of 50 µl. The reaction was subjected to the following thermocycling: 25x(30 sec at 95°C, 30 sec at 55°C, 45 sec at 72°C), 10 min at 72°C. The resulting PCR fragment was purified by preparative 1% agarose gel electrophoresis.
  • The purified pooled PCR fragments of each repertoire were spliced to the amplified and purified PCR fragment of the human kappa constant encoding region (Appendix 1) by the following splicing by overlap extension PCR (50 µl total volume) containing: human kappa constant encoding region fragment (1.4 ng/µl), purified pooled PCR fragment (1.4 ng/µl), Phusion DNA Polymerase (0.5 U; Finnzymes) and FastStart High Fidelity Enzyme Blend (0.2 U; Roche), 1x FastStart buffer (Roche), dNTP mix (200 µM each), mhKCrev primer and mJH set primers (see Table 6). The reaction was subjected to the following thermocycling: 2 min at 95°C, 25x(30 sec at 95°C, 30 sec at 55°C, 1 min at 72°C), 10 min at 72°C. The resulting PCR fragment (approx. 1070 bp) was purified by preparative 1% agarose gel electrophoresis.
  • Insertion of cognate VH and VL coding pairs into a screening vector
  • In order to identify antibodies with binding specificity to EGFR, the VH and VL coding sequences obtained were expressed as full-length antibodies. This involved insertion of the repertoire of VH and VL coding pairs into an expression vector and transfection into a host cell.
  • A two-step cloning procedure was employed for generation of a repertoire of expression vectors containing the linked VH and VL coding pairs. Statistically, if the repertoire of expression vectors contains ten times as many recombinant plasmids as the number of cognate paired VH and VL PCR products used for generation of the screening repertoire, there is 99% likelihood that all unique gene pairs are represented. Thus, if 400 overlap-extension V-gene fragments were obtained, a repertoire of at least 4000 clones was generated for screening.
  • Briefly, the purified PCR product of the repertoires of linked VH and VL coding pairs, spliced to the human kappa constant coding region, was cleaved with Xhol and Notl DNA endonucleases at the recognition sites introduced into the termini of PCR products. The cleaved and purified fragments were ligated into an Xhol/Notl digested mammalian IgG expression vector, OO-VP-002 ( Figure 6) by standard ligation procedures. The ligation mix was electroporated into E. coli and added to 2xYT plates containing the appropriate antibiotic and incubated at 37°C over night. The amplified repertoire of vectors was purified from cells recovered from the plates using standard DNA purification methods (Qiagen). The plasmids were prepared for insertion of promoter-leader fragments by cleavage using Ascl and Nhel endonucleases. The restriction sites for these enzymes were located between the VH and VL coding gene pairs. Following purification of the vector, an Ascl-Nhel digested bi-directional mammalian promoter-leader fragment was inserted into the Ascl and Nhel restriction sites by standard ligation procedures. The ligated vector was amplified in E. coli and the plasmid was purified using standard methods. The generated repertoire of screening vectors was transformed into E. coli by conventional procedures. Colonies obtained were consolidated into 384-well master plates and stored. The number of arrayed colonies exceeded the number of input PCR products by at least 3-fold, thus giving 95% percent likelihood for the presence of all unique V-gene pairs obtained.
  • DNA plasmid was prepared from selected clones above and FreeStyle CHO-S cells (Invitrogen) were transfected in 2-ml scale for expression of antibodies (according to the manufacturer's instructions). The supernatants were harvested 96 hours after transfection.
  • Screening for binding to EGFR extracellular domain
  • In general, the screening was made as a two step procedure. The antibody libraries were screened for reactivity to recombinant EGFR protein in ELISA after which FMAT (FLISA) was used as a cell based approach, with the NR6wtEGFR cell line (Batra et al, 1995, Cell Growth Differ, 6(10):1251-9), for detection of EGFR-antibodies binding to cell-surface expressed EGFR. For the 101 and 108/109 libraries (Table 4) the ELISA was performed with recombinant EGFR representing the extracellular domain of the EGFR.
  • Briefly for the ELISA, Nunc maxisorb plates (cat no 464718) were coated with 1 µg/ml protein (in house produced), diluted in PBS at 4C over night. Prior to blocking in 50ul 2%-Milk-PBS-T the plates were washed once with PBS + 0.05 % Tween 20 (PBS-T). The plates were washed once with PBS-T, 20 µl of 2%- milk-PBS-T and 5 µl supernatants from the FreeStyle CHO-S transfectants (see above) were added and incubated for 1½ hours R.T after which the plates were washed once with PBS-T 20 µl per well. Secondary antibody (HRP-Goat-anti-human IgG, Jackson, cat no 109-035-097) diluted 1:10000 in 2% milk-PBS-T was added to detect the antibodies bound to the wells and incubated for 1 hour at Room Temperature. The plates were washed once in PBS-T before addition of 25 µl substrate (Kem-en-tec Diagnostics, cat no 4390) that was incubated for 5 min. 25 µl 1 M sulfuric acid was added after the incubation to stop the reaction. Specific signal was detected on an ELISA reader at 450 nm.
  • For the cell based FMAT detection of anti-EGFR antibodies, SKBR-3 (ATCC #HTB-30) or NR6wtEGFR cells were kept in growth medium. The cells were counted and diluted to 125,000 cells/ml with the Alexa-647 conjugated goat-anti-human IgG (H-L) antibody (Molecular probes No. A21445, lot no. 34686A) diluted 1:40,000. A total of 20 µl of this suspension was transferred to 384 well clear bottom Nunc plates. Subsequently 10 µl transfection supernatant was added to the cells. The FMAT signal from the reaction was measured after 6-10 hours of incubation.
  • The data from the screening indicates that 221 (4.8%) of the total clones were positive in the ELISA. 93 (2.0%) of those clones were also positive in FMAT. In total 220 (4.8%) of the clones were positive in the FMAT and among those 127 (220-93) uniquely positive for the cell surface antigen. The 111 library was screened in a similar fashion, but since the immunization procedure was made to generate antibodies specific for the deletion mutant EGFR receptor EGFRvIII, the ELISA screenings included assays to detect both wild-type EGFR and EGFRvIII. Seven clones were identified to be specific for the EGFRvIII in the ELISA and interestingly those clones were negative for staining of wtEGFR expressing cells in the FMAT. 13 clones were identified to be positive for the wtEGFR in FMAT and ELISA but not for the EGFRvIII, which were unique for this library compared to the 101 and 108/109 libraries. All the ELISA positive clones were selected for further analysis.
  • Sequence analysis and clone selection
  • The clones identified as EGFR-specific in ELISA were retrieved from the original master plates (384-well format) and consolidated into new plates. DNA was isolated from the clones and submitted for DNA sequencing of the V-genes. The sequences were aligned and all the unique clones were selected. Multiple alignments of obtained sequences revealed the uniqueness of each particular clone and allowed for identification of unique antibodies. Following sequence analysis of 220 clones, 70 genetically distinct antibody sequence clusters were identified. Each cluster of related sequences have probably been derived through somatic hypermutations of a common precursor clone. Overall, one to two clones from each cluster were chosen for validation of sequence and specificity.
  • Sequence and specificity validation
  • In order to validate the antibody encoding clones, DNA plasmid was prepared and transfection of FreeStyle CHO-S cells (Invitrogen) in 2-ml scale was performed for expression. The supernatant were harvested 96 hours after transfection. Expression levels were estimated with standard anti-IgG ELISA, and the specificity was determined by EGFR- and EGFRvIII-specific ELISA. 85% of the clones were shown to have the correct specificity and sequence.
  • Screening for anti-proliferative effects
  • Cellular damage will inevitably result in loss of the ability of the cell to maintain and provide energy for metabolic cell function and growth. Metabolic activity assays are based on this premise. Usually they measure mitochondrial activity. The Cell Proliferation Reagent WST-1 (Roche Cat. No. 11 644 807 001) is a ready-to-use substrate which measures the metabolic activity of viable cells. It is then assumed that the metabolic activity correlates with the number of viable cells. In this example the WST-1 assay was used to measure the number of metabolically active cells after treatment with cell culture supernatants containing different anti-EGFR antibodies.
  • Prior to performing the WST-1 assay different volumes of 2-ml supernatants (0, 10, 25, 50 and 150 µl) were transferred to appropriate wells in a 96 well plate.
  • HN5 cells were then washed with 1xPBS and detached by trypsination with 3 ml trypsin solution. 17 ml of complete media was then added and the cells were spun down at 300xg (1200 rcf) for 5 min. The supernatant was removed and cells were re-suspended in DMEM + 0.5% FBS. Cells were counted and their concentration was adjusted and 1500 cells were added to the wells with supernatants so that each well contained 200 µl media in total. The plates were incubated for 4 days in a humidified incubator at 37°C. Then 20 µl WST-1 reagent was added pr. well and the plates were incubated for one hour at 37°C. Plates were then transferred to a orbital plate shaker and left another hour. The absorbance was measured at 450 and 620 nm (reference wavelength) on an ELISA reader. The difference in the levels of metabolically active cells (MAC) was calculated as percent of the control supernatants as follows: % MAC = 1 - OD exp . - OD media ODuntreat . - ODmedia × 100
    Figure imgb0003
  • These values were then used as the basis for a supervised hierarchical cluster analysis (clustered based on reactivity in ELISA) performed using the free software Cluster and TreeView.
  • It is preferable to be able to screen for functional antibodies at an early stage in the antibody selection process. The culture supernatants from 83 2-ml transfections were used to screen for growth inhibitory functions in a proliferation assay performed using HN5 cells in 0.5% FBS. Results were visualized by simple hierarchical cluster analysis. As can be seen in the cluster analysis (Figure 7) a number of supernatants were found to decrease the number of metabolically active HN5 cells (dark grey) in a concentration dependent manner (Cluster 2). Similarly, some supernatants increased the number of metabolically active HN5 cells (light grey) in a concentration dependent manner ( Clusters 1, 3 and 4). An interesting observation was that supernatants which decreased the number of metabolically active HN5 cells had reactivity 2 (black arrows) whereas supernatants which increased the number of metabolically active HN5 cells had reactivity 1 (grey arrows). Supernatants with reactivity 2 were positive in both wtEGFR and EGFRvIII ELISAs, while supernatants with reactivity 1 only had reactivity towards wtEGFR. Thus, such analyses may provide relationships between antibody reactivity in ELISA and functionality in cellular assays.
  • Clone repair
  • When using a multiplex PCR approach, a certain degree of intra- and inter-V-gene family cross-priming is expected due to primer degeneracy and the high degree of homology. The cross-priming introduces amino acids that are not naturally occurring in the immunoglobulin framework with several potential consequences, e.g. structural changes and increased immunogenicity, all resulting in a decreased therapeutic activity.
  • In order to eliminate these drawbacks and to ensure that selected clones mirror the natural humoral immune response, such cross-priming mutations were corrected in a process called clone repair.
  • In the first step of the clone repair procedure, the VH sequence was PCR amplified with a primer set containing the sequence corresponding to the VH-gene the clone of interest originated from, thereby correcting any mutations introduced by cross-priming. The PCR fragment was digested with Xhol and Ascl and ligated back into the Xhol/Ascl digested mammalian expression vector (Figure 6) using conventional ligation procedures. The ligated vector was amplified in E. coli and the plasmid was purified by standard methods. The VH sequence was sequenced to verify the correction and the vector was digested with Nhel/Notl to prepare it for insertion of the light chain.
  • In the second step the complete light chain was PCR amplified with a primer set containing the sequence corresponding to the VL-gene the clone of interest originated from, thereby correcting any mutations introduced by cross-priming. The PCR fragment was digested with Nhel/Notl and ligated into the VH containing vector prepared above. The ligation product was amplified in E. coli and the plasmid was purified by standard methods. Subsequently, the light chain was sequenced to verify the correction.
  • In the case where the Kappa constant region of a selected clone contains mutations, introduced during the amplification of the genes, it is replaced by an unmutated constant region. This is done in an overlap PCR where the repaired VL-gene (amplified without the constant region) is fused to a constant region with correct sequence (obtained in a separate PCR). The whole sequence is amplified and cloned into the VH containing vector as described above and the repaired light chain is sequenced to verify the correction. Table 4 Immunization schedules used to generate starting material for anti-EGFR cloning
    Schedule, Mouse group Strain Injection 1 Injection 2 Injection 3 Injection 4 Termination
    101 Balb/c Day 1 25 µg rhEGFR (R&D systems 1095-ER) CFA s.c. Day 35 25 µg rhGH-EGFR (Symphogen ) IFA s.c Day 56 25 µg rhEGFR* (Symphogen) IFA s.c Day 70 25 µg rhEGFR* (Symphogen) IFA s.c Day 73
    108 Balb/c Day 1 1x107 HN5 cells CFA i.p. Day 28 25 µg rhEGFR* (Symphogen ) IFA s.c. Day 42 1x107 HN5 cells IFA i.p. Day 56 25 µg rhEGFR*, (Symphogen) IFA s.c. Day 59
    109 Balb/c Day 1 1x107 HN5 cells CFA i.p. Day 28 25 µg rhEGFR* (Symphogen ) IFA s.c. Day 42 1x107 HN5 cells IFA i.p. Day 56 25 µg rhEGFR* (Symphogen) PBS i.v. Day 59
    111 Balb/c Day 1 25 µg rhEGFR* (Symphog en) CFA s.c. Day 28 25 µg rhEGFR+ rhEGFRvIII** (Symphogen ) IFA s.c. Day 42 25 µg rhEGFR+ rhEGFRvIII** (Symphogen) IFA s.c. Day 56 25 µg rhEGFR+ rhEGFRvIII** (Symphogen) IFA s.c. Day 59
    118 Balb/c Day 1 1x107 HN5 cells CFA i.p. Day 29 100 µg rhGH-EGFR (Symphogen ) IFA s.c. Day 44 1x107 HN5 cells IFA i.p. Day 58 25 µg rhEGFR, (Sigma E3641) IFA s.c. Day 61
    119 C57B Day 1 1x107 HN5 cells CFA i.p. Day 29 100 µg rhGH-EGFR (Symphogen ) IFA s.c. Day 44 1x107 HN5 cells IFA i.p. Day 58 25 µg rhEGFR, (Sigma E3641) IFA s.c. Day 61
    Table 5 Nested primer set
    Primer name Conc. (nM) Sequence Seq. no
    mHCrev1- 0.2 GGACAGGGMTCCAKAGTTCCADKT 18
    hmJK set
    hmJK1-v2 0.2 GACAGATGGTGCAGCCACAGTTCGTTTGATTTCCAG 19
    hmJK2-v2 0.2 GACAGATGGTGCAGCCACAGTTCGTTTTATTTCCAGC 20
    hmJK4-v2 0.2 GACAGATGGTGCAGCCACAGTTCGTTTTATTTCCAAC 21
    hmJK5-v2 0.2 GACAGATGGTGCAGCCACAGTTCGTTTCAGCTCCAG 22
    K=G/T, M=A/CD=AGT, Conc. - final concentration.
    Table 6 Kappa constant splicing primer set
    Primer Con c. Sequence Seq. no.
    Human kappa constant amplification
    hKCforw- 0.2 GAACTGTGGCTGCACCATCTGTC 23
    Kappa3' 0.2 ACCGCCTCCACCGGCGGCCGCTTATTAACACTCTCCCCTGTTG 24
    Splicing by overlap extension
    mhKCrev 0.2
    Figure imgb0004
    25
    mJH set
    mJH1 0.2 GGAGGCGCTCGAGACGGTGACCGTGGTCCC 12
    mJH2 0.2 GGAGGCGCTCGAGACTGTGAGAGTGGTGCC 13
    mJH3 0.2 GGAGGCGCTCGAGACAGTGACCAGAGTCCC 14
    mJH4 0.2 GGAGGCGCTCGAGACGGTGACTGAGGTTCC 15
  • Appendix 1, Antibody constant region sequences
  • >Human IGKC region (Seq. no. 26)
    Figure imgb0005
  • >Human IGKC region (Seq. no. 27)
    Figure imgb0006
  • Exon1 1..298
    Intron 299..689
    Exon2 690..734
    Intron 735..852
    Exon3 853..1182
    Intron 1183..1279
    Exon4 1280..1602
  • >human IGHG1 constant domain genomic sequence (Seq. no. 28)
    Figure imgb0007
  • >IGHG1 (Seq. no. 29)
    Figure imgb0008

Claims (16)

  1. A method of producing a library of cognate pairs comprising linked variable region encoding sequences, said method comprising:
    a) providing a lymphocyte-comprising cell fraction from a rodent donor;
    b) obtaining a population of isolated single cells by distributing cells from said cell fraction individually into a plurality of vessels, wherein at least a subpopulation of the cells are enriched in and express CD43 and CD138 antigen or MHCII and B220 antigen; and
    c) amplifying and effecting linkage of the variable region encoding sequences contained in said population of isolated single cells by amplifying, in a multiplex molecular amplification procedure, nucleotide sequences of interest using a template derived from an isolated single cell or a population of isogenic cells; and effecting linkage of the amplified nucleotide sequences of interest.
  2. The method according to claim 1, wherein the lymphocyte-containing cell fraction comprises splenocytes, whole blood, bone marrow, mononuclear cells, or white blood cells, preferably splenocytes or bone marrow, more preferably splenocytes; and optionally wherein the lymphocyte-containing cell fraction or B lymphocyte lineage is enriched for plasma cells or plasmablasts.
  3. The method according to claim 1 or 2, wherein the nucleotide sequences of interest comprise immunoglobulin variable region encoding sequences and the linkage generates a cognate pair of a light chain variable region encoding sequence associated with a heavy chain variable region encoding sequence.
  4. A method of randomly linking a plurality non-contiguous nucleotide sequences of interest, wherein said nucleotide sequences of interest comprise variable region encoding sequences and the linkage generates a combinatorial library of pairs of variable region encoding sequences, preferably wherein said nucleotide sequences of interest comprise immunoglobulin variable region encoding sequences and the linkage generates a combinatorial library of pairs of light chain variable region and heavy chain variable region encoding sequences, said method comprising:
    a) amplifying, in a multiplex molecular amplification procedure, nucleotide sequences of interest using a template derived from a population of genetically diverse cells;
    wherein the genetically diverse cells are derived from a lymphocyte-comprising cell fraction from a rodent donor; and wherein at least a subpopulation of the cells are enriched in and express CD43 and CD138 antigen or MHCII and B220 antigen; and b) effecting linkage of the nucleotide sequences of interest amplified in step a).
  5. The method of any of the preceding claims, further comprising assessing prior to multiplex molecular amplification that the population of lymphocyte-comprising cells comprises cells that express CD43 and CD138 antigen or MHCII and B220 antigen, preferably CD43 and CD138.
  6. The method of any of the preceding claims, further comprising enriching said lymphocyte-comprising cell fraction for a lymphocyte population expressing CD43 and CD138 antigen or MHCII and B220 prior to multiplex molecular amplification.
  7. The method of any of the preceding claims, further comprising isolating from said lymphocyte-comprising population cells expressing CD43 and CD138 antigen or MHCII and B220 antigen prior to multiplex molecular amplification.
  8. The method of any of the preceding claims, wherein the enrichment or isolation comprises MACS, FACS or another automated sorting procedure.
  9. The method of any of the preceding claims, wherein the donor is a mouse.
  10. The method of claim 9, wherein the mouse is a transgenic mouse.
  11. The method according to any of the preceding claims, wherein said multiplex molecular amplification procedure is a multiplex RT-PCR amplification; and wherein:
    said multiplex RT-PCR amplification is a two step process comprising a separate reverse transcription step prior to the multiplex PCR amplification, or
    wherein said multiplex RT-PCR amplification is performed in a single step comprising initially adding all the components necessary to perform both reverse transcription and multiplex PCR amplification into a single vessel.
  12. The method according to any of the preceding claims, wherein said linkage of the nucleotide sequences of interest is effected in association with the multiplex PCR amplification, utilizing a multiplex overlap-extension primer mix.
  13. The method according to claim 12, wherein the multiplex overlap-extension primer mix comprises:
    a) at least one mKappar1 or hmJK primer complementary to the sense strand of an immunoglobulin light chain region encoding sequence;
    b) at least one mVK primer complementary to the antisense strand of an immunoglobulin light chain variable region encoding sequence or light chain variable region leader sequence, and capable of forming a primer set with the primer(s) in step a);
    c) at least one mCHrev1, mHCrev1-ext, or mJH primer complementary to the sense strand of an immunoglobulin heavy chain domain encoding sequence; and
    d) at least one mVH primer complementary to the antisense strand of an immunoglobulin heavy chain variable region encoding sequence or heavy chain variable region leader sequence, and capable of forming a primer set with the primer(s) in step c).
  14. The method according to any of the preceding claims, further comprising inserting the linked nucleotide sequences or a library of cognate pairs into a vector; and optionally further comprising the steps:
    a) introducing a vector encoding a segment of linked nucleotide sequences into a host cell;
    b) cultivating said host cell under conditions adapted for expression; and
    c) obtaining the protein product expressed from the vector inserted into said host cell.
  15. The method according to claim 14, wherein the linked nucleotide sequences or the individual members of the library of cognate pairs comprise an immunoglobulin heavy chain variable region encoding sequence associated with light chain variable region encoding sequence and said sequences are inserted in-frame into a vector containing sequences encoding one or more immunoglobulin constant domains or fragments thereof.
  16. The method according to claim 15, further comprising transferring said cognate pair or library of target-specific cognate pairs of variable region encoding sequences to a mammalian expression vector, wherein the mammalian expression vector optionally encodes one or more constant region domains selected from human immunoglobulin classes IgA1, IgA2, IgD, IgE, IgG1, IgG2, IgG3, IgG4, IgM, kappa light chain or lambda light chain.
EP08706942A 2007-03-01 2008-02-27 Method for cloning cognate antibodies Active EP2121920B1 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
PL08706942T PL2121920T3 (en) 2007-03-01 2008-02-27 Method for cloning cognate antibodies
SI200830415T SI2121920T1 (en) 2007-03-01 2008-02-27 Method for cloning cognate antibodies
CY20111101105T CY1112664T1 (en) 2007-03-01 2011-11-16 METHOD OF CLASSIFICATION OF CONNECTED ANTIBODIES

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
DKPA200700316 2007-03-01
US90477207P 2007-03-05 2007-03-05
PCT/DK2008/050048 WO2008104184A2 (en) 2007-03-01 2008-02-27 Method for cloning cognate antibodies

Publications (2)

Publication Number Publication Date
EP2121920A2 EP2121920A2 (en) 2009-11-25
EP2121920B1 true EP2121920B1 (en) 2011-08-24

Family

ID=38561767

Family Applications (1)

Application Number Title Priority Date Filing Date
EP08706942A Active EP2121920B1 (en) 2007-03-01 2008-02-27 Method for cloning cognate antibodies

Country Status (24)

Country Link
US (2) US8283294B2 (en)
EP (1) EP2121920B1 (en)
JP (1) JP5669397B2 (en)
KR (1) KR20090118993A (en)
CN (1) CN101622346A (en)
AT (1) ATE521704T1 (en)
AU (1) AU2008221119B2 (en)
BR (1) BRPI0808171A2 (en)
CA (1) CA2677771A1 (en)
CY (1) CY1112664T1 (en)
DK (1) DK2121920T3 (en)
ES (1) ES2369946T3 (en)
HK (1) HK1133905A1 (en)
HR (1) HRP20110686T1 (en)
IL (1) IL199773A0 (en)
MX (1) MX2009008908A (en)
NZ (1) NZ578851A (en)
PL (1) PL2121920T3 (en)
PT (1) PT2121920E (en)
RU (1) RU2459868C2 (en)
SI (1) SI2121920T1 (en)
TW (1) TW200911986A (en)
WO (1) WO2008104184A2 (en)
ZA (1) ZA200905831B (en)

Families Citing this family (68)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
MX2009008909A (en) * 2007-03-01 2009-08-28 Symphogen As Recombinant anti-epidermal growth factor receptor antibody compositions.
JP2008289483A (en) * 2007-05-25 2008-12-04 Symphogen As Screening of transformant expressable in eukaryote system
DE102007037215A1 (en) * 2007-08-07 2009-02-12 Robert Bosch Gmbh actuator
WO2010000864A1 (en) * 2008-07-04 2010-01-07 Novo Nordisk A/S Method for producing monoclonal antibodies
RU2011111725A (en) * 2008-08-29 2012-10-10 Симфоген А/С (Dk) BIRD ANTIBODY CLONING METHOD
WO2010022736A2 (en) 2008-08-29 2010-03-04 Symphogen A/S Recombinant anti-epidermal growth factor receptor antibody compositions
US9365901B2 (en) 2008-11-07 2016-06-14 Adaptive Biotechnologies Corp. Monitoring immunoglobulin heavy chain evolution in B-cell acute lymphoblastic leukemia
US8691510B2 (en) * 2008-11-07 2014-04-08 Sequenta, Inc. Sequence analysis of complex amplicons
ES2453066T3 (en) 2008-11-07 2014-04-03 Sequenta, Inc. Methods for monitoring conditions by sequence analysis
US9506119B2 (en) 2008-11-07 2016-11-29 Adaptive Biotechnologies Corp. Method of sequence determination using sequence tags
US8628927B2 (en) 2008-11-07 2014-01-14 Sequenta, Inc. Monitoring health and disease status using clonotype profiles
US9528160B2 (en) 2008-11-07 2016-12-27 Adaptive Biotechnolgies Corp. Rare clonotypes and uses thereof
US8748103B2 (en) 2008-11-07 2014-06-10 Sequenta, Inc. Monitoring health and disease status using clonotype profiles
PT2387627E (en) 2009-01-15 2016-06-03 Adaptive Biotechnologies Corp Adaptive immunity profiling and methods for generation of monoclonal antibodies
US9079942B2 (en) * 2009-02-09 2015-07-14 Epitomics, Inc. CDR-anchored amplification method
WO2010094475A1 (en) 2009-02-20 2010-08-26 F. Hoffmann-La Roche Ag Method for obtaining immunoglobulin encoding nucleic acid
US20100317539A1 (en) * 2009-06-12 2010-12-16 Guo-Liang Yu Library of Engineered-Antibody Producing Cells
AU2010263172B2 (en) 2009-06-25 2016-03-31 Fred Hutchinson Cancer Research Center Method of measuring adaptive immunity
US8293483B2 (en) 2009-09-11 2012-10-23 Epitomics, Inc. Method for identifying lineage-related antibodies
US9043160B1 (en) 2009-11-09 2015-05-26 Sequenta, Inc. Method of determining clonotypes and clonotype profiles
DE102010037622B4 (en) 2010-09-17 2012-07-12 Lophius Biosciences Gmbh Method for detecting, differentiating and quantifying T-cell populations using reverse transcription quantitative real-time PCR (RT-qPCR) technology
EP3561073A1 (en) 2010-10-08 2019-10-30 President and Fellows of Harvard College High-throughput immune sequencing
US9155802B2 (en) 2010-11-01 2015-10-13 Symphogen A/S Pan-HER antibody composition
JP5920835B2 (en) * 2010-12-01 2016-05-18 田辺三菱製薬株式会社 Polynucleotide construct capable of presenting Fab in cell-free translation system, and Fab production method and screening method using the same
US20130316358A1 (en) * 2011-01-31 2013-11-28 Yeda Research And Development Co. Ltd. Methods of diagnosing disease using overlap extension pcr
BR112013027540A2 (en) 2011-04-28 2016-09-20 Us Dept Veterans Affairs polynucleotide composition bank, method for producing one or more polynucleotides of interest, and method for analyzing and using sequencing data
US10385475B2 (en) 2011-09-12 2019-08-20 Adaptive Biotechnologies Corp. Random array sequencing of low-complexity libraries
RU2014113678A (en) * 2011-09-21 2015-10-27 Ф. Хоффманн-Ля Рош Аг METHOD FOR PRODUCING FAB FRAGMENTS FROM SINGLE-CELL PRODUCING ANTIBODIES BY MULTIPLEX PCR IN COMBINATION WITH TAQMAN PROBES
US9279159B2 (en) 2011-10-21 2016-03-08 Adaptive Biotechnologies Corporation Quantification of adaptive immune cell genomes in a complex mixture of cells
EP3904536A1 (en) 2011-12-09 2021-11-03 Adaptive Biotechnologies Corporation Diagnosis of lymphoid malignancies and minimal residual disease detection
US9499865B2 (en) 2011-12-13 2016-11-22 Adaptive Biotechnologies Corp. Detection and measurement of tissue-infiltrating lymphocytes
EP2626433B1 (en) 2012-02-09 2017-04-12 Max-Planck-Gesellschaft zur Förderung der Wissenschaften e.V. Method for linking nucleic acids in a microsome from endoplasmatic reticuli
ES2662128T3 (en) 2012-03-05 2018-04-05 Adaptive Biotechnologies Corporation Determination of paired immune receptor chains from the frequency of matching subunits
KR101933990B1 (en) 2012-05-02 2018-12-31 심포젠 에이/에스 Humanized pan-her antibody compositions
MX354329B (en) 2012-05-08 2018-02-27 Adaptive Biotechnologies Corp Compositions and method for measuring and calibrating amplification bias in multiplexed pcr reactions.
DK2861760T3 (en) * 2012-06-15 2020-03-09 Univ Texas High capacity sequencing of multiple prints of a single cell
CA2886647A1 (en) 2012-10-01 2014-04-10 Adaptive Biotechnologies Corporation Immunocompetence assessment by adaptive immune receptor diversity and clonality characterization
WO2015160439A2 (en) 2014-04-17 2015-10-22 Adaptive Biotechnologies Corporation Quantification of adaptive immune cell genomes in a complex mixture of cells
CN103074349B (en) * 2012-11-07 2014-10-15 杭州鸿运华宁生物医药工程有限公司 Method for amplifying variable range gene complete sequence of murine antibody heavy chain and light chain from mononuclear B cell by nested one step method RT-PCR
CA2903025A1 (en) * 2013-03-11 2014-10-09 Regeneron Pharmaceuticals, Inc. Transgenic mice expressing chimeric major histocompatibility complex (mhc) class ii molecules
US20160033504A1 (en) * 2013-03-15 2016-02-04 Alder Biopharmaceuticals, Inc. Protocol for identifying and isolating antigen-specific b cells and producing antibodies to desired antigens
EP3381939A1 (en) 2013-05-31 2018-10-03 Genentech, Inc. Anti-wall teichoic antibodies and conjugates
JP6506262B2 (en) 2013-05-31 2019-04-24 ジェネンテック, インコーポレイテッド Anti-cell wall teichoic acid antibody and conjugate
US9803002B2 (en) 2013-05-31 2017-10-31 Genentench, Inc. Anti-wall teichoic antibodies and conjugates
US9708657B2 (en) 2013-07-01 2017-07-18 Adaptive Biotechnologies Corp. Method for generating clonotype profiles using sequence tags
WO2015134787A2 (en) 2014-03-05 2015-09-11 Adaptive Biotechnologies Corporation Methods using randomer-containing synthetic molecules
US10066265B2 (en) 2014-04-01 2018-09-04 Adaptive Biotechnologies Corp. Determining antigen-specific t-cells
EP3715455A1 (en) 2014-10-29 2020-09-30 Adaptive Biotechnologies Corp. Highly-multiplexed simultaneous detection of nucleic acids encoding paired adaptive immune receptor heterodimers from many samples
US10246701B2 (en) 2014-11-14 2019-04-02 Adaptive Biotechnologies Corp. Multiplexed digital quantitation of rearranged lymphoid receptors in a complex mixture
EP3498866A1 (en) 2014-11-25 2019-06-19 Adaptive Biotechnologies Corp. Characterization of adaptive immune response to vaccination or infection using immune repertoire sequencing
WO2016090038A1 (en) 2014-12-03 2016-06-09 Genentech, Inc. Anti-staphylococcus aureus antibody rifamycin conjugates and uses thereof
EP3262196B1 (en) 2015-02-24 2020-12-09 Adaptive Biotechnologies Corp. Method for determining hla status using immune repertoire sequencing
US10513733B2 (en) 2015-03-23 2019-12-24 Board Of Regents, The University Of Texas System High throughout sequencing of paired VH and VL transcripts from B cells secreting antigen-specific antibodies
AU2016242967B2 (en) 2015-04-01 2021-07-01 Adaptive Biotechnologies Corp. Method of identifying human compatible T cell receptors specific for an antigenic target
EP3995587B1 (en) * 2015-06-09 2024-08-14 Gigagen, Inc. Recombinant fusion proteins and libraries from immune cell repertoires
US9422547B1 (en) 2015-06-09 2016-08-23 Gigagen, Inc. Recombinant fusion proteins and libraries from immune cell repertoires
EP4105235A1 (en) 2015-10-02 2022-12-21 Symphogen A/S Anti-pd-1 antibodies and compositions
EP3423457B1 (en) 2016-03-04 2021-02-17 H. Hoffnabb-La Roche Ag Process for the preparation of an antibody-rifamycin conjugate
US20200123237A1 (en) 2016-05-18 2020-04-23 Genmab B.V. Antibodies and methods of use thereof in treatment of infectious disease
US10428325B1 (en) 2016-09-21 2019-10-01 Adaptive Biotechnologies Corporation Identification of antigen-specific B cell receptors
EP3541841B1 (en) 2016-11-18 2024-09-11 Les Laboratoires Servier Anti-pd-1 antibodies and compositions
MX2019007356A (en) * 2016-12-23 2019-09-05 Visterra Inc Binding polypeptides and methods of making the same.
US11866785B2 (en) 2017-10-27 2024-01-09 Board Of Regents, The University Of Texas System Tumor specific antibodies and T-cell receptors and methods of identifying the same
US11254980B1 (en) 2017-11-29 2022-02-22 Adaptive Biotechnologies Corporation Methods of profiling targeted polynucleotides while mitigating sequencing depth requirements
CN113226470A (en) 2018-12-21 2021-08-06 瑞泽恩制药公司 Rifamycin analogs and antibody-drug conjugates thereof
CN109972209B (en) * 2019-01-18 2022-05-03 南开大学 Method for rearranging single-chain antibody library into full-length antibody library by one-step method based on emulsion PCR
WO2020191365A1 (en) 2019-03-21 2020-09-24 Gigamune, Inc. Engineered cells expressing anti-viral t cell receptors and methods of use thereof
CN115109794A (en) * 2022-06-30 2022-09-27 武汉生之源生物科技股份有限公司 Phagemid vector and preparation method and application thereof

Family Cites Families (47)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4683202A (en) 1985-03-28 1987-07-28 Cetus Corporation Process for amplifying nucleic acid sequences
US5618920A (en) * 1985-11-01 1997-04-08 Xoma Corporation Modular assembly of antibody genes, antibodies prepared thereby and use
US5576195A (en) * 1985-11-01 1996-11-19 Xoma Corporation Vectors with pectate lyase signal sequence
US5906936A (en) * 1988-05-04 1999-05-25 Yeda Research And Development Co. Ltd. Endowing lymphocytes with antibody specificity
AU1457992A (en) 1991-03-01 1992-10-06 Stratagene Pcr generated dicistronic dna molecules for producing antibodies
US6284471B1 (en) * 1991-03-18 2001-09-04 New York University Medical Center Anti-TNFa antibodies and assays employing anti-TNFa antibodies
US6277969B1 (en) * 1991-03-18 2001-08-21 New York University Anti-TNF antibodies and peptides of human tumor necrosis factor
DE69228247T2 (en) 1991-08-10 1999-07-08 Medical Research Council, London Treatment of cell populations
JPH07505293A (en) 1992-03-31 1995-06-15 アボツト・ラボラトリーズ Multiplex ligase chain reaction method
GB9220808D0 (en) 1992-10-02 1992-11-18 Medical Res Council Antibody genes
US6096878A (en) * 1993-05-10 2000-08-01 Japan Tobacco Inc. Human immunoglobulin VH gene segments and DNA fragments containing the same
DK0744958T3 (en) 1994-01-31 2003-10-20 Univ Boston Polyclonal antibody libraries
FR2724393A1 (en) * 1994-09-12 1996-03-15 Inst Nat Sante Rech Med OBTAINING A HUMANIZED RECOMBINANT MONOCLONAL ANTIBODY FROM A MURIN MONOCLONAL ANTIBODY, ITS PRODUCTION IN INSECT CELLS, AND USES THEREOF
US6111166A (en) 1994-09-19 2000-08-29 Medarex, Incorporated Transgenic mice expressing human Fcα and β receptors
US6258529B1 (en) * 1994-12-01 2001-07-10 Oravax, Inc. PCR amplification of rearranged genomic variable regions of immunoglobulin genes
US5882856A (en) * 1995-06-07 1999-03-16 Genzyme Corporation Universal primer sequence for multiplex DNA amplification
EP0985033A4 (en) * 1997-04-04 2005-07-13 Biosite Inc Polyvalent and polyclonal libraries
US6610293B1 (en) * 1997-06-16 2003-08-26 The Henry M. Jackson Foundation For The Advancement Of Military Medicine Opsonic and protective monoclonal and chimeric antibodies specific for lipoteichoic acid of gram positive bacteria
WO1999016904A1 (en) 1997-09-29 1999-04-08 City Of Hope Efficient linking of nucleic acid segments
WO1999029888A1 (en) * 1997-12-05 1999-06-17 The Scripps Research Institute Humanization of murine antibody
US6194551B1 (en) * 1998-04-02 2001-02-27 Genentech, Inc. Polypeptide variants
CA2382165A1 (en) * 1999-12-08 2001-06-14 Genset S.A. Full-length human cdnas encoding potentially secreted proteins
EP1283720B1 (en) 2000-05-26 2007-04-18 Symphogen A/S Recombinant or purified polyclonal antibodies for treating allergy
IL136459A0 (en) 2000-05-30 2001-06-14 Galim Galil Immunology Ltd Antibody library
US6849259B2 (en) * 2000-06-16 2005-02-01 Symphogen A/S Polyclonal antibody composition for treating allergy
US6994963B1 (en) * 2000-07-10 2006-02-07 Ambion, Inc. Methods for recombinatorial nucleic acid synthesis
US6867021B2 (en) * 2001-02-20 2005-03-15 Board Of Trustees Of Michigan State University Multiplex RT-PCR/PCR for simultaneous detection of bovine coronavirus, bovine rotavirus, Cryptosporidium parvum, and Escherichia coli
DK1517921T3 (en) * 2002-06-28 2006-10-09 Domantis Ltd Immunoglobulin single variable antigen binding domains and double specific constructs thereof
US20040067532A1 (en) * 2002-08-12 2004-04-08 Genetastix Corporation High throughput generation and affinity maturation of humanized antibody
US20060275766A1 (en) * 2003-01-07 2006-12-07 Haurum John S Method for manufacturing recombinant polyclonal proteins
DK1583830T3 (en) 2003-01-07 2007-01-15 Symphogen As Process for Preparation of Recombinant Polyclonal Proteins
US20050090453A1 (en) * 2003-01-15 2005-04-28 Solbec Pharmaceuticals Co., Ltd. Methods of modulating IL-6
US20060228350A1 (en) * 2003-08-18 2006-10-12 Medimmune, Inc. Framework-shuffling of antibodies
RU2392324C2 (en) * 2003-09-18 2010-06-20 Симфоген А/С Concerned sequences linkage method
TWI333977B (en) * 2003-09-18 2010-12-01 Symphogen As Method for linking sequences of interest
ATE474915T1 (en) * 2003-11-19 2010-08-15 Us Gov Health & Human Serv METHOD FOR INDUCING DEVELOPMENT AND TERMINAL DIFFERENTIATION OF MEMORY B CELLS
WO2006007850A1 (en) 2004-07-20 2006-01-26 Symphogen A/S Anti-rhesus d recombinant polyclonal antibody and methods of manufacture
WO2006007853A2 (en) 2004-07-20 2006-01-26 Symphogen A/S A procedure for structural characterization of a recombinant polyclonal protein or a polyclonal cell line
US20060094645A1 (en) * 2004-10-06 2006-05-04 Oliver Lawless Method for defining and treating chemically-induced immune disorders using tumor necrosis factor (TFNalpha), interleukin-1 (lL-1), and interleulin-6R(lL-6R) antagonists
EP1910513B1 (en) * 2005-07-01 2016-08-10 John Schrader Methods of isolating antibody secreting cells from blood samples of immunized animals and generating monoclonal antibodies from said antibody secreting cells.
US7850965B2 (en) * 2005-12-05 2010-12-14 Symphogen A/S Anti-orthopoxvirus recombinant polyclonal antibody
CN101395182A (en) 2006-03-06 2009-03-25 西福根有限公司 Recombinant antibodies for treatment of respiratory syncytial virus infections
MX2009008909A (en) * 2007-03-01 2009-08-28 Symphogen As Recombinant anti-epidermal growth factor receptor antibody compositions.
BRPI0808673A2 (en) * 2007-03-06 2014-08-12 Symphogen As RECOMBINANT ANTIBODIES FOR TREATMENT OF RESPIRATORY SYNCHIAL VIRUS INFECTIONS.
NZ581316A (en) * 2007-05-25 2012-02-24 Symphogen As Method for manufacturing a recombinant polyclonal protein
CN101821289A (en) * 2007-09-07 2010-09-01 西福根有限公司 Methods for recombinant manufacturing of anti-rsv antibodies
RU2011111725A (en) * 2008-08-29 2012-10-10 Симфоген А/С (Dk) BIRD ANTIBODY CLONING METHOD

Also Published As

Publication number Publication date
TW200911986A (en) 2009-03-16
HK1133905A1 (en) 2010-04-09
CN101622346A (en) 2010-01-06
CA2677771A1 (en) 2008-09-04
SI2121920T1 (en) 2011-12-30
WO2008104184A3 (en) 2008-10-16
KR20090118993A (en) 2009-11-18
ATE521704T1 (en) 2011-09-15
ES2369946T3 (en) 2011-12-09
JP2010535150A (en) 2010-11-18
AU2008221119B2 (en) 2013-11-14
CY1112664T1 (en) 2016-02-10
DK2121920T3 (en) 2011-11-21
US20080227660A1 (en) 2008-09-18
PL2121920T3 (en) 2012-01-31
US20130023447A1 (en) 2013-01-24
PT2121920E (en) 2011-10-03
IL199773A0 (en) 2010-04-15
WO2008104184A2 (en) 2008-09-04
US8283294B2 (en) 2012-10-09
RU2009136342A (en) 2011-04-10
MX2009008908A (en) 2009-08-28
ZA200905831B (en) 2011-02-23
BRPI0808171A2 (en) 2014-08-05
AU2008221119A1 (en) 2008-09-04
HRP20110686T1 (en) 2011-10-31
RU2459868C2 (en) 2012-08-27
EP2121920A2 (en) 2009-11-25
JP5669397B2 (en) 2015-02-12
NZ578851A (en) 2011-09-30

Similar Documents

Publication Publication Date Title
EP2121920B1 (en) Method for cloning cognate antibodies
US7749697B2 (en) Method for linking sequences of interest
US20100069262A1 (en) Method for Cloning Avian-Derived Antibodies
JP2010268806A (en) Method for manufacturing recombinant polyclonal protein
KR101135134B1 (en) Method for linking sequences of interest
MXPA06002894A (en) Method for linking sequences of interest

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20091001

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MT NL NO PL PT RO SE SI SK TR

17Q First examination report despatched

Effective date: 20100118

REG Reference to a national code

Ref country code: HK

Ref legal event code: DE

Ref document number: 1133905

Country of ref document: HK

GRAP Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOSNIGR1

DAX Request for extension of the european patent (deleted)
GRAS Grant fee paid

Free format text: ORIGINAL CODE: EPIDOSNIGR3

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MT NL NO PL PT RO SE SI SK TR

REG Reference to a national code

Ref country code: GB

Ref legal event code: FG4D

REG Reference to a national code

Ref country code: CH

Ref legal event code: EP

REG Reference to a national code

Ref country code: IE

Ref legal event code: FG4D

REG Reference to a national code

Ref country code: HR

Ref legal event code: TUEP

Ref document number: P20110686

Country of ref document: HR

REG Reference to a national code

Ref country code: PT

Ref legal event code: SC4A

Free format text: AVAILABILITY OF NATIONAL TRANSLATION

Effective date: 20110926

REG Reference to a national code

Ref country code: DE

Ref legal event code: R096

Ref document number: 602008009102

Country of ref document: DE

Effective date: 20111027

REG Reference to a national code

Ref country code: HR

Ref legal event code: T1PR

Ref document number: P20110686

Country of ref document: HR

REG Reference to a national code

Ref country code: SE

Ref legal event code: TRGR

REG Reference to a national code

Ref country code: NL

Ref legal event code: T3

REG Reference to a national code

Ref country code: DK

Ref legal event code: T3

Ref country code: RO

Ref legal event code: EPE

REG Reference to a national code

Ref country code: ES

Ref legal event code: FG2A

Ref document number: 2369946

Country of ref document: ES

Kind code of ref document: T3

Effective date: 20111209

REG Reference to a national code

Ref country code: NO

Ref legal event code: T2

Effective date: 20110824

REG Reference to a national code

Ref country code: HK

Ref legal event code: GR

Ref document number: 1133905

Country of ref document: HK

REG Reference to a national code

Ref country code: PL

Ref legal event code: T3

REG Reference to a national code

Ref country code: EE

Ref legal event code: FG4A

Ref document number: E006173

Country of ref document: EE

Effective date: 20111121

REG Reference to a national code

Ref country code: HU

Ref legal event code: AG4A

Ref document number: E012257

Country of ref document: HU

REG Reference to a national code

Ref country code: GR

Ref legal event code: EP

Ref document number: 20110402641

Country of ref document: GR

Effective date: 20120117

REG Reference to a national code

Ref country code: SK

Ref legal event code: T3

Ref document number: E 11176

Country of ref document: SK

PLBI Opposition filed

Free format text: ORIGINAL CODE: 0009260

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: TR

Payment date: 20120213

Year of fee payment: 5

26 Opposition filed

Opponent name: STERLING IP

Effective date: 20120522

PLAX Notice of opposition and request to file observation + time limit sent

Free format text: ORIGINAL CODE: EPIDOSNOBS2

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: HR

Payment date: 20120124

Year of fee payment: 5

REG Reference to a national code

Ref country code: DE

Ref legal event code: R026

Ref document number: 602008009102

Country of ref document: DE

Effective date: 20120522

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: MC

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20120229

REG Reference to a national code

Ref country code: CH

Ref legal event code: PL

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: LI

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20120229

Ref country code: CH

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20120229

PLAF Information modified related to communication of a notice of opposition and request to file observations + time limit

Free format text: ORIGINAL CODE: EPIDOSCOBS2

PLBB Reply of patent proprietor to notice(s) of opposition received

Free format text: ORIGINAL CODE: EPIDOSNOBS3

REG Reference to a national code

Ref country code: HR

Ref legal event code: ODRP

Ref document number: P20110686

Country of ref document: HR

Payment date: 20130124

Year of fee payment: 6

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: FI

Payment date: 20130213

Year of fee payment: 6

Ref country code: CZ

Payment date: 20130222

Year of fee payment: 6

Ref country code: SE

Payment date: 20130219

Year of fee payment: 6

Ref country code: IE

Payment date: 20130219

Year of fee payment: 6

Ref country code: IS

Payment date: 20130122

Year of fee payment: 6

Ref country code: RO

Payment date: 20130201

Year of fee payment: 6

Ref country code: LT

Payment date: 20130123

Year of fee payment: 6

Ref country code: NO

Payment date: 20130214

Year of fee payment: 6

Ref country code: EE

Payment date: 20130213

Year of fee payment: 6

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: PL

Payment date: 20130124

Year of fee payment: 6

Ref country code: SK

Payment date: 20130226

Year of fee payment: 6

Ref country code: NL

Payment date: 20130219

Year of fee payment: 6

Ref country code: GR

Payment date: 20130222

Year of fee payment: 6

Ref country code: SI

Payment date: 20130129

Year of fee payment: 6

Ref country code: LV

Payment date: 20130213

Year of fee payment: 6

Ref country code: BE

Payment date: 20130220

Year of fee payment: 6

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: BG

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20111124

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: PT

Payment date: 20130218

Year of fee payment: 6

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: MT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20110824

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: CY

Payment date: 20130129

Year of fee payment: 6

PLCK Communication despatched that opposition was rejected

Free format text: ORIGINAL CODE: EPIDOSNREJ1

PLAB Opposition data, opponent's data or that of the opponent's representative modified

Free format text: ORIGINAL CODE: 0009299OPPO

R26 Opposition filed (corrected)

Opponent name: STERLING IP

Effective date: 20120522

PLBN Opposition rejected

Free format text: ORIGINAL CODE: 0009273

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: OPPOSITION REJECTED

27O Opposition rejected

Effective date: 20130927

REG Reference to a national code

Ref country code: DE

Ref legal event code: R100

Ref document number: 602008009102

Country of ref document: DE

Effective date: 20130927

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: LU

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20120227

BERE Be: lapsed

Owner name: SYMPHOGEN A/S

Effective date: 20140228

REG Reference to a national code

Ref country code: PT

Ref legal event code: MM4A

Free format text: LAPSE DUE TO NON-PAYMENT OF FEES

Effective date: 20140827

REG Reference to a national code

Ref country code: HR

Ref legal event code: PBON

Ref document number: P20110686

Country of ref document: HR

Effective date: 20140227

REG Reference to a national code

Ref country code: NL

Ref legal event code: V1

Effective date: 20140901

REG Reference to a national code

Ref country code: SE

Ref legal event code: EUG

REG Reference to a national code

Ref country code: EE

Ref legal event code: MM4A

Ref document number: E006173

Country of ref document: EE

Effective date: 20140228

REG Reference to a national code

Ref country code: GR

Ref legal event code: ML

Ref document number: 20110402641

Country of ref document: GR

Effective date: 20140903

REG Reference to a national code

Ref country code: LT

Ref legal event code: MM4D

Effective date: 20140227

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: NL

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20140901

Ref country code: FI

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20140227

Ref country code: NO

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20140228

Ref country code: IS

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20140829

Ref country code: CY

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20140227

Ref country code: LT

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20140227

Ref country code: EE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20140228

Ref country code: GR

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20140903

Ref country code: RO

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20140227

Ref country code: CZ

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20140227

REG Reference to a national code

Ref country code: SK

Ref legal event code: MM4A

Ref document number: E 11176

Country of ref document: SK

Effective date: 20140227

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: HR

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20140227

Ref country code: LV

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20140227

Ref country code: SK

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20140227

Ref country code: SI

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20140228

Ref country code: SE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20140228

REG Reference to a national code

Ref country code: SI

Ref legal event code: KO00

Effective date: 20141007

REG Reference to a national code

Ref country code: IE

Ref legal event code: MM4A

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: PT

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20140827

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: IE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20140227

Ref country code: BE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20140228

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: PL

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20140227

REG Reference to a national code

Ref country code: PL

Ref legal event code: LAPE

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 9

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: HU

Payment date: 20160217

Year of fee payment: 9

Ref country code: AT

Payment date: 20160218

Year of fee payment: 9

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 10

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: TR

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20140227

REG Reference to a national code

Ref country code: AT

Ref legal event code: MM01

Ref document number: 521704

Country of ref document: AT

Kind code of ref document: T

Effective date: 20170227

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: AT

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20170227

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: HU

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20170228

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 11

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 16

REG Reference to a national code

Ref country code: DE

Ref legal event code: R081

Ref document number: 602008009102

Country of ref document: DE

Owner name: LES LABORATOIRES SERVIER, FR

Free format text: FORMER OWNER: SYMPHOGEN A/S, LYNGBY, DK

REG Reference to a national code

Ref country code: GB

Ref legal event code: 732E

Free format text: REGISTERED BETWEEN 20230216 AND 20230222

REG Reference to a national code

Ref country code: ES

Ref legal event code: PC2A

Owner name: LES LABORATOIRES SERVIER

Effective date: 20230323

P01 Opt-out of the competence of the unified patent court (upc) registered

Effective date: 20230524

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: FR

Payment date: 20231229

Year of fee payment: 17

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: ES

Payment date: 20240307

Year of fee payment: 17

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: DE

Payment date: 20231229

Year of fee payment: 17

Ref country code: GB

Payment date: 20240108

Year of fee payment: 17

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: IT

Payment date: 20240111

Year of fee payment: 17

Ref country code: DK

Payment date: 20240214

Year of fee payment: 17