WO2004003136A2 - Methods of detecting sequence differences - Google Patents

Methods of detecting sequence differences Download PDF

Info

Publication number
WO2004003136A2
WO2004003136A2 PCT/US2003/019576 US0319576W WO2004003136A2 WO 2004003136 A2 WO2004003136 A2 WO 2004003136A2 US 0319576 W US0319576 W US 0319576W WO 2004003136 A2 WO2004003136 A2 WO 2004003136A2
Authority
WO
WIPO (PCT)
Prior art keywords
amplification
primer
primers
polymoφhic site
sequence
Prior art date
Application number
PCT/US2003/019576
Other languages
French (fr)
Other versions
WO2004003136A3 (en
Inventor
Vladimir I. Slepnev
Original Assignee
Sention, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sention, Inc. filed Critical Sention, Inc.
Priority to AU2003243700A priority Critical patent/AU2003243700B2/en
Priority to EP03761972A priority patent/EP1534859B1/en
Priority to DE60335116T priority patent/DE60335116D1/en
Priority to CN03815401.3A priority patent/CN1665938B/en
Priority to JP2004517727A priority patent/JP4617403B2/en
Priority to AT03761972T priority patent/ATE489477T1/en
Priority to CA2490864A priority patent/CA2490864C/en
Publication of WO2004003136A2 publication Critical patent/WO2004003136A2/en
Publication of WO2004003136A3 publication Critical patent/WO2004003136A3/en
Priority to AU2009200987A priority patent/AU2009200987A1/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6844Nucleic acid amplification reactions
    • C12Q1/6858Allele-specific amplification
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P19/00Preparation of compounds containing saccharide radicals
    • C12P19/26Preparation of nitrogen-containing carbohydrates
    • C12P19/28N-glycosides
    • C12P19/30Nucleotides
    • C12P19/34Polynucleotides, e.g. nucleic acids, oligoribonucleotides
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2525/00Reactions involving modified oligonucleotides, nucleic acids, or nucleotides
    • C12Q2525/10Modifications characterised by
    • C12Q2525/155Modifications characterised by incorporating/generating a new priming site
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2535/00Reactions characterised by the assay type for determining the identity of a nucleotide base or a sequence of oligonucleotides
    • C12Q2535/125Allele specific primer extension
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2537/00Reactions characterised by the reaction format or use of a specific feature
    • C12Q2537/10Reactions characterised by the reaction format or use of a specific feature the purpose or use of
    • C12Q2537/143Multiplexing, i.e. use of multiple primers or probes in a single reaction, usually for simultaneously analyse of multiple analysis
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2565/00Nucleic acid analysis characterised by mode or means of detection
    • C12Q2565/10Detection mode being characterised by the assay principle
    • C12Q2565/125Electrophoretic separation

Definitions

  • the invention relates to molecular genetic methods for the identification of sequence differences in the genome of an individual relative to the sequences of a population of individuals. More particularly, the invention relates to methods for the identification of single nucleotide differences in genomic sequences.
  • the nucleic acids comprising the genome of an organism contain the genetic information for that organism. Variability in gene sequences between individuals accounts for many of the obvious phenotypic differences (such as pigmentation of hair, skin, etc.) and many non-obvious ones (such as drug tolerance and disease susceptibility). Even minute changes in a nucleotide sequence, including single base pair substitutions, can have a significant effect on the quality or quantity of a protein. Single nucleotide changes are referred to as single nucleotide polymo ⁇ hisms or simply SNPs, and the site at which the SNP occurs is referred to herein as a polymo ⁇ hic site.
  • DNA polymo ⁇ hisms are located throughout the genome, within and between genes, and the various fornis may or may not result in differential gene function (as determined by comparing the function of two alternative forms of the same sequence). Most polymo ⁇ hisms do not alter gene function and are termed "neutral" polymo ⁇ hisms. Others do have affect gene function, for example, by changing the amino acid sequence of a protein, or by altering control sequences such as promoters or RNA splicing or degradation signals, and are more commonly referred to as mutations.
  • SNPs Diseases associated with SNPs include: sickle cell anemia, ⁇ - thalassemias, diabetes, cystic fibrosis, hyperlipoproteinemia, a wide variety of autoimmune diseases, and the formation of some oncogenes, e.g., mutant p53.
  • point mutations can cause altered pathogenicity or susceptibility to disease and resistance to therapeutics.
  • the ability to detect specific nucleotide alterations or mutations in DNA sequences is useful for a number of medical and non-medical pu ⁇ oses. Methods capable of identifying nucleotide alterations permit screening and diagnosis of diseases associated with SNPs. Polymo ⁇ hisms are also useful in genetic studies to identify genes involved with a disease. If a polymo ⁇ hism alters the function of one or more genes such that disease susceptibility is increased, the polymo ⁇ hism will be present more often in individuals with the disease relative to those without the disease. Statistical methods can be used to evaluate polymo ⁇ hism frequencies found in diseased relative to normal populations, and can facilitate the establishment of a causal link between a polymo ⁇ hism and a disease phenotype.
  • Non-medical applications of SNPs include, for example, the detection of microorganisms or particular strains of them, and in forensic analysis.
  • RFLP analysis is limited in that it can only detect those changes that affect a restriction endonuclease cleavage site, and the method is dependent upon gel electrophoresis and staining, which limits throughput.
  • Single-strand conformational polymo ⁇ hism (SSCP) analysis can also detect SNPs in an amplified DNA fragment.
  • the amplified fragment is denatured then allowed to re-anneal during electrophoresis in non-denaturing polyacrylamide gels.
  • the presence of single nucleotide sequence changes can cause a detectable change in the conformation and electrophoretic migration of a sample relative to wild-type sequence. This method is limited in its dependence upon polyacrylamide gel electrophoresis.
  • Hybridization-based methods employ allele-specific oligonucleotide (ASO) probes (see, e.g., European Patent Publications EP -237362 and EP-32931 1).
  • the hybridization-based methods include, for example, detection based on ribonuclease A cleavage at mismatches in probe RNA:sample DNA duplexes or denaturing gradient gel electrophoresis for mismatches in probe DNA:sample DNA duplexes (reviewed in Landegren et al., Science 242:229-237, 1988; Rossiter et al, J. Biol. Chem. 265:12753-12756, 1990).
  • ASO allele-specific oligonucleotide
  • genotyping SNPs employ allele-specific amplification (see, e.g., U.S. Pat. Nos. 5,521,301; 5,639,611; and 5,981,176), mini-sequencing methods, quantitative RT-PCR methods (eg., the so-called "TaqMan assays"; see, e.g., U.S. Pat. No. 5,210,015 to Gelfand, U.S. Pat. No. 5,538,848 to Livak, et al., and U.S. Pat. No. 5,863,736 to Haaland, as well as Heid, C.A., et al. Genome Research, 6:986-994 (1996); Gibson, U .E.
  • SNP genotyping methods rely at some point upon PCR amplification, either to generate enough material for analysis (e.g., SSCP analysis) or to differentially amplify one form over another so as to detect differences (e.g, the primer extension assays).
  • SSCP analysis e.g., SSCP analysis
  • differentially amplify one form over another so as to detect differences
  • the primer extension assays e.g., the primer extension assays.
  • efforts are being focused on multiplexing the reactions so that multiple SNPs can be detected in a single set of reactions.
  • Multiplexing by simply adding primer pairs specific for multiple SNP-containing fragments faces problems caused by primer interactions that lead to inefficient amplification of target fragments and to the generation of artifact fragments.
  • CE Capillary electrophoresis
  • PCR-amplified DNA containing a known SNP was extended with one of two identically fluorescently labeled chain terminators, followed by CE separation and detection of inco ⁇ orated label.
  • the identities of inco ⁇ orated terminators are determined based on sequence-specific differences in CE migration for oligonucleotides.
  • McClay et al. 2002, Anal. Biochem. 301 : 200-206 describe an SNP genotyping assay involving PCR using a set of two differentially fluorescently labeled primers differing in their 3 '-terminal base with a common upstream primer, followed by CE and fluorescent detection. Throughput was increased by mixing amplification products of different sizes and electrophoresing together.
  • U.S. Patent No. 6,074,831 teaches the use of CE for the concurrent separation of molecules partitioned into subsets according to graph theory techniques, and the application of the method to SNP genotyping.
  • U.S. Patent No. 6,322,980 describes the use of CE in an SNP detection method using the exonuclease activity of a polymerase to release a fluorescent label from a primer hybridized to the polymo ⁇ hic site.
  • U.S. Patent No. 6,270,973 also describes the use of CE separation in an SNP genotyping method involving nucleic acid probe depolymerizing activity.
  • U.S. Patent No. 6,312,893 describes a sequencing method that generates organically tagged fragments in which the tag correlates with a particular nucleotide. Fragments are separated by CE, followed by tag cleavage from the fragments and detection of cleaved tags by non-fluorescent spectrometry or potentiometry.
  • U.S. Patent No. 6,156,178 describes the use of CE in an SNP detection method using a depolymerizing activity to release an identifier nucleotide from a primer hybridized to the polymo ⁇ hic site.
  • the invention provides methods useful for genotyping nucleic acid samples with regard to sequence differences.
  • the methods are useful for the determination of single nucleotide differences, e.g, single nucleotide polymorphisms.
  • the methods of the invention use PCR amplification of primer extension products comprising heterologous sequence tags, followed by capillary electrophoretic size separation and detection of the amplified extension products.
  • the size separation and product detection are performed in real time. Because the CE separation and detection techniques provide information including the amplified fragment size and the identity of label present on any given amplification product, the disclosed methods are particularly well suited for simultaneously analyzing samples for genotype with regard to multiple known SNPs.
  • Each known SNP can be detected by the amplification of a discretely sized amplification fragment bearing a distinguishably labeled sequence tag that specifically correlates with the presence of a particular nucleotide at that polymo ⁇ hic site.
  • Methods according to the invention also have the advantage of requiring one set of amplification primers for the detection of multiple SNPs, thereby reducing the impact of problems related to the use of multiple different amplification primers.
  • the invention encompasses a method of determining for a given nucleic acid sample, the identity of the nucleotide at a known polymo ⁇ hic site, the method comprising: a) subjecting to an amplification regimen a population of primer extension products generated from a nucleic acid sample, each primer extension product comprising a tag sequence, which tag sequence specifically corresponds to the presence of one specific nucleotide at a known polymo ⁇ hic site, wherein the amplification regimen is performed using an upstream amplification primer and a set of distinguishably labeled downstream amplification primers, each member of the set of downstream amplification primers comprising a tag sequence comprised by a member of the population of primer extension products and a distinguishable label, wherein each distinguishable label specifically corresponds to the presence of a specific nucleotide at the polymo ⁇ hic site; and b) detecting incorporation of a distinguishable label into a nucleic acid molecule, thereby to determine the identity of the nucleot
  • the distinguishable label is a fluorescent label.
  • step (b) comprises separating nucleic acid molecules made during the amplification regimen by size and/or by charge.
  • the separating comprises capillary electrophoresis.
  • the amplification regimen comprises at least two amplification reaction cycles, wherein each cycle comprises the steps of: 1) nucleic acid strand separation; 2) oligonucleotide primer annealing; and 3) polymerase extension of annealed primers.
  • the method further comprises the steps, during the amplification regimen and after at least one of the reaction cycles, of removing an aliquot of the amplification reaction, separating nucleic acid molecules by size and/or by charge, and detecting the inco ⁇ oration of a distinguishable label, wherein the detecting determines the identity of the nucleotide at the polymo ⁇ hic site.
  • the removing, separating and detecting are performed after each cycle in the regimen.
  • the separating comprises capillary electrophoresis.
  • steps (a) and (b) are performed in a modular apparatus comprising a thermal cycler, a sampling device, a capillary electrophoresis device and a fluorescence detector.
  • the tag sequence comprises 15 to 40 nucleotides.
  • the set of distinguishably labeled downstream amplification primers consists of: a primer that comprises a tag sequence that specifically corresponds to the presence of A at the polymo ⁇ hic site; a primer that comprises a tag sequence that specifically corresponds to the presence of C at the polymo ⁇ hic site; a primer that comprises a tag sequence that specifically corresponds to the presence of G at the polymo ⁇ hic site; and a primer that comprises a tag sequence that specifically corresponds to the presence of T at the polymo ⁇ hic site.
  • the set of distinguishably labeled downstream amplification primers consists of a pair of oligonucleotides, one comprising a tag sequence that specifically corresponds to a first allele of the polymo ⁇ hic site and one comprising a tag sequence that specifically corresponds to a second allele of the polymo ⁇ hic site.
  • Another embodiment further comprises the step, before step (a), of removing primers not inco ⁇ orated when the population of primer extension products was made.
  • the step of removing primers comprises degrading the primers not inco ⁇ orated when the population of primer extension products was made.
  • the degrading is performed using a heat labile exonuclease.
  • the heat labile exonuclease is selected from the group consisting of Exonuclease I and Exonuclease VII.
  • the heat labile exonuclease is thermally inactivated before continuing to step (a).
  • the invention further emcompasses a method of determining, for a given nucleic acid sample, the identities of the nucleotides at a set of known polymo ⁇ hic sites to be interrogated, the method comprising: a) subjecting to an amplification regimen, a population of primer extension products generated from a nucleic acid sample, each primer extension product comprising a member of a set of tag sequences, which tag sequence specifically corresponds to the presence of one specific nucleotide at a known polymo ⁇ hic site, wherein the amplification regimen is performed using one upstream amplification primer for each sequence comprising a known polymo ⁇ hic site to be interrogated, and a set of distinguishably labeled downstream amplification primers, each member of the set of downstream amplification primers comprising a tag sequence comprised by a member of the population of primer extension products and a distinguishable label that specifically corresponds to the presence of a specific nucleotide at the polymo ⁇ hic site, and
  • the distinguishable label is a fluorescent label.
  • step (b) comprises separating nucleic acid molecules made during the amplification regimen by size and/or by charge.
  • the separating comprises capillary electrophoresis.
  • the amplification regimen comprises at least two amplification reaction cycles, wherein each cycle comprises the steps of: 1) nucleic acid strand separation; 2) oligonucleotide primer annealing; and 3) polymerase extension of annealed primers.
  • a preferred embodiment further comprises the steps, during the amplification regimen and after at least one of the reaction cycles, of removing an aliquot of the amplification reaction, separating nucleic acid molecules by size and/or by charge, and detecting the inco ⁇ oration of a the distinguishable label, wherein the detecting determines the identity of the nucleotide at the polymo ⁇ hic site.
  • the removing, separating and detecting are performed after each cycle in the regimen.
  • the separating comprises capillary electrophoresis.
  • steps (a) and (b) are perforaied in a modular apparatus comprising a thermal cycler, a sampling device, a capillary electrophoresis device and a fluorescent detector.
  • the tag sequence comprises 15 to 40 nucleotides.
  • the set of distinguishably labeled downstream amplification primers consists of: a subset that comprises a tag sequence that specifically corresponds to the presence of A at the polymorphic site; a subset that comprises a tag sequence that specifically corresponds to the presence of C at the polymo ⁇ hic site; a subset that comprises a tag sequence that specifically corresponds to the presence of G at the polymo ⁇ hic site; and a subset that comprises a tag sequence that specifically corresponds to the presence of T at the polymo ⁇ hic site.
  • Another embodiment further comprises the step, before step (a), of removing primers not inco ⁇ orated when the population of primer extension products was made.
  • the step of removing primers comprises degrading the primers not inco ⁇ orated when the population of primer extension products was made.
  • the degrading is performed using a heat labile exonuclease.
  • the heat labile exonuclease is selected from the group consisting of Exonuclease I and Exonuclease VII.
  • the heat labile exonuclease is thermally inactivated before continuing to step (a).
  • the invention further encompasses a method of determining, for a given nucleic acid sample, the identities of the nucleotides at a set of known polymo ⁇ hic sites to be interrogated, the method comprising: a) subjecting to an amplification regimen, a population of primer extension products generated from a nucleic acid sample, each primer extension product comprising a first tag sequence or its complement and a member of a set of second tag sequences or its complement, the presence of which second tag sequence or its complement specifically corresponds to the presence of one specific nucleotide at a known polymo ⁇ hic site, wherein for each polymo ⁇ hic site in the set of polymo ⁇ hic sites, the first tag sequence is located at a distinct distance 5' of the polymo ⁇ hic site, relative to the distance of the first tag sequence from a polymo ⁇ hic site on molecules in the sample containing other polymo ⁇ hic sites, wherein the amplification regimen is performed using an upstream amplification primer comprising the first
  • the distinguishable label is a fluorescent label.
  • step (b) comprises separating nucleic acid molecules made during the amplification regimen by size and/or by charge. In a preferred embodiment wherein the separating comprises capillary electrophoresis.
  • the amplification regimen comprising at least two amplification reaction cycles, wherein each cycle comprises the steps of: 1) nucleic acid strand separation; 2) oligonucleotide primer annealing; and 3) polymerase extension of annealed primers.
  • a preferred embodiment further comprises the steps, during the amplification regimen and after at least one of the reaction cycles, of removing an aliquot of the amplification reaction, separating nucleic acid molecules by size andor by charge, and detecting the inco ⁇ oration of a distinguishable label, wherein the detecting determines the identity of the nucleotide at the polymo ⁇ hic site.
  • the removing, separating and detecting are performed after each cycle in the regimen.
  • the separating comprises capillary electrophoresis.
  • steps (a) and (b) are performed in a modular apparatus comprising a thermal cycler, a sampling device, a capillary electrophoresis device and a fluorescent detector.
  • the tag sequence comprises 15 to 40 nucleotides.
  • the set of distinguishably labeled downstream amplification primers consists of: a subset that comprises a tag sequence that specifically corresponds to the presence of A at the polymo ⁇ hic site; a subset that comprises a tag sequence that specifically corresponds to the presence of C at the polymo ⁇ hic site; a subset that comprises a tag sequence that specifically corresponds to the presence of G at the polymorphic site; and a subset that comprises a tag sequence that specifically corresponds to the presence of T at the polymo ⁇ hic site.
  • Another embodiment further comprises the step, before step (a), of removing primers not inco ⁇ orated when the population of primer extension products was made.
  • the step of removing primers comprises degrading the primers not inco ⁇ orated when the population of primer extension products was made.
  • the degrading is performed using a heat labile exonuclease.
  • the heat labile exonuclease is selected from the group consisting of Exonuclease I and Exonuclease VII.
  • the heat labile exonuclease is thermally inactivated before continuing to step (a).
  • the invention further encompasses a method of determining the identity of a single nucleotide at a known polymo ⁇ hic site, the method comprising: I) providing a nucleic acid sample comprising the polymo ⁇ hic site; II) separating the strands of the nucleic acid sample and re-annealing in the presence of: a) a first oligonucleotide primer comprising a 3 ' region that hybridizes to a sequence at a known distance upstream of the known polymo ⁇ hic site, the first oligonucleotide primer comprising a first sequence tag located 5' of the 3' region; and b) a set of second oligonucleotide primers, wherein each member of the set comprises: i) a region that hybridizes 3' of and adjacent to the polymo ⁇ hic site; ii) a variable 3' terminal nucleotide, wherein, when the member is hybridized to the known sequence, the 3' temi
  • the distinguishable label is a fluorescent label.
  • step (VIII) comprises separating nucleic acid molecules made during the amplification regimen by size and/or by charge, h a preferred embodiment the separating comprises capillary electrophoresis.
  • the amplification regimen comprises at least two amplification reaction cycles, wherein each cycle comprises the steps of: 1) nucleic acid strand separation; 2) oligonucleotide primer annealing; and 3) polymerase extension of annealed primers.
  • a preferred embodiment further comprises the steps, during the amplification regimen and after at least one of the reaction cycles, of removing an aliquot of the amplification reaction, separating nucleic acid molecules by size and/or by charge, and detecting the inco ⁇ oration of a distinguishable label, wherein the detecting determines the identity of the nucleotide at the polymo ⁇ hic site.
  • the removing, separating and detecting are performed after each cycle in the regimen.
  • steps I- VIII are performed in a modular apparatus comprising a thermal cycler, a sampling device, a capillary electrophoresis device and a fluorescent detector.
  • the tag sequences each comprise 15 to 40 nucleotides.
  • the 3' region that hybridizes to a sequence at a known distance upstream of the known polymorphic site comprises 10-30 nucleotides.
  • the region that hybridizes 3' of and adjacent to the polymo ⁇ hic site comprises 10-30 nucleotides.
  • the set of downstream amplification primers consists of: a subset that comprises a tag sequence that specifically corresponds to the presence of A at the polymorphic site; a subset that comprises a tag sequence that specifically corresponds to the presence of C at the polymorphic site; a subset that comprises a tag sequence that specifically corresponds to the presence of G at the polymo ⁇ hic site; and a subset that comprises a tag sequence that specifically corresponds to the presence of T at the polymo ⁇ hic site.
  • the invention further encompasses a method of determining the identities of single nucleotides present at a group of known polymo ⁇ hic sites, the method comprising: I) providing a nucleic acid sample comprising the group of polymo ⁇ hic sites; II) separating the strands of the nucleic acid sample and re-annealing in the presence of: a) a set of first oligonucleotide primers each comprising a 3' region that hybridizes to a sequence at a known distance upstream of a known polymo ⁇ hic site, each member of the set of first oligonucleotide primers comprising a common sequence tag located 5' of the 3' region, and each member of the set of first oligonucleotide primers selected such that a distinctly sized amplification product is generated for each polymo ⁇ hic site in the group of known polymo ⁇ hic sites; and b) a set of downstream amplification primers comprising, in 5' to 3
  • the distinguishable label is a fluorescent label.
  • the step (VIII) comprises separating nucleic acid molecules made during the amplification regimen by size and/or by charge.
  • the separating comprises capillary electrophoresis.
  • the amplification regimen comprising at least two amplification reaction cycles, wherein each cycle comprises the steps of: 1) nucleic acid strand separation; 2) oligonucleotide primer annealing; and 3) polymerase extension of annealed primers.
  • a preferred embodiment further comprises the steps, during the amplification regimen and after at least one of the reaction cycles, of removing an aliquot of the amplification reaction, separating nucleic acid molecules by size and/or by charge, and detecting the inco ⁇ oration of a distinguishable label, wherein the detecting determines the identity of the nucleotide at the polymo ⁇ hic site.
  • the removing, separating and detecting are performed after each cycle in the regimen.
  • steps I- VIII are performed in a modular apparatus comprising a thermal cycler, a sampling device, a capillary electrophoresis device and a fluorescent detector.
  • the tag sequences each comprise 15 to 40 nucleotides.
  • the 3' region that hybridizes to a sequence at a known distance upstream of the known polymo ⁇ hic site comprises 10-30 nucleotides.
  • the region that hybridizes 3' of and adjacent to the polymo ⁇ hic site comprises 10-30 nucleotides.
  • the set of distinguishably labeled downstream amplification primers consists of: a subset that comprises a tag sequence that specifically corresponds to the presence of A at the polymo ⁇ hic site; a subset that comprises a tag sequence that specifically corresponds to the presence of C at the polymo ⁇ hic site; a subset that comprises a tag sequence that specifically corresponds to the presence of G at the polymo ⁇ hic site; and a subset that comprises a tag sequence that specifically corresponds to the presence of T at the polymo ⁇ hic site.
  • the invention further encompasses a kit for the determination of the nucleotide present at a polymo ⁇ hic site present on a nucleic acid sample, the kit comprising a set of upstream primers comprising: a) a first primer comprising a 5 '-tag sequence and 3' sequence sufficient to specifically hybridize at a known distance upstream of a known polymo ⁇ hic site; and b) a set of 4 downstream second primers, comprising in 5' to 3' order: i) a sequence tag selected from the group consisting of a tag specifically corresponding to G as the 3 '-terminal nucleotide of the primer; a tag specifically corresponding to A as the 3 '-terminal nucleotide of the primer; a tag specifically corresponding to T as the 3 '-terminal nucleotide of the primer; and a tag specifically corresponding to C as the 3 '-terminal nucleotide of the primer; ii) a region that specifically hybridizes to a
  • One embodiment further comprises a set of 5 primers lacking sequence specific for a gene in the genome of the organism being examined for polymo ⁇ hisms, the primers comprising a primer comprising the tag sequence of the first primer and a set of four distinguishably labeled primers comprising the tag sequences of the set of four downstream second primers.
  • sample refers to a biological material which is isolated from its natural environment and containing a polynucleotide.
  • a “sample” according to the invention can consist of purified or isolated polynucleotide, or it may comprise a biological sample such as a tissue sample, a biological fluid sample, or a cell sample comprising a polynucleotide.
  • a biological fluid includes blood, plasma, sputum, urine, cerebrospinal fluid, lavages, and leukophoresis samples.
  • a sample of the present invention may be any plant, animal, bacterial or viral material containing a polynucleotide.
  • polymo ⁇ hism refers to a nucleic acid sequence variation. When compared to a naturally occurring sequence, a polymo ⁇ hism can be present at a frequency of greater than 0.01%, 0.1%, 1% or greater in a population. As used herein, a polymo ⁇ hism can be an insertion, deletion, duplication, or rearrangement. As used herein, a “single nucleotide polymo ⁇ hism” or “SNP" refers to nucleic acid sequence variation at a single nucleotide residue, including a single nucleotide deletion, insertion, or base change.
  • a polymo ⁇ hism including a SNP, can be phenotypically neutral or can have an associated variant phenotype that distinguishes it from that exhibited by the predominant sequence at that locus.
  • neutral polymo ⁇ hism refers to a polymo ⁇ hism in which the sequence variation does not alter gene function
  • mutation or “functional polymo ⁇ hism” refers to a sequence variation which does alter gene function, and which thus has an associated phenotype.
  • the "predominant allele” is that which occurs most frequently in the population being examined (i.e., when there are two alleles, the allele that occurs in greater than 50% of the population is the predominant allele; when there are more than two alleles, the "predominant allele” is that which occurs in the subject population at the highest frequency, e.g, at least 5% higher frequency, relative to the other alleles at that site).
  • variant allele is used to refer to the allele or alleles occurring less frequently than the predominant allele in that population (e.g, when there are two alleles, the variant allele is that which occurs in less than 50% of the subject population; when there are more than two alleles, the variant alleles are all of those that occur less frequently, e.g, at least 5% less frequently, than the predominant allele).
  • polymo ⁇ hic site refers to the position, in a polymo ⁇ hic nucleotide sequence, of the nucleotide that varies among individuals.
  • an "oligonucleotide primer” refers to a polynucleotide molecule (i.e., DNA or RNA) capable of annealing to a polynucleotide template and providing a 3' end to produce an extension product which is complementary to the polynucleotide template.
  • the conditions for initiation and extension usually include the presence of four different deoxyribonucleoside triphosphates and a polymerization-inducing agent such as DNA polymerase or reverse transcriptase, in a suitable buffer ("buffer” includes substituents which are cofactors, or which affect pH, ionic strength, etc.) and at a suitable temperature.
  • the primer according to the invention may be single- or double-stranded.
  • Primer is single-stranded for maximum efficiency in amplification, and the primer and its complement form a double-stranded polynucleotide.
  • "Primers" useful in the present invention are less than or equal to 100 nucleotides in length, e.g, less than or equal to 90, or 80, or 70, or 60, or 50, or 40, or 30, or 20, or 15, or equal to 10 nucleotides in length.
  • polymerase extension means the template-dependent inco ⁇ oration of at least one complementary nucleotide, by a nucleic acid polymerase, onto the 3' end of an annealed primer. Polymerase extension preferably adds more than one nucleotide, preferably up to and including nucleotides corresponding to the full length of the template.
  • polymerase extension Conditions for polymerase extension vary with the identity of the polymerase.
  • the temperature of polymerase extension is based upon the known activity properties of the enzyme. In general, although the enzymes retain at least partial activity below their optimal extension temperatures, polymerase extension by the most commonly used thermostable polymerases (e.g, Taq polymerase and variants thereof) is perfo ⁇ ned at 65°C to 75°C, preferably about 68-72°C.
  • primer extension products refers to nucleic acid molecules generated by the process of polymerase extension.
  • tag sequence refers to a nucleotide sequence, preferably a heterologous or artificial nucleotide sequence, that is attached to an oligonucleotide primer via standard phosphodiester linkage (i.e., phosphodiester linkage between the 3 ' OH of the tag and the 5' phosphate of the oligonucleotide) and permits the identification or tracing of polynucleotides into which the "tag" is inco ⁇ orated (inco ⁇ orated for example, by primer extension or amplification of a primer extension product).
  • standard phosphodiester linkage i.e., phosphodiester linkage between the 3 ' OH of the tag and the 5' phosphate of the oligonucleotide
  • a "tag” sequence according to the invention will comprise at least 15, and preferably 20 to 30 nucleotides and will preferably not hybridize under primer extension conditions to a sequence in the genome of the organism being genotyped.
  • a tag sequence according to the invention can be, but is not necessarily, random.
  • the term "specifically corresponds" means that a given nucleic acid tag sequence on an oligonucleotide is only used with a given 3 '-terminal nucleotide, such that the presence of the tag sequence is indicative of the presence of that 3 '-terminal nucleotide.
  • tag sequence "1" would only be used on an oligonucleotide with a 3 '-terminal A
  • tag sequence "2” would only be used on an oligonucleotide with a 3 '-terminal C
  • tag sequence "3” would only be used on an oligonucleotide with a 3 '-terminal G
  • tag sequence "4" would only be used on an oligonucleotide with a 3 '-terminal T.
  • amplification regimen refers to a process of specifically amplifying, i.e., increasing the abundance of, a nucleic acid sequence of interest.
  • An amplification regimen according to the invention comprises at least two, and preferably at least 5, 10, 15, 20, 25, 30, 35 or more iterative cycles, where each cycle comprises the steps of: 1) strand separation (e.g, thermal denaturation); 2) oligonucleotide primer annealing to template molecules; and 3) nucleic acid polymerase extension of the annealed primers. Conditions and times necessary for each of these steps are well known in the art.
  • Amplification achieved using an amplification regimen is preferably exponential, but can alternatively be linear.
  • An amplification regimen according to the invention is preferably performed in a thermal cycler, many of which are commercially available.
  • set means a group of nucleic acid samples, primers or other entities.
  • a set will comprise a known number of, and at least two of such entities.
  • subset means a group comprised by a set as defined herein, wherein the subset group is less than every member of the set.
  • a subset as used herein can consist of a single entity.
  • upstream and downstream are used to refer to positions on a polynucleotide relative to a polymo ⁇ hic site.
  • upstream refers to 5' of the polymo ⁇ hic site
  • downstream refers to 3' of the polymo ⁇ hic site.
  • the term "distinguishably labeled" means that the signal from one labeled oligonucleotide primer or a nucleic acid molecule into which it is inco ⁇ orated can be distinguished from the signal from another such labeled primer or nucleic acid molecule.
  • Detectable labels can comprise, for example, a light-absorbing dye, a fluorescent dye, or a radioactive label. Fluorescent dyes are preferred. Generally, a fluorescent signal is distinguishable from another fluorescent signal if the peak emission wavelengths are separated by at least 20 nm. Greater peak separation is preferred, especially where the emission peaks of fluorophores in a given reaction are wide, as opposed to narrow or more abrupt peaks.
  • separating nucleic acid molecules refers to the process of physically separating nucleic acid molecules in a sample or aliquot on the basis of size and/or charge. Electrophoretic separation is preferred, and capillary electrophoretic separation is most preferred.
  • the term "detecting the inco ⁇ oration” refers to the process of determining whether a given labeled oligonucleotide primer has been extended, thereby inco ⁇ orating the label into the primer extension or amplification product. Detection can be by any means compatible with the detectable label, but will preferably involve detection of a fluorescent label. Detecting encompasses determination of both the presence and the abundance of label in a primer extension or amplification product. Fluorescence detectors are well known in the art.
  • the tenn “specifically hybridizes” means that under given hybridization conditions a probe or primer hybridizes only to a target sequence in a sample comprising the target sequence.
  • Given hybridization conditions include the conditions for the annealing step in an amplification regimen, i.e., annealing temperature selected on the basis of predicted T m , and salt conditions suitable for the polymerase enzyme of choice.
  • strand separation means treatment of a nucleic acid sample such that complementary double-stranded molecules are separated into two single strands available for annealing to an oligonucleotide primer.
  • Strand separation according to the invention is achieved by heating the nucleic acid sample above its T m .
  • T m Generally, for a sample containing nucleic acid molecules in buffer suitable for a nucleic acid polymerase, heating to 94°C is sufficient to achieve strand separation according to the invention.
  • An exemplary buffer contains 50 mM KC1, 10 mM Tric-HCl (pH 8.8@ 25°C), 0.5 to 3 mM MgCl 2 , and 0.1% BSA.
  • primer annealing or "re-annealing” means permitting oligonucleotide primers to hybridize to template nucleic acid strands.
  • Conditions for primer annealing vary with the length and sequence of the primer and are based upon the calculated T m for the primer.
  • an annealing step in an amplification regimen involves reducing the temperature following the strand separation step to a temperature based on the calculated T m for the primer sequence, for a time sufficient to permit such annealing.
  • T m can be readily predicted by one of skill in the art using any of a number of widely available algorithms (e.g, OligoTM , Primer Design and programs available on the internet, including Primer3 and Oligo Calculator).
  • the annealing temperature is selected to be about 5°C below the predicted T m, although temperatures closer to and above the T m (e.g., between 1°C and 5°C below the predicted T m or between 1°C and 5°C above the predicted T m ) can be used, as can temperatures more than 5°C below or above the predicted T m (e.g, 6°C below, 8°C below, 10°C below or lower and 6°C above, 8°C above, or 10°C above). Generally, the closer the annealing temperature is to the T m , the more specific is the annealing.
  • Primer annealing depends largely upon the volume of the reaction, with larger volumes requiring longer times, but also depends upon primer and template concentrations, with higher relative concentrations of primer to template requiring less time than lower.
  • primer annealing steps in an amplification regimen can be on the order of 1 second to 5 minutes, but will generally be between 10 seconds and 2 minutes, preferably on the order of 30 seconds to 2 minutes.
  • the term "3' region that hybridizes to a sequence at a known distance upstream of a known polymo ⁇ hic site” refers to a sequence of nucleotides, located at the 3' end of an oligonucleotide, that specifically hybridize to a sequence upstream (i.e., 5') of a known polymo ⁇ hic site being genotyped in a sample of nucleic acid.
  • the "3' region that hybridizes” will be at least 12 nucleotides long, and preferably at least 15, 18, 21, 24, 27, 30 nucleotides or more.
  • the "region that hybridizes” is selected to be a known distance from the polymo ⁇ hic site so as to give rise to an amplification product that is distinctly sized relative to other amplification products in a method according to the invention.
  • the "known distance” can be from 50 to 1000 nucleotides, and is preferably from 50 to 500 nucleotides or 50 to 250 nucleotides.
  • a "region that hybridizes 3 ' of and adjacent to a polymo ⁇ hic site” is an oligonucleotide sequence, generally 10 to about 25 nucleotides in length, that specifically hybridizes 3' of a polymo ⁇ hic site, such that the penultimate 3' nucleotide of the region is hybridized one nucleotide downstream of the polymo ⁇ hic site.
  • the invention makes use of a set of four primers comprising such a region, with the set comprised of oligonucleotides having four different 3' terminal nucleotides, G, A, T or C, only one of which will hybridize to the nucleotide at the polymo ⁇ hic site and permit primer extension by a nucleic acid polymerase.
  • variable 3 '-terminal nucleotide refers to a 3 '-terminal nucleotide of an oligonucleotide that can be any of G, A, T or C.
  • the term "opposite the polymo ⁇ hic site” means that a nucleotide, the 3'- terminal nucleotide on an oligonucleotide primer hybridized to a polymo ⁇ hism-containing nucleic acid strand, is positioned such that it will form a Watson-Crick hydrogen bonded base pair with the nucleotide at the polymo ⁇ hic position if the 3 '-terminal nucleotide is complementary to the nucleotide at the polymo ⁇ hic site.
  • the term “complementary” refers to the hierarchy of hydrogen-bonded base pair fonnation preferences between the four deoxyribonucleotides G, A, T, and C, such that A pairs with T and G pairs with C.
  • nucleic acid polymerase refers an enzyme that catalyzes the template-dependent polymerization of nucleoside triphosphates to form primer extension products that are complementary to one of the nucleic acid strands of the template nucleic acid sequence.
  • a nucleic acid polymerase enzyme initiates synthesis at the 3' end of an annealed primer and proceeds in the direction toward the 5' end of the template.
  • Numerous nucleic acid polymerases are known in the art and commercially available.
  • One group of preferred nucleic acid polymerases are thermostable, i.e., they retain function after being subjected to temperatures sufficient to denature annealed strands of complementary nucleic acids.
  • aliquot refers to a sample of an amplification reaction taken during the cycling regimen. An aliquot is less than the total volume of the reaction, and is preferably 0.1-30 % in volume. In one embodiment of the invention, for each aliquot removed, an equal volume of reaction buffer containing reagents necessary for the reaction (e.g, buffer, salt, nucleotides, and polymerase enzyme) is introduced.
  • reaction buffer containing reagents necessary for the reaction (e.g, buffer, salt, nucleotides, and polymerase enzyme) is introduced.
  • condition that permit the extension of an annealed oligonucleotide such that extension products are generated refers to the set of conditions including, for example temperature, salt and co-factor concentrations, pH, and enzyme concentration under which a nucleic acid polymerase catalyzes primer extension. Such conditions will vary with the identity of the nucleic acid polymerase being used, but the conditions for a large number of useful polymerase enzymes are well known to those skilled in the art.
  • One exemplary set of conditions is 50 mM KC1, 10 mM Tric-HCl (pH 8.8@ 25°C), 0.5 to 3 mM MgCl 2 , 200 ⁇ M each dNTP, and 0.1 % BS A at 72 °C, under which Taq polymerase catalyzes primer extension.
  • the term "real time” means that the measurement of the accumulation of products in a nucleic acid amplification reaction is at least initiated, and preferably completed during or concurrent with the amplification regimen. Thus, for the measurement process to be considered “real time”, at least the initiation of the measurement or detection of amplification products in each aliquot is concurrent with the amplification process.
  • “initiated” is meant that an aliquot is withdrawn and placed into a separation apparatus, e.g, a capillary electrophoresis capillary, and separation is begun. The completion of the measurement is the detection of labeled species in the separated nucleic acids from the aliquot.
  • lag or delay is less than 30 minutes, e.g, 25 minutes, 20 minutes, 15 minutes, 10 minutes, 5 minutes, 4 minutes, 3 minutes, 2 minutes, 1 minute or less, including no lag or delay.
  • capillary electrophoresis means the electrophoretic separation of nucleic acid molecules in an aliquot from an amplification reaction wherein the separation is performed in a capillary tube.
  • Capillary tubes are available with inner diameters from about 10 to 300 ⁇ m, and can range from about 0.2 cm to about 3 m in length, but are preferably in the range of 0.5 cm to 20 cm, more preferably in the range of 0.5 cm to 10 cm.
  • microfluidic microcapillaries available, e.g, from Caliper or Agilent Technologies
  • Agilent Technologies is specifically contemplated within the meaning of "capillary electrophoresis.”
  • module apparatus means an apparatus that comprises individual units in which certain processes of the methods according to the invention are performed.
  • the individual units of a modular apparatus can be but are not necessarily physically connected, but it is preferred that the individual units are controlled by a central control device such as a computer.
  • An example of a modular apparatus useful according to the invention has a thermal cycler unit, a sampler unit, and a capillary electrophoresis unit with a fluorescence detector.
  • the modular apparatus useful according to the invention can also comprise a robotic ami to transfer samples from the cycling reaction to the electrophoresis unit.
  • the term "sampling device” refers to a mechanism that withdraws an aliquot from an amplification during the amplification regimen.
  • Sampling devices useful according to the invention will preferably be adapted to minimize contamination of the cycling reaction(s), by, for example, using pipeting tips or needles that are either disposed of after a single sample is withdrawn, or by incorporating one or more steps of washing the needle or tip after each sample is withdrawn.
  • the sampling device can contact the capillary to be used for capillary electrophoresis directly with the amplification reaction in order to load an aliquot into the capillary.
  • the sample device can include a fluidic line (e.g. a tube) connected to the controllable valve which will open at particular cycle.
  • Sampling devices known in the art include, for example, the multipu ⁇ ose Robbins Scientific Hydra 96 pipettor, which is adapted to sampling to or from 96 well plates. This and others can be readily adapted for use according to the methods of the invention.
  • robot arm means a device, preferably controlled by a microprocessor, that physically transfers samples, tubes, or plates containing samples from one location to another. Each location can be a unit in a modular apparatus useful according to the invention.
  • An example of a robotic arm useful according to the invention is the Mitsubishi RV- E2 Robotic Arm. Software for the control of robotic arms is generally available from the manufacturer of the arm.
  • amplified product refers to polynucleotides which are copies of a portion of a particular polynucleotide sequence and/or its complementary sequence, which correspond in nucleotide sequence to the template polynucleotide sequence and its complementary sequence.
  • An “amplified product,” according to the invention, may be DNA or RNA, and it may be double-stranded or single-stranded.
  • the term "distinctly sized amplification product” means an amplification product that is resolvable from amplification products of different sizes. "Different sizes" refers to nucleic acid molecules that differ by at least one nucleotide in length. Generally, distinctly sized amplification products useful according to the invention differ by greater than or equal to more nucleotides than the limit of resolution for the separation process used in a given method according to the invention. For example, when the limit of resolution of separation is one base, distinctly sized amplification products differ by at least one base in length, but can differ by 2 bases, 5 bases, 10 bases, 20 bases, 50 bases, 100 bases or more. When the limit of resolution is, for example, 10 bases, distinctly sized amplification products will differ by at least 10 bases, but can differ by 11 bases, 15 bases, 20 bases, 30 bases, 50 bases, 100 bases or more.
  • amplification plot mean a mathematical curve representing the signal from a detectable label inco ⁇ orated into a nucleic acid sequence of interest at two or more steps in an amplification regimen, plotted as a function of the cycle number from which the samples were withdrawn.
  • the profile is preferably generated by plotting the fluorescence of each band detected after capillary electrophoresis separation of nucleic acids in the individual reaction samples. Most commercially available fluorescence detectors are interfaced with software permitting the generation of curves based on the signal detected.
  • the number of genes that could be investigated in a single reaction can be estimated based on the measurable difference of the product size ( 1-2 bases) and on the separable size of PCR products (500-1000b ⁇ ) and can be as high as 1000, but is preferably 100-200.
  • heat-labile exonuclease refers to an enzyme that degrades single-stranded nucleic acid molecules or overhanging single strands on partially double stranded nucleic acid molecules and is irreversibly inactivated by incubation at an elevated temperature.
  • the temperature for inactivation will vary with the enzyme and with, for example, buffer conditions and enzyme concentration. Conditions for enzyme inactivation are known to those skilled in the art.
  • a non-limiting example of a heat-labile exonuclease useful according to the invention is Exonuclease I (Exol), from E. coli (commerically available from, e.g. New England Biolabs, Beverly MA). Exol is inactivated by incubation at 80°C for 20 minutes.
  • the term "substantially lacking sequence specific for a gene in the genome of the organism” means that a given primer will not generate a primer extension product when incubated under primer extension conditions with genomic DNA from the organism being investigated with respect to polymo ⁇ hisms.
  • Figure 1 shows a schematic diagram of primer extension reactions useful in one embodiment of the invention.
  • SI and S5 are different sequence tags.
  • Figure 2 shows a schematic diagram of an amplification regimen and detection useful in one embodiment of the invention.
  • SI and S5 are tag sequence primers that differ from one another but are identical to SI to S 5 shown in Figure 1.
  • the invention provides methods of determining the genotype of a nucleic acid sample with respect to known single nucleotide polymo ⁇ hisms.
  • the methods of the invention employ primer extension reactions that inco ⁇ orate sequence tags permitting the simultaneous identification of the specific nucleotides present at a group of SNPs. Tagged fragments are then amplified using sets of primers specific for the tags wherein the downstream primer is labeled. During the amplification regimen, aliquots of the reaction are withdrawn and subjected to size separation and detection of the amplified fragments. The nucleotides present at the polymo ⁇ hic sites are identified based on the size and identity of the label attached to the amplified fragments.
  • the system is well suited for multiplexing. Further, the separation and detection are performed during the amplification reaction, such that a profile of the amplification reaction is generated in real time.
  • the real time aspect provides rapid analysis as well as information regarding the course of the amplification that is useful in identifying and eliminating artifactual signals caused, for example, by interactions between primers.
  • the invention requires the generation of sequence-tagged primer extension products.
  • a critical aspect of this step is that the tag on any particular extension product specifically corresponds with the identity of the nucleotide at the polymo ⁇ hic site.
  • the tag is inco ⁇ orated by the extension of a primer with the following general structure:
  • Tag c is the tag sequence that corresponds with the identity of the nucleotide at the 3' terminus of the primer
  • target complement is the 3' region of the primer that specifically hybridizes adjacent to the known SNP
  • arid V c is a variable 3' terminal nucleotide that corresponds with the identity of the Tag c sequence.
  • the Tag c sequence is preferably 20 to 30 nucleotides in length and preferably does not hybridize under primer extension conditions to a sequence in the genome of the organism being genotyped or to any of the other primers used in a given reaction.
  • the "target complement” is long enough to provide specific hybridization between the primer and the sequence adjacent to a known SNP, and will generally be about 10 to 25 nucleotides in length.
  • V c is selected from dG, dA, dT and dC, and is positioned so that it is opposite the known polymo ⁇ hic site when the primer is hybridized to the nucleic acid sample being interrogated. V c will base pair with the nucleotide at the polymo ⁇ hic site only if it is complementary to the nucleotide at that site.
  • nucleic acid polymerase e.g, Taq polymerase
  • a nucleic acid polymerase will only extend a primer if the 3 '-terminal nucleotide is base paired with the adjacent nucleotide on the template strand
  • the extension of a primer with a known 3 '-terminal nucleotide opposite the polymo ⁇ hic site identifies the nucleotide present at the polymo ⁇ hic site as the complement of the 3 ' -terminal nucleotide.
  • a set of downstream primer extension primers useful for the identification of an SNP will include four different tag sequences, one each to correspond to a 3 '-terminal dG, dA, dT or dC.
  • tags are referred to as Tags 1-4, for example, Tag 1 would be used on the primer terminating in a 3' dG, Tag 2 would be used on the primer terminating in a 3' dA, Tag 3 would be used on the primer terminating in a 3' dT, and Tag 4 would be used on the primer terminating in a 3' dC.
  • a major advantage of the methods disclosed herein is that one can use the same set of four downstream Tag c sequences in assays for multiple SNPs, because the resulting amplification products will differ in size. This limits the possibilities for non-template directed inte ⁇ rimer interactions in the amplification step that tend to interfere with multiplex amplifications.
  • Sequence-tagged upstream primers are used to generate the opposite strand of a given SNP-containing sequence. These primers will have the general structure:
  • Tag refers to a sequence tag different from each of those used in a downstream set of primer extension primers
  • target complement refers to a sequence complementary to a region upstream of the known SNP.
  • the "Tag” sequence on the upstream primer is preferably 20 to 30 nucleotides in length and preferably does not hybridize under primer extension conditions to a sequence in the genome of the organism being genotyped, or to any of the other primers being used in a given reaction.
  • the “target complement” is long enough to provide specific hybridization under primer extension conditions between the primer and a sequence upstream of a known SNP, and will generally be about 10 to 25 nucleotides in length.
  • the distance upstream will generally be at least 50 nucleotides, but can be 50 to 1000 nucleotides or more, preferably 50 to 500, or 50 to 250 nucleotides upstream of the polymo ⁇ hic site.
  • the distance of the upstream primer sequence from the polymo ⁇ hic site determines the size or length of the later amplification products.
  • the sizes of the later amplification products must be selected so as to differ by more than the resolution limit of the system used for size separation. Thus, if the limit of resolution of separation is one base, the sizes of the amplification products should be selected to differ by at least one base in length, and preferably more (e.g, at least 5, 10, 15 bases or more). When the limit of resolution is, for example, 10 bases, sizes of the amplification products should differ by at least 10 bases, and preferably more (e.g, at least 15, 20, 25, 30 bases or more).
  • upstream and downstream are used herein in order to facilitate the description of the invention. However, it is recognized that because of the double-stranded ' nature of DNA, a polymo ⁇ hism could be approached with SNP-specific primers from either side, that is, from upstream or downstream, by hybridization of the primer to one strand as opposed to the other.
  • the invention specifically contemplates the interrogation of SNPs on either strand of the genomic DNA.
  • a nucleic acid sample is denatured, preferably by heat, e.g, to 95°C for 2 minutes or more, and allowed to re-anneal in the presence of an upstream extension primer and a set of downstream primer extension primers for each SNP to be interrogated in the reaction.
  • the denaturing and annealing is best performed in a buffer compatible with the nucleic acid polymerase to be used for the primer extension reaction, e.g, IX Taq polymerase buffer.
  • Re- annealing is performed at a temperature below the T m of the primers, generally between about 20°C and 60°C, although lower or higher temperatures may be suitable for some primers.
  • Primers should be present at about 15 to 500 nM for each primer. Optimal primer concentrations can be determined empirically by one of skill in the art with a minimum of experimentation, for example by setting up test reactions in which the primers are varied over the 15 to 500 nM range and analyzing the results with respect to the relative resolution, yield and specificity of the extension or amplification reactions.
  • polymerization is perforaied using a nucleic acid polymerase.
  • Numerous polymerases sufficient for this step are known and can be selected by one skilled in the art.
  • the thermostable Taq polymerase and other thermostable polymerases e.g, Pfu polymerase.
  • Primer extension is performed under standard conditions for the enzyme chosen, e.g, 50 mM KC1, 10 mM Tric-HCl (pH 8.8@ 25°C), 0.5 to 3 mM MgCl 2 , and 0.1% BSA and 100 ⁇ M each dNTP at 72°C for two minutes.
  • the first round of primer extension results in a population in which one strand has an upstream primer and tag sequence inco ⁇ orated and the other strand has a downstream primer and tag sequence inco ⁇ orated.
  • the downstream primer inco ⁇ orated for each SNP is the one in which the 3 '-terminal nucleotide was complementary to the nucleotide at the polymo ⁇ hic site on the target DNA.
  • the inco ⁇ oration of that downstream primer necessarily inco ⁇ orates the tag sequence associated with or corresponding to that 3 '-terminal nucleotide.
  • the products of the first primer extension reaction are subjected to another round of denaturing, re-annealing in the presence of the same primers, and polymerase extension of those primers.
  • Heat labile exonucleases useful according to the invention include, for example E. coli Exonuclease I (Exol), and Exonuclease VII (ExoVII) .
  • Exol for example, is active at 37°C but is inactivated by incubation for 20 minutes at 80°C.
  • the primers used for primer extension are removed so that new primers, corresponding to the inco ⁇ orated upstream and downstream tag sequences, can be used to amplify the primer extension products.
  • a set of primers comprising an upstream tag sequence primer and four downstream tag sequence primers is added.
  • Each of the four downstream tag sequence primers is distinguishably labeled (e.g, end labeled) with a fluorescent dye.
  • the mixture with the new primers added is then subjected to an amplification regimen comprising cycles of thermal denaturation, re-annealing and polymerase extension.
  • the amplification regimen should comprise at least two cycles, but will preferably comprise 2 to 35 cycles, more preferably 10 to 30 cycles, and more preferably 15 to 25 cycles.
  • a sample or aliquot of the reaction is withdrawn from the tube or reaction vessel, and nucleic acids in the aliquot are separated and detected.
  • the separation and detection are performed concurrently with the cycling regimen, such that a curve representing product abundance as a function of cycle number can be generated while the cycling occurs.
  • the term "concurrently” means that the separation is at least initiated while the cycling regimen is proceeding.
  • the separation technology used e.g, capillary electrophoresis
  • the separation will most often require on the order of 1-120 minutes per aliquot.
  • Concurrent separation is most preferably performed through use of a robotic sampler that deposits the samples to the separation apparatus immediately after the samples are withdrawn from the cycling reaction.
  • the identity of the nucleotide at a polymorphic site is determined by detection of the fluorescent signals on the size-separated amplification products. Because each of the four downstream tag primers is labeled with a distinguishable fluorescent label, and because the tag on a given primer corresponds to the identity of the 3 '-terminal nucleotide of the original downstream primer extension primer, the inco ⁇ oration and detection of that fluorescently labeled tag identifies the nucleotide at the polymo ⁇ hic site.
  • the original primer extension reactions include primer sets that recognize more than one SNP.
  • each different polymo ⁇ hism will be represented by a distinctly sized amplification product.
  • additional upstream primers each comprising the same tag sequence and varying in the 3' region that hybridizes at a distinct distance upstream of an additional known SNP.
  • each additional SNP to be interrogated requires a set of four downstream primer extension primers, each member of the set comprising in 5' to 3' order: a) a tag sequence that corresponds to the 3 ' terminal nucleotide of that primer, wherein the tag sequence is the same tag sequence that corresponds to that 3 '-terminal nucleotide on the downstream primers used for other SNPs being interrogated in the same series of reactions; b) a region sufficient to direct specific hybridization of the primer downstream of and adjacent to a known SNP; and c) a variable 3 '-terminal nucleotide that corresponds to the tag sequence on that primer, wherein when the primer is hybridized to its genomic target sequence, the 3 '-terminal nucleotide is opposite the polymo ⁇ hic site and can base pair with the nucleotide at that site if it is complementary.
  • a single amplification primer set is used, identical to that used when a single SNP is interrogated. That is, the amplification primer set will comprise an upstream primer comprising the upstream tag and a set of four distinguishably labeled primers comprising the four downstream tags on the primer extension primers, where the labels correspond to the tags that correspond to the nucleotides opposite the polymorphic site.
  • the same amplification primer set can be used for each SNP interrogated because the incorporated tags are common between the sets. That is, all upstream primers have the same tag sequence, and all downstream primer extension primer sets have the same tag sequences corresponding to the same 3 ' terminal nucleotides.
  • Each SNP interrogated will have a distinct size when separated, and the identity of the label inco ⁇ orated into a molecule of that size positively identifies the nucleotide present at that polymo ⁇ hic site.
  • the ability to amplify and detect multiple SNPs with a single set of five amplification primers has the advantage of avoiding primer interaction problems prevalent when large numbers of primers are used for amplification.
  • the effect of variations in primer annealing efficiency will be largely negated because all SNPs interrogated with a given amplification primer set will be affected by such variations to the same degree.
  • each set will comprise upstream tags selected so that the amplification products are distinctly sized, and downstream tags in which the respective tags correspond to the 3 '-terminal nucleotides of the primer extension primers.
  • the downstream primers can be labeled with the same corresponding fluorescent labels as the other sets, or, preferably with a different set of distinguishable fluorescent labels.
  • Oligonucleotide primers are generally 5 to 100 nucleotides in length, preferably from 17 to 45 nucleotides, although primers of different lengths are of use.
  • Primers for primer extension reactions are preferably 10 to 60 nucleotides long, while primers for amplification are preferably about 17-25 nucleotides in length.
  • Primers useful according to the invention can be designed to have a particular melting temperature (T m ) by the method of melting temperature estimation.
  • Commercial programs, including OligoTM , Primer Design and programs available on the internet, including PrimerS and Oligo Calculator can be used to calculate the T m of a polynucleotide sequence useful according to the invention.
  • the T m of an amplification primer useful according to the invention is between about 45°C and 65 3 C and more preferably between about 50°C and 60 3 C.
  • the T m of a polynucleotide affects its hybridization to another polynucleotide (e.g, the annealing of an oligonucleotide primer to a template polynucleotide).
  • the oligonucleotide primers used in various steps selectively hybridize to a target template or to polynucleotides derived from the target template.
  • selective hybridization occurs when two polynucleotide sequences are substantially complementary (at least about 65% complementary over a stretch of at least 14 to 25 nucleotides, preferably at least about 75%, more preferably at least about 90% complementary). See Kanehisa, M, 1984, Polynucleotide Res.
  • mismatch may be small, such as a mono-, di- or tri-nucleotides.
  • a region of mismatch may encompass loops, which are defined as regions in which there exists a mismatch in an uninterrupted series of four or more nucleotides.
  • oligonucleotide primers Numerous factors influence the efficiency and selectivity of hybridization of the primer to a second polynucleotide molecule. These factors, which include primer length, nucleotide sequence and/or composition, hybridization temperature, buffer composition and potential for steric hindrance in the region to which the primer is required to hybridize, will be considered when designing oligonucleotide primers according to the invention.
  • longer sequences have a higher melting temperature (T M ) than do shorter ones, and are less likely to be repeated within a given target sequence, thereby minimizing promiscuous hybridization.
  • T M melting temperature
  • Primer sequences with a high G-C content or that comprise palindromic sequences tend to self-hybridize, as do their intended target sites, since unimolecular, rather than bimolecular, hybridization kinetics are generally favored in solution.
  • Hybridization temperature varies inversely with primer annealing efficiency, as does the concentration of organic solvents, e.g. formamide, that might be included in a priming reaction or hybridization mixture, while increases in salt concentration facilitate binding. Under stringent annealing conditions, longer hybridization probes or synthesis primers hybridize more efficiently than do shorter ones, which are sufficient under more permissive conditions.
  • stringent hybridization is performed in a suitable buffer (for example, IX Taq Polymerase Buffer, or other buffer suitable for enzymes used for primer extension and amplification) under conditions that allow the polynucleotide sequence to hybridize to the oligonucleotide primers.
  • a suitable buffer for example, IX Taq Polymerase Buffer, or other buffer suitable for enzymes used for primer extension and amplification
  • Stringent hybridization conditions can vary (for example from salt concentrations of less than about 1M, more usually less than about 500 mM and preferably less than about 200 mM) and hybridization temperatures can vary (for example, from as low as 0 3 C to greater than 22 3 C, greater than about 30 3 C, and (most often) in excess of about 37 3 C) depending upon the lengths and/or the polynucleotide composition or the oligonucleotide primers. Longer fragments may require higher hybridization temperatures for specific hybridization. As several factors affect the stringency of hybridization, the combination of parameters is more
  • primers designed to hybridize near a known SNP are limited with respect to the modifications one can make to manipulate T m .
  • T m For example, where one would normally be able to shift up- or downstream on a sequence to find a region with a more favorable GC content, when a primer is designed to hybridize adjacent to a SNP, one cannot move the primer to another location. In this situation, then, the primary means of manipulating T m is to vary the length of the complementary sequence in the primer.
  • Tags useful according to the invention are preferably heterologous or artificial nucleotide sequences of at least 15, and preferably 20 to 30 nucleotides in length.
  • a tag will preferably not hybridize under PCR annealing conditions to a sequence in the genome of the organism being genotyped.
  • a tag sequence according to the invention can be, but is not necessarily random. One can determine whether a potential tag sequence hybridizes under PCR annealing conditions to a sequence in the genome of an organism by using the tag sequence as a labeled primer in a primer extension reaction with genomic DNA from the organism of interest as template.
  • the labeled primer is annealed to the genomic DNA at the annealing temperature one plans to use for the amplification steps of the method of the invention, and then incubated with thermostable polymerase under extension conditions.
  • the reaction products are then electrophoretically separated alongside labeled probe alone. If the labeled tag appears in a band or bands larger than the tag primer, the tag primer hybridized under PCR annealing conditions to a sequence in the genome of the organism being genotyped. Care should also be taken to avoid tags with complementarity to other tags intended for use in the same reaction.
  • Oligonucleotide primers useful according to the invention can be labeled, as described below, by inco ⁇ orating moieties detectable by spectroscopic, photochemical, biochemical, immunochemical, enzymatic or chemical means.
  • the method of linking or conjugating the label to the oligonucleotide primer depends, of course, on the type of label(s) used and the position of the label on the primer (i.e., 3 '-terminal, 5 '-terminal or body-labeled).
  • fluorescent dyes are preferred, a variety of labels that would be appropriate for use in the invention, as well as methods for their inclusion in the primer, are known in the art and include, but are not limited to, enzymes (e.g, alkaline phosphatase and horseradish peroxidase) and enzyme substrates, radioactive atoms, chromophores, fluorescence quenchers, chemiluminescent labels, and electrochemiluminescent labels, such as OrigenTM (Igen), that may interact with each other to enhance, alter, or diminish a signal.
  • enzymes e.g, alkaline phosphatase and horseradish peroxidase
  • enzyme substrates e.g, radioactive atoms, chromophores, fluorescence quenchers, chemiluminescent labels, and electrochemiluminescent labels, such as OrigenTM (Igen)
  • Igen electrochemiluminescent labels
  • Fluorophores for use as labels in constructing labeled primers of the invention include, but are not limited to rhodamine and derivatives (such as Texas Red), fluorescein and derivatives (such as 5-bromomethyl fluorescein), Cy5, Cy3, JOE, FAM, Oregon GreenTM, Lucifer Yellow, IAEDANS, 7-Me 2 N-coumarin-4-acetate, 7-OH-4-CH 3 -coumarin-3-acetate, 7-NH 2 -4-CH 3 - coumarin-3 -acetate (AMCA), monobromobimane, pyrene frisulfonates, such as Cascade Blue, and monobromorimethyl-ammoniobimane.
  • fluorophores with wide Stokes shifts are preferred, to allow using fluorimeters with filters rather than a monochromometer and to increase the efficiency of detection.
  • the labels can be attached to the oligonucleotide directly or indirectly by a variety of techniques. Depending on the precise type of label or tag used, the label can be located at the 5' end of the primer or located internally in the primer, or attached to spacer arms of various sizes and compositions to facilitate signal interactions. 5' end labeling is preferred.
  • phosphoramidite reagents one can produce oligomers containing functional groups (e.g, thiols or primary amines) at the 5'- terminus via an appropriately protected phosphoramidite, and can label them using protocols described in, for example, PCR Protocols: A Guide to Methods and Applications, frmis et al, eds. Academic Press, Ind, 1990.
  • a 5' phosphate group can be introduced as a radioisotope by using polynucleotide kinase and gamma- 32 P-ATP or gamma- 33 P- ATP to provide a reporter group.
  • Biotin can be added to the 5' end by reacting an aminothymidine residue, or a 6-amino hexyl residue, introduced during synthesis, with an N- hydroxysuccinimide ester of biotin.
  • PCR methods are well-known to those skilled in the art, such as those described in Mullis and Faloona, 1987, Methods Enzymol, 155: 335, Saiki et al, 1985. Science 230:1350. andU.S. Patent Nos. 4,683,202, 4,683,195 and 4,800,159, each of which is inco ⁇ orated herein by reference.
  • PCR is an in vitro method for the enzymatic synthesis of specific DNA sequences, using two oligonucleotide primers that hybridize to opposite strands and flank the region of interest in the target DNA.
  • PCR is reported to be capable of producing a selective enrichment of a specific DNA sequence by a factor of 10 9 .
  • the length and temperature of each step of a PCR cycle, as well as the number of cycles, are adjusted according to the stringency requirements in effect.
  • Annealing temperature and timing are determined both by the efficiency with which a primer is expected to anneal to a template and the degree of mismatch that is to be tolerated.
  • the ability to optimize the stringency of primer annealing conditions is well within the knowledge of one of skill in the art. An annealing temperature between 20°C and 72°C is most commonly used.
  • Initial denaturation of the template molecules is normally achieved by incubation at 92°C to 99°C for 4 minutes, followed by 20-40 cycles consisting of denaturation (94°C for 15 seconds to 1 minute), annealing (temperature based on T m as discussed above, usually about 5°C below the T ra of the oligonucleotide in the reaction with the lowest T m ; usually 1-2 minutes), and extension (usually 72°C for 1-3 minutes).
  • Sampling during the amplification regimen can be performed at any frequency or in any pattern desired. It is preferred that sampling occurs after each cycle in the regimen, although less frequent sampling can also be used, for example, every other cycle, every third cycle, every fourth cycle, etc. While a uniform sample interval will most often be desired, there is no requirement that sampling be performed at uniform intervals. As just one example, the sampling routine may involve sampling after every cycle for the first five cycles, and then sampling after every other cycle.
  • Sampling can be as simple as manually pipetting an aliquot from the reaction, but is preferably automated such that the aliquot is automatically withdrawn at predetermined sampling intervals. It is preferred that the reaction mixture is replenished at each withdrawal with equal volumes of fresh components such as dNTPs, primers and DNA polymerase. For this and other aspects of the invention, it is preferred, although not necessary that the cycling be performed in a microtiter or multiwell plate format. This format, which uses plates comprising multiple reaction wells, not only increases the throughput of the assay process, but is also well adapted for automated sampling steps due to the modular nature of the plates and the uniform grid layout of the wells on the plates.
  • microtiter plate designs useful according to the invention have, for example 12, 24, 48, 96, 384 or more wells, although any number of wells that physically fit on the plate and accommodate the desired reaction volume (usually 10-100 ⁇ l) can be used according to the invention.
  • the 96 or 384 well plate format is preferred.
  • An automated sampling process can be readily executed as a programmed routine and avoids both human error in sampling (i.e., error in sample size and tracking of sample identity) and the possibility of contamination from the person sampling.
  • Robotic samplers capable of withdrawing aliquots from thermal cyclers are available in the art.
  • the Mitsubishi RV-E2 Robotic Arm can be used in conjunction with a SciCloneTM Liquid Handler or a Robbins Scientific Hydra 96 pipettor.
  • the robotic sampler useful according to the invention can be integrated with the thermal cycler, or the sampler and cycler can be modular in design.
  • thermal cycling and sampling occur in the same location, with samples being withdrawn at programmed intervals by a robotic sampler.
  • the cycler and sampler are modular in design, the cycler and sampler are separate modules.
  • the assay plate is physically moved, e.g, by a robotic arm, from the cycler to the sampler and back to the cycler.
  • the volume of an aliquot removed at the sampling step can vary, depending, for example, upon the total volume of the amplification reaction, the sensitivity of product detection, and the type of separation used.
  • Amplification volumes can vary from several microliters to several hundred microliters (e.g, 5 ⁇ l, 10 ⁇ l, 20 ⁇ l, 40 ⁇ l, 60 ⁇ l, 80 ⁇ l, 100 ⁇ l, 120 ⁇ l, 150 ⁇ l, or 200 ⁇ l or more), preferably in the range of 10-150 ⁇ l, more preferably in the range of 10-100 ⁇ l.
  • Aliquot volumes can vary from 0.1 to 30 % of the reaction mixture.
  • Separation of nucleic acids according to the invention can be achieved by any means suitable for separation of nucleic acids, including, for example, electrophoresis, HPLC or mass spectrometry. Due to its speed and resolution, separation is preferably performed by capillary electrophoresis (CE).
  • CE capillary electrophoresis
  • CE is an efficient analytical separation technique for the analysis of minute amounts of sample.
  • CE separations are performed in a narrow diameter capillary tube, which is filled with an electrically conductive medium termed the "carrier electrolyte.”
  • An electric field is applied between the two ends of the capillary tube, and species in the sample move from one electrode toward the other electrode at a rate which is dependent on the electrophoretic mobility of each species, as well as on the rate of fluid movement in the tube.
  • CE may be performed using gels or liquids, such as buffers, in the capillary.
  • free zone electrophoresis separations are based on differences in the free solution mobility of sample species.
  • micelles are used to effect separations based on differences in hydrophobicity. This is known as Micellar Electrokinetic Capillary Chromatography (MECC).
  • MECC Micellar Electrokinetic Capillary Chromatography
  • CE separates nucleic acid molecules on the basis of charge, which effectively results in their separation by size or number of nucleotides.
  • a number of fragments When a number of fragments are produced, they will pass the fluorescence detector near the end of the capillary in ascending order of size. That is, smaller fragments will migrate ahead of larger ones and be detected first.
  • CE offers significant advantages of over conventional electrophoresis, primarily in the speed of separation, small size of the required sample (on the order of 1-50 nl), and high resolution. For example, separation speeds using CE can be 10 to 20 times faster than conventional gel electrophoresis, and no post-run staining is necessary.
  • CE provides high resolution, separating molecules in the range of about 10-1,000 base pairs differing by as little as a single base pair. High resolution is possible in part because the large surface area of the capillary efficiently dissipates heat, permitting the use of high voltages. In addition, band broadening is minimized due to the narrow inner diameter of the capillary.
  • free-zone electrophoresis the phenomenon of electroosmosis, or electroosmotic flow (EOF) occurs. This is a bulk flow of liquid that affects all of the sample molecules regardless of charge. Under certain conditions EOF can contribute to improved resolution and separation speed in free-zone CE.
  • CE can be performed by methods well known in the art, for example, as disclosed in U.S. Patent Nos. 6,217,731; 6,001,230; and 5,963,456, which are inco ⁇ orated herein by reference.
  • High throughput CE equipment is available commercially, for example, the HTS9610 High Throughput Analysis System and SCE 9610 fully automated 96-capillary electrophoresis genetic analysis system from Spectrumedix Co ⁇ oration (State College, PA).
  • Others include the P/ACE 5000 series from Beckman Instruments Inc (Fullerton, CA) and the ABI PRISM 3100 genetic analyzer (Applied Biosystems, Foster City, CA).
  • Each of these devices comprises a fluorescence detector that monitors the emission of light by molecules in the sample near the end of the CE column.
  • the standard fluorescence detectors can distinguish numerous different wavelengths of fluorescence emission, providing the ability to detect multiple fluorescently labeled species in a single CE run from an amplification sample.
  • CAE Capillary Array Electrophoresis
  • a confocal fluorescence scanner can be used (Quesada et al, 1991, Biotechniques 10:616-25).
  • the apparatus for separation can be separate from or integrated with the apparatus used for thermal cycling and sampling. Because according to the invention the separation step is initiated concurrently with the cycling regimen, samples are preferably taken directly from the amplification reaction and placed into the separation apparatus so that separation proceeds concurrently with amplification. Thus, while it is not necessary, it is preferred that the separation apparatus is integral with the thermal cycling and sampling apparatus. In one embodiment, this apparatus is modular, comprising a thermal cycling module and a separation/detection module, with a robotic sampler that withdraws sample from the thermal cycling reaction and places it into the separation/detection apparatus.
  • Amplification product detection methods useful according to the invention measure the intensity of fluorescence emitted by labeled primers when they are irradiated with light within the excitation spectrum of the fluorescent label.
  • Fluorescence detection technology is highly developed and very sensitive, with documented detection down to a single molecule in some instances.
  • High sensitivity fluorescence detection is a standard aspect of most commercially- available plate readers, microarray detection set-ups and CE apparatuses.
  • fiber optic transmission of excitation and emission signals is often employed. Spectrumedix, Applied Biosystems, Beckman Coulter and Agilent each sell CE equipment with fluorescence detectors sufficient for the fluorescence detection necessary for the methods described herein.
  • the fluorescence signals from two or more different fluorescent labels can be distinguished from each other if the peak wavelengths of emission are each separated by 20 nm or more in the spectrum.
  • the practitioner will select fluorophores with greater separation between peak wavelengths, particularly where the selected fluorophores have broad emission wavelength peaks. It follows that the more different fluorophores one wishes to include and detect concurrently in a sample, the narrower should be their emission peaks.
  • Example 1 Detection of single nucleotide differences.
  • Leber's hereditary optic neuropathy is associated with the presence of several point mutations in mitochondrial DNA, at positions 3460, 11778 and 14459.
  • the genotype of an individual with respect to SNPs in human mitochondrial DNA associated with Leber's hereditary optic neuropathy (LHON) can be determined as follows.
  • the upstream primers are as follows:
  • the downstream primers are as follows:
  • G-primer 5 '-agttggcgaa gcagtcgcta gaagaCGG GCT ACT ACA ACC CTT CGC TGA CG-3' (SEQ ID NO: 7)
  • A-primer 5'-gatgctggtg tggctggtgt tcccgCGG GCT ACT ACA ACC CTT CGC TGA CA-3' (SEQ ID NO: 8)
  • T-primer 5'-ggttggttgc acactggaga tattggCGG GCT ACT ACA ACC CTT CGC TGA CT-3' (SEQ ID NO: 9)
  • A-primer 5'-gatgctggtg tggctggtgt tcccgTCA AAC TAC GAA CGC ACT CAC AGT CA-3' (SEQ ID NO: 12)
  • T-primer 5 '-ggttggttgc acactggaga tattggTC A AAC TAC GAA CGC ACT CAC AGT
  • A-primer 5'-gatgctggtg tggctggtgt tcccgCTC AGG ATA CTC CTC AAT AGC CAT CA-3' (SEQ ID NO: 16)
  • T-primer 5'-ggttggttgc acactggaga tattggCTC AGG ATA CTC CTC AAT AGC CAT CT-3' (SEQ ID NO: 17)
  • C-primer 5'-ctggagcatc tggaaaagta gtaccCTC AGG ATA CTC CTC AAT AGC CAT
  • the full set of 5 primer extension primers for each polymo ⁇ hic site (40 pmol each, 15 primers in total) is mixed with 1 ⁇ g of template genomic DNA from the individual to be tested, in IX Pfu buffer (20 mM Tris-HCl, pH 8.8, 10 mM KC1, 10 mM (NH 4 ) 2 SO 4 , 2 mM MgSO 4 , 0.1% Triton-X-100 and 0.1 mg/ml nuclease-free BSA) in a total volume of 50 ⁇ l. The mixture is heated to 94°C for 2 minutes and slowly cooled to room temperature, to permit primer annealing.
  • Primer extension primers are removed by the addition of 20 U of E. coli Exonuclease I (Exol; New England Biolabs) and incubation at 37°C for 20 minutes. Exol is then inactivated by incubation at 80°C for 20 minutes.
  • the 5 amplification primers (40 pmol of each primer in IX Pfu buffer, final volume 75 ⁇ l) are added as follows:
  • G-primer 5'-R6G-agttggcgaa gcagtcgcta gaaga-3' (SEQ ID NO: 20)
  • A-primer 5 '-FAM-gatgctggtg tggctggtgt tcccg-3' (SEQ ID NO: 21)
  • T-primer 5'-ROX-ggttggttgc acactggaga tattgg-3' (SEQ ID NO: 22)
  • Amplification is performed by adding 1 ⁇ l of fresh, cloned Pfu polymerase and cycling the reaction as follows: 35 cycles of 94°C for 45 sec, 50°C for 45 sec, and 72°C for 2 min. After each cycle, or at any chosen interval, an aliquot (0.5 ⁇ l) is withdrawn and loaded onto a prepared capillary electrophoresis apparatus. Separation is initiated and conducted during the amplification regimen. Amplified primer extension products are detected by fluorescence after separation over the length of the capillary. The signal strength of each fragment can be plotted for each cycle, to generate an amplification profile.
  • the method detailed in this example can be further multiplexed by including an additional upstream primer extension primer for each additional SNP, having the same upstream tag and a 3' region specific for a different SNP-containing fragment of a distinct size from those already included.
  • Each additional SNP interrogated must also have its own set of 4 downstream primers carrying the same set of 4 downstream primer tags, a 3' region that specifically hybridizes adjacent to the SNP, and a variable 3'-tenninal nucleotide that corresponds to the tag sequence.

Abstract

The invention relates to methods of genotyping single nucleotide differences in a nucleic acid sample. More particularly, the invention provides methods of identifying the nucleotide at a polymorphic site or a group of polymorphic sites in a sample of genomic DNA. The method uses tagged primer extension in which a set of tag sequences correspond to the identity of the nucleotides at the polymorphic sites. Primer extension products are PCR amplified using a common set of tag-specific primers, the downstream primers bearing distinguishable labels. Following separation by size and/or charge, the detection of distinguishable label in a product of the anticipated size determines the identity of the nucleotide at the polymorphic site. The method is well-suited for the genotyping of multiple single-nucleotide differences in one series of reactions.

Description

METHODS OF DETECTING SEQUENCE DIFFERENCES
This application claims the priority of U.S. Provisional Application No. 60/392,331, filed June 28, 2002, the entirety of which is incoφorated herein by reference, including figures.
FIELD OF THE INVENTION
The invention relates to molecular genetic methods for the identification of sequence differences in the genome of an individual relative to the sequences of a population of individuals. More particularly, the invention relates to methods for the identification of single nucleotide differences in genomic sequences.
BACKGROUND OF THE INVENTION
The nucleic acids comprising the genome of an organism contain the genetic information for that organism. Variability in gene sequences between individuals accounts for many of the obvious phenotypic differences (such as pigmentation of hair, skin, etc.) and many non-obvious ones (such as drug tolerance and disease susceptibility). Even minute changes in a nucleotide sequence, including single base pair substitutions, can have a significant effect on the quality or quantity of a protein. Single nucleotide changes are referred to as single nucleotide polymoφhisms or simply SNPs, and the site at which the SNP occurs is referred to herein as a polymoφhic site. DNA polymoφhisms are located throughout the genome, within and between genes, and the various fornis may or may not result in differential gene function (as determined by comparing the function of two alternative forms of the same sequence). Most polymoφhisms do not alter gene function and are termed "neutral" polymoφhisms. Others do have affect gene function, for example, by changing the amino acid sequence of a protein, or by altering control sequences such as promoters or RNA splicing or degradation signals, and are more commonly referred to as mutations. Diseases associated with SNPs include: sickle cell anemia, β- thalassemias, diabetes, cystic fibrosis, hyperlipoproteinemia, a wide variety of autoimmune diseases, and the formation of some oncogenes, e.g., mutant p53. In addition to causing or affecting disease states, point mutations can cause altered pathogenicity or susceptibility to disease and resistance to therapeutics.
The ability to detect specific nucleotide alterations or mutations in DNA sequences is useful for a number of medical and non-medical puφoses. Methods capable of identifying nucleotide alterations permit screening and diagnosis of diseases associated with SNPs. Polymoφhisms are also useful in genetic studies to identify genes involved with a disease. If a polymoφhism alters the function of one or more genes such that disease susceptibility is increased, the polymoφhism will be present more often in individuals with the disease relative to those without the disease. Statistical methods can be used to evaluate polymoφhism frequencies found in diseased relative to normal populations, and can facilitate the establishment of a causal link between a polymoφhism and a disease phenotype.
Methods that can quickly identify sequence variations that correlate with disease are also valuable in pemiitting prophylactic measures, in the assessment of the likelihood of developing disease and in evaluating the prognosis of such disease. Non-medical applications of SNPs include, for example, the detection of microorganisms or particular strains of them, and in forensic analysis.
Central to the usefulness of SNPs is the ability to determine the genotype of an individual with respect to known SNPs. A number of approaches to the problem have been taken. For example, some polymoφhisms fortuitously result in changes in restriction endonuclease cleavage sites, thereby changing the pattern of fragments observed when a digested genomic DNA sample is separated by electrophoresis. This is the basis for Restriction Fragment Length Polymoφhism analysis, or RFLP analysis. RFLP analysis is limited in that it can only detect those changes that affect a restriction endonuclease cleavage site, and the method is dependent upon gel electrophoresis and staining, which limits throughput.
Single-strand conformational polymoφhism (SSCP) analysis can also detect SNPs in an amplified DNA fragment. In this method, the amplified fragment is denatured then allowed to re-anneal during electrophoresis in non-denaturing polyacrylamide gels. The presence of single nucleotide sequence changes can cause a detectable change in the conformation and electrophoretic migration of a sample relative to wild-type sequence. This method is limited in its dependence upon polyacrylamide gel electrophoresis.
Hybridization-based methods employ allele-specific oligonucleotide (ASO) probes (see, e.g., European Patent Publications EP -237362 and EP-32931 1). The hybridization-based methods include, for example, detection based on ribonuclease A cleavage at mismatches in probe RNA:sample DNA duplexes or denaturing gradient gel electrophoresis for mismatches in probe DNA:sample DNA duplexes (reviewed in Landegren et al., Science 242:229-237, 1988; Rossiter et al, J. Biol. Chem. 265:12753-12756, 1990).
Other methods of genotyping SNPs employ allele-specific amplification (see, e.g., U.S. Pat. Nos. 5,521,301; 5,639,611; and 5,981,176), mini-sequencing methods, quantitative RT-PCR methods (eg., the so-called "TaqMan assays"; see, e.g., U.S. Pat. No. 5,210,015 to Gelfand, U.S. Pat. No. 5,538,848 to Livak, et al., and U.S. Pat. No. 5,863,736 to Haaland, as well as Heid, C.A., et al. Genome Research, 6:986-994 (1996); Gibson, U .E. M, et al., Genome Research 6:995-1001 (1996); Holland, P. M., et al. Proc. Natl. Acad. Sci. USA 88:7276-7280, (1991); and Livak, K. J., et al., PCR Methods and Applications 357-362 (1995)), and single nucleotide primer extension (SNuPE) assays (e.g., U.S. Pat No. 5,846,710) and related extension assays
(e.g, U.S. Pat. Nos. 6,004,744; 5,888,819; 5,856,092; 5,710,028 and 6,013,431). There is a need in the art for improved SNP genotyping assays.
Most SNP genotyping methods rely at some point upon PCR amplification, either to generate enough material for analysis (e.g., SSCP analysis) or to differentially amplify one form over another so as to detect differences (e.g, the primer extension assays). In order to increase the throughput of PCR-based methods, efforts are being focused on multiplexing the reactions so that multiple SNPs can be detected in a single set of reactions. Multiplexing by simply adding primer pairs specific for multiple SNP-containing fragments faces problems caused by primer interactions that lead to inefficient amplification of target fragments and to the generation of artifact fragments. There is a need in the art for improved multiplex SNP genotyping methods.
Capillary electrophoresis (CE) has been used to examine SNPs. One study used CE to analyze the results of a single nucleotide polymerase extension assay (Piggee et al, 1997, J. Chromatography A. 781 : 367-375). In that study, PCR-amplified DNA containing a known SNP was analyzed by hybridization of a primer immediately adjacent to the polymoφhic site and extension of the primer with a single fluorescently labeled chain terminator, followed by CE separation and detection of the incoφorated label. In another study, PCR-amplified DNA containing a known SNP was extended with one of two identically fluorescently labeled chain terminators, followed by CE separation and detection of incoφorated label. The identities of incoφorated terminators are determined based on sequence-specific differences in CE migration for oligonucleotides. McClay et al. (2002, Anal. Biochem. 301 : 200-206) describe an SNP genotyping assay involving PCR using a set of two differentially fluorescently labeled primers differing in their 3 '-terminal base with a common upstream primer, followed by CE and fluorescent detection. Throughput was increased by mixing amplification products of different sizes and electrophoresing together.
U.S. Patent No. 6,074,831 teaches the use of CE for the concurrent separation of molecules partitioned into subsets according to graph theory techniques, and the application of the method to SNP genotyping.
U.S. Patent No. 6,322,980 describes the use of CE in an SNP detection method using the exonuclease activity of a polymerase to release a fluorescent label from a primer hybridized to the polymoφhic site. U.S. Patent No. 6,270,973 also describes the use of CE separation in an SNP genotyping method involving nucleic acid probe depolymerizing activity.
U.S. Patent No. 6,312,893 describes a sequencing method that generates organically tagged fragments in which the tag correlates with a particular nucleotide. Fragments are separated by CE, followed by tag cleavage from the fragments and detection of cleaved tags by non-fluorescent spectrometry or potentiometry.
U.S. Patent No. 6,156,178 describes the use of CE in an SNP detection method using a depolymerizing activity to release an identifier nucleotide from a primer hybridized to the polymoφhic site.
None of the above methods uses nucleic acid sequence tags in either primer extension or amplification steps, different primers for extension and amplification, common amplification primer sets or real-time amplification monitoring and detection.
SUMMARY OF THE INVENTION
The invention provides methods useful for genotyping nucleic acid samples with regard to sequence differences. In a preferred aspect, the methods are useful for the determination of single nucleotide differences, e.g, single nucleotide polymorphisms. The methods of the invention use PCR amplification of primer extension products comprising heterologous sequence tags, followed by capillary electrophoretic size separation and detection of the amplified extension products. In one aspect, the size separation and product detection are performed in real time. Because the CE separation and detection techniques provide information including the amplified fragment size and the identity of label present on any given amplification product, the disclosed methods are particularly well suited for simultaneously analyzing samples for genotype with regard to multiple known SNPs. Each known SNP can be detected by the amplification of a discretely sized amplification fragment bearing a distinguishably labeled sequence tag that specifically correlates with the presence of a particular nucleotide at that polymoφhic site. Methods according to the invention also have the advantage of requiring one set of amplification primers for the detection of multiple SNPs, thereby reducing the impact of problems related to the use of multiple different amplification primers.
The invention encompasses a method of determining for a given nucleic acid sample, the identity of the nucleotide at a known polymoφhic site, the method comprising: a) subjecting to an amplification regimen a population of primer extension products generated from a nucleic acid sample, each primer extension product comprising a tag sequence, which tag sequence specifically corresponds to the presence of one specific nucleotide at a known polymoφhic site, wherein the amplification regimen is performed using an upstream amplification primer and a set of distinguishably labeled downstream amplification primers, each member of the set of downstream amplification primers comprising a tag sequence comprised by a member of the population of primer extension products and a distinguishable label, wherein each distinguishable label specifically corresponds to the presence of a specific nucleotide at the polymoφhic site; and b) detecting incorporation of a distinguishable label into a nucleic acid molecule, thereby to determine the identity of the nucleotide at the polymoφhic site.
In one embodiment, the distinguishable label is a fluorescent label.
In another embodiment step (b) comprises separating nucleic acid molecules made during the amplification regimen by size and/or by charge. In a preferred embodiment the separating comprises capillary electrophoresis.
In another embodiment the amplification regimen comprises at least two amplification reaction cycles, wherein each cycle comprises the steps of: 1) nucleic acid strand separation; 2) oligonucleotide primer annealing; and 3) polymerase extension of annealed primers. In a preferred embodiment the method further comprises the steps, during the amplification regimen and after at least one of the reaction cycles, of removing an aliquot of the amplification reaction, separating nucleic acid molecules by size and/or by charge, and detecting the incoφoration of a distinguishable label, wherein the detecting determines the identity of the nucleotide at the polymoφhic site. In a further preferred embodiment the removing, separating and detecting are performed after each cycle in the regimen. In a further preferred embodiment the separating comprises capillary electrophoresis. In another embodiment, steps (a) and (b) are performed in a modular apparatus comprising a thermal cycler, a sampling device, a capillary electrophoresis device and a fluorescence detector.
In another embodiment the tag sequence comprises 15 to 40 nucleotides.
In another embodiment the set of distinguishably labeled downstream amplification primers consists of: a primer that comprises a tag sequence that specifically corresponds to the presence of A at the polymoφhic site; a primer that comprises a tag sequence that specifically corresponds to the presence of C at the polymoφhic site; a primer that comprises a tag sequence that specifically corresponds to the presence of G at the polymoφhic site; and a primer that comprises a tag sequence that specifically corresponds to the presence of T at the polymoφhic site.
In another embodiment the set of distinguishably labeled downstream amplification primers consists of a pair of oligonucleotides, one comprising a tag sequence that specifically corresponds to a first allele of the polymoφhic site and one comprising a tag sequence that specifically corresponds to a second allele of the polymoφhic site.
Another embodiment further comprises the step, before step (a), of removing primers not incoφorated when the population of primer extension products was made. In a further preferred embodiment the step of removing primers comprises degrading the primers not incoφorated when the population of primer extension products was made. In a further preferred embodiment the degrading is performed using a heat labile exonuclease. In a further preferred embodiment the heat labile exonuclease is selected from the group consisting of Exonuclease I and Exonuclease VII. In a further preferred embodiment wherein the heat labile exonuclease is thermally inactivated before continuing to step (a).
The invention further emcompasses a method of determining, for a given nucleic acid sample, the identities of the nucleotides at a set of known polymoφhic sites to be interrogated, the method comprising: a) subjecting to an amplification regimen, a population of primer extension products generated from a nucleic acid sample, each primer extension product comprising a member of a set of tag sequences, which tag sequence specifically corresponds to the presence of one specific nucleotide at a known polymoφhic site, wherein the amplification regimen is performed using one upstream amplification primer for each sequence comprising a known polymoφhic site to be interrogated, and a set of distinguishably labeled downstream amplification primers, each member of the set of downstream amplification primers comprising a tag sequence comprised by a member of the population of primer extension products and a distinguishable label that specifically corresponds to the presence of a specific nucleotide at the polymoφhic site, and wherein the upstream amplification primers are selected such that each polymoφhic site of the set of known polymoφhic sites to be interrogated corresponds to a distinctly sized amplification product; and b) detecting incoφoration of a distinguishable label in distinctly sized amplification products, thereby to determine the identity of the nucleotide at each polymoφhic site.
In one embodiment, the distinguishable label is a fluorescent label.
In another embodiment step (b) comprises separating nucleic acid molecules made during the amplification regimen by size and/or by charge. In a preferred embodiment the separating comprises capillary electrophoresis.
In one embodiment the amplification regimen comprises at least two amplification reaction cycles, wherein each cycle comprises the steps of: 1) nucleic acid strand separation; 2) oligonucleotide primer annealing; and 3) polymerase extension of annealed primers.
A preferred embodiment further comprises the steps, during the amplification regimen and after at least one of the reaction cycles, of removing an aliquot of the amplification reaction, separating nucleic acid molecules by size and/or by charge, and detecting the incoφoration of a the distinguishable label, wherein the detecting determines the identity of the nucleotide at the polymoφhic site. In a further preferred embodiment the removing, separating and detecting are performed after each cycle in the regimen. In a further preferred embodiment the separating comprises capillary electrophoresis.
In another embodiment steps (a) and (b) are perforaied in a modular apparatus comprising a thermal cycler, a sampling device, a capillary electrophoresis device and a fluorescent detector.
In another embodiment the tag sequence comprises 15 to 40 nucleotides.
In another embodiment the set of distinguishably labeled downstream amplification primers consists of: a subset that comprises a tag sequence that specifically corresponds to the presence of A at the polymorphic site; a subset that comprises a tag sequence that specifically corresponds to the presence of C at the polymoφhic site; a subset that comprises a tag sequence that specifically corresponds to the presence of G at the polymoφhic site; and a subset that comprises a tag sequence that specifically corresponds to the presence of T at the polymoφhic site.
Another embodiment further comprises the step, before step (a), of removing primers not incoφorated when the population of primer extension products was made. In a preferred embodiment the step of removing primers comprises degrading the primers not incoφorated when the population of primer extension products was made. In a further preferred embodiment the degrading is performed using a heat labile exonuclease. In a further preferred embodiment the heat labile exonuclease is selected from the group consisting of Exonuclease I and Exonuclease VII. In a further preferred embodiment the heat labile exonuclease is thermally inactivated before continuing to step (a).
The invention further encompasses a method of determining, for a given nucleic acid sample, the identities of the nucleotides at a set of known polymoφhic sites to be interrogated, the method comprising: a) subjecting to an amplification regimen, a population of primer extension products generated from a nucleic acid sample, each primer extension product comprising a first tag sequence or its complement and a member of a set of second tag sequences or its complement, the presence of which second tag sequence or its complement specifically corresponds to the presence of one specific nucleotide at a known polymoφhic site, wherein for each polymoφhic site in the set of polymoφhic sites, the first tag sequence is located at a distinct distance 5' of the polymoφhic site, relative to the distance of the first tag sequence from a polymoφhic site on molecules in the sample containing other polymoφhic sites, wherein the amplification regimen is performed using an upstream amplification primer comprising the first tag sequence, and a set of distinguishably labeled downstream amplification primers, each member of the set of downstream amplification primers comprising a tag sequence comprised by a member of the population of primer extension products and a distinguishable label that specifically corresponds to the presence of a specific nucleotide at the polymoφhic site, and wherein the upstream amplification primers are selected such that each polymoφhic site of the set of known polymoφhic sites to be interrogated corresponds to a distinctly sized amplification product; and b) detecting incoφoration of a distinguishable label in distinctly sized amplification products, thereby to determine the identity of the nucleotide at each the polymoφhic site.
In one embodiment, the distinguishable label is a fluorescent label. In another embodiment step (b) comprises separating nucleic acid molecules made during the amplification regimen by size and/or by charge. In a preferred embodiment wherein the separating comprises capillary electrophoresis.
In another embodiment the amplification regimen comprising at least two amplification reaction cycles, wherein each cycle comprises the steps of: 1) nucleic acid strand separation; 2) oligonucleotide primer annealing; and 3) polymerase extension of annealed primers. A preferred embodiment further comprises the steps, during the amplification regimen and after at least one of the reaction cycles, of removing an aliquot of the amplification reaction, separating nucleic acid molecules by size andor by charge, and detecting the incoφoration of a distinguishable label, wherein the detecting determines the identity of the nucleotide at the polymoφhic site. In a further preferred embodiment the removing, separating and detecting are performed after each cycle in the regimen. In a further preferred embodiment the separating comprises capillary electrophoresis.
In another embodiment steps (a) and (b) are performed in a modular apparatus comprising a thermal cycler, a sampling device, a capillary electrophoresis device and a fluorescent detector.
In another embodiment the tag sequence comprises 15 to 40 nucleotides.
In another embodiment the set of distinguishably labeled downstream amplification primers consists of: a subset that comprises a tag sequence that specifically corresponds to the presence of A at the polymoφhic site; a subset that comprises a tag sequence that specifically corresponds to the presence of C at the polymoφhic site; a subset that comprises a tag sequence that specifically corresponds to the presence of G at the polymorphic site; and a subset that comprises a tag sequence that specifically corresponds to the presence of T at the polymoφhic site.
Another embodiment further comprises the step, before step (a), of removing primers not incoφorated when the population of primer extension products was made. In a preferred embodiment the step of removing primers comprises degrading the primers not incoφorated when the population of primer extension products was made. In a further preferred embodiment, the degrading is performed using a heat labile exonuclease. In a further preferred embodiment the heat labile exonuclease is selected from the group consisting of Exonuclease I and Exonuclease VII. In a further preferred embodiment the heat labile exonuclease is thermally inactivated before continuing to step (a).
The invention further encompasses a method of determining the identity of a single nucleotide at a known polymoφhic site, the method comprising: I) providing a nucleic acid sample comprising the polymoφhic site; II) separating the strands of the nucleic acid sample and re-annealing in the presence of: a) a first oligonucleotide primer comprising a 3 ' region that hybridizes to a sequence at a known distance upstream of the known polymoφhic site, the first oligonucleotide primer comprising a first sequence tag located 5' of the 3' region; and b) a set of second oligonucleotide primers, wherein each member of the set comprises: i) a region that hybridizes 3' of and adjacent to the polymoφhic site; ii) a variable 3' terminal nucleotide, wherein, when the member is hybridized to the known sequence, the 3' temiinal nucleotide is opposite the polymoφhic site, and wherein, if and only if the 3' terminal nucleotide is complementary to the nucleotide at the polymoφhic site, the 3' terminal nucleotide base pairs with the nucleotide at the polymoφhic site; and iii) a tag sequence that corresponds to the varaible 3 '-terminal nucleotide of (ii), the tag sequence located 5' of the region of (i) on the member; III) contacting the annealed oligonucleotides resulting from step (II) with a nucleic acid polymerase under conditions that permit the extension of an annealed oligonucleotide such that extension products are generated, wherein the primer extension product from the first oligonucleotide primer, when separated from its complement, can serve as a template for the synthesis of the extension product of a member of the set of second oligonucleotide primers, and vice versa; IV) repeating strand separating and contacting steps (II) and (III) two times, such that a population of nucleic acid molecules is generated that comprises both a sequence identical to or complementary to the first oligonucleotide and a sequence identical to or complementary to one of the members of the second set of oligonucleotides; V) contacting the population generated in step (IV) with a heat-labile exonuclease under conditions permitting the degradation of non-annealed oligonucleotide primers, such that the primers are degraded; VI) thermally inactivating the heat-labile exonuclease; VII) subjecting the population of nucleic acid molecules to an amplification regimen, wherein the amplification regimen is performed using an upstream amplification primer comprising the first sequence tag comprised by the first oligonucleotide primer, and a set of downstream amplification primers, each member of the set of downstream amplification primers comprising a tag comprised by a member of the set of second oligonucleotide primers and a distinguishable label; and VIII) detecting incoφoration of at least one distinguishable label, thereby determining the identity of the nucleotide at the known polymoφhic site.
In one embodiment, the distinguishable label is a fluorescent label.
In another embodiment step (VIII) comprises separating nucleic acid molecules made during the amplification regimen by size and/or by charge, h a preferred embodiment the separating comprises capillary electrophoresis.
In another embodiment the amplification regimen comprises at least two amplification reaction cycles, wherein each cycle comprises the steps of: 1) nucleic acid strand separation; 2) oligonucleotide primer annealing; and 3) polymerase extension of annealed primers. A preferred embodiment further comprises the steps, during the amplification regimen and after at least one of the reaction cycles, of removing an aliquot of the amplification reaction, separating nucleic acid molecules by size and/or by charge, and detecting the incoφoration of a distinguishable label, wherein the detecting determines the identity of the nucleotide at the polymoφhic site. In another preferred embodiment the removing, separating and detecting are performed after each cycle in the regimen.
In another embodiment steps I- VIII are performed in a modular apparatus comprising a thermal cycler, a sampling device, a capillary electrophoresis device and a fluorescent detector.
In another embodiment the tag sequences each comprise 15 to 40 nucleotides.
In another embodiment the 3' region that hybridizes to a sequence at a known distance upstream of the known polymorphic site comprises 10-30 nucleotides.
In another embodiment the region that hybridizes 3' of and adjacent to the polymoφhic site comprises 10-30 nucleotides.
In another embodiment the set of downstream amplification primers consists of: a subset that comprises a tag sequence that specifically corresponds to the presence of A at the polymorphic site; a subset that comprises a tag sequence that specifically corresponds to the presence of C at the polymorphic site; a subset that comprises a tag sequence that specifically corresponds to the presence of G at the polymoφhic site; and a subset that comprises a tag sequence that specifically corresponds to the presence of T at the polymoφhic site. The invention further encompasses a method of determining the identities of single nucleotides present at a group of known polymoφhic sites, the method comprising: I) providing a nucleic acid sample comprising the group of polymoφhic sites; II) separating the strands of the nucleic acid sample and re-annealing in the presence of: a) a set of first oligonucleotide primers each comprising a 3' region that hybridizes to a sequence at a known distance upstream of a known polymoφhic site, each member of the set of first oligonucleotide primers comprising a common sequence tag located 5' of the 3' region, and each member of the set of first oligonucleotide primers selected such that a distinctly sized amplification product is generated for each polymoφhic site in the group of known polymoφhic sites; and b) a set of downstream amplification primers comprising, in 5' to 3' order: i) a sequence tag selected from the group consisting of a tag specifically corresponding to G as the 3 '-terminal nucleotide of the primer; a tag specifically corresponding to A as the 3' -terminal nucleotide of the primer; a tag specifically corresponding to T as the 3 '-terminal nucleotide of the primer; and a tag specifically corresponding to C as the 3 '-terminal nucleotide of the primer; ii) a region that specifically hybridizes to a sequence adjacent to and 3' of a polymoφhic site in the group of polymoφhic sites, wherein the set of downstream amplification primers comprises a subset of primers comprising a region that specifically hybridizes adjacent to the polymoφhic site for each polymoφhic site in the group of polymorphic sites; and iii) a 3' terminal nucleotide selected from G, A, T or C, wherein the terminal nucleotide specifically corresponds to the sequence tag described in (i) on that downstream amplification primer, and wherein when the downstream amplification primer is hybridized to the sequence adjacent to and 3' of a polymoφhic site, the 3' terminal nucleotide is opposite the polymoφhic site; III) contacting the annealed oligonucleotides resulting from step (II) with a nucleic acid polymerase under conditions that permit the extension of an annealed oligonucleotide such that extension products are generated, wherein the primer extension product from the first oligonucleotide primer, when separated from its complement, can serve as a template for the synthesis of the extension product of as member of the set of second oligonucleotide primers, and vice versa; IV) repeating strand separating and contacting steps (II) and (III) two times, such that a reaction mixture comprising a population of nucleic acid molecules is generated that comprises both a sequence identical to or complementary to the first oligonucleotide and a sequence identical to or complementary to a member of the set of downstream amplification primers; V) contacting the population gnerated in step (IV) with a heat-labile exonuclease under conditions permitting the degradation of non- annealed oligonucleotide primers, such that non-annealed primers are degraded; VI) thermally inactivating the heat-labile exonuclease; VII) subjecting the population of nucleic acid molecules to an amplification regimen, wherein the amplification regimen is performed using an upstream amplification primer comprising the common sequence tag comprised by the first oligonucleotide primer, and a set of downstream amplification primers, each member of the set of downstream amplification primers comprising a tag comprised by a member of the set of second oligonucleotide primers and a distinguishable label; and VIII) detecting incoφoration of at least one distinguishable label, thereby deteraiining the identities of the nucleotides present at the known polymoφhic sites.
In one embodiment the distinguishable label is a fluorescent label.
In one embodiment the step (VIII) comprises separating nucleic acid molecules made during the amplification regimen by size and/or by charge. In a preferred embodiment the separating comprises capillary electrophoresis.
In another embodiment the amplification regimen comprising at least two amplification reaction cycles, wherein each cycle comprises the steps of: 1) nucleic acid strand separation; 2) oligonucleotide primer annealing; and 3) polymerase extension of annealed primers. A preferred embodiment further comprises the steps, during the amplification regimen and after at least one of the reaction cycles, of removing an aliquot of the amplification reaction, separating nucleic acid molecules by size and/or by charge, and detecting the incoφoration of a distinguishable label, wherein the detecting determines the identity of the nucleotide at the polymoφhic site. In a further preferred embodiment the removing, separating and detecting are performed after each cycle in the regimen.
In another embodiment steps I- VIII are performed in a modular apparatus comprising a thermal cycler, a sampling device, a capillary electrophoresis device and a fluorescent detector.
In another embodiment the tag sequences each comprise 15 to 40 nucleotides.
In another embodiment the 3' region that hybridizes to a sequence at a known distance upstream of the known polymoφhic site comprises 10-30 nucleotides.
In another embodiment the region that hybridizes 3' of and adjacent to the polymoφhic site comprises 10-30 nucleotides.
In another embodiment the set of distinguishably labeled downstream amplification primers consists of: a subset that comprises a tag sequence that specifically corresponds to the presence of A at the polymoφhic site; a subset that comprises a tag sequence that specifically corresponds to the presence of C at the polymoφhic site; a subset that comprises a tag sequence that specifically corresponds to the presence of G at the polymoφhic site; and a subset that comprises a tag sequence that specifically corresponds to the presence of T at the polymoφhic site.
The invention further encompasses a kit for the determination of the nucleotide present at a polymoφhic site present on a nucleic acid sample, the kit comprising a set of upstream primers comprising: a) a first primer comprising a 5 '-tag sequence and 3' sequence sufficient to specifically hybridize at a known distance upstream of a known polymoφhic site; and b) a set of 4 downstream second primers, comprising in 5' to 3' order: i) a sequence tag selected from the group consisting of a tag specifically corresponding to G as the 3 '-terminal nucleotide of the primer; a tag specifically corresponding to A as the 3 '-terminal nucleotide of the primer; a tag specifically corresponding to T as the 3 '-terminal nucleotide of the primer; and a tag specifically corresponding to C as the 3 '-terminal nucleotide of the primer; ii) a region that specifically hybridizes to a sequence adjacent to and 3' of a polymoφhic site in the group of polymoφhic sites, wherein the set of downstream amplification primers comprises a subset of primers comprising a region that specifically hybridizes adjacent to the polymoφhic site for each polymoφhic site in the group of polymoφhic sites; and iii) a 3' terminal nucleotide selected from G, A, T or C, wherein the terminal nucleotide specifically corresponds to the sequence tag described in (i) on that downstream amplification primer, and wherein when the downstream amplification primer is hybridized to the sequence adjacent to and 3' of a polymoφhic site, the 3' terminal nucleotide is opposite the polymoφhic site.
One embodiment further comprises a set of 5 primers lacking sequence specific for a gene in the genome of the organism being examined for polymoφhisms, the primers comprising a primer comprising the tag sequence of the first primer and a set of four distinguishably labeled primers comprising the tag sequences of the set of four downstream second primers.
As used herein, the term "sample" refers to a biological material which is isolated from its natural environment and containing a polynucleotide. A "sample" according to the invention can consist of purified or isolated polynucleotide, or it may comprise a biological sample such as a tissue sample, a biological fluid sample, or a cell sample comprising a polynucleotide. A biological fluid includes blood, plasma, sputum, urine, cerebrospinal fluid, lavages, and leukophoresis samples. A sample of the present invention may be any plant, animal, bacterial or viral material containing a polynucleotide.
As used herein, the term "polymoφhism" refers to a nucleic acid sequence variation. When compared to a naturally occurring sequence, a polymoφhism can be present at a frequency of greater than 0.01%, 0.1%, 1% or greater in a population. As used herein, a polymoφhism can be an insertion, deletion, duplication, or rearrangement. As used herein, a "single nucleotide polymoφhism" or "SNP" refers to nucleic acid sequence variation at a single nucleotide residue, including a single nucleotide deletion, insertion, or base change. A polymoφhism, including a SNP, can be phenotypically neutral or can have an associated variant phenotype that distinguishes it from that exhibited by the predominant sequence at that locus. As used herein, "neutral polymoφhism" refers to a polymoφhism in which the sequence variation does not alter gene function, and "mutation" or "functional polymoφhism" refers to a sequence variation which does alter gene function, and which thus has an associated phenotype.
When referring to the genotype of an individual with regard to an SNP, the "predominant allele" is that which occurs most frequently in the population being examined (i.e., when there are two alleles, the allele that occurs in greater than 50% of the population is the predominant allele; when there are more than two alleles, the "predominant allele" is that which occurs in the subject population at the highest frequency, e.g, at least 5% higher frequency, relative to the other alleles at that site). The term "variant allele" is used to refer to the allele or alleles occurring less frequently than the predominant allele in that population (e.g, when there are two alleles, the variant allele is that which occurs in less than 50% of the subject population; when there are more than two alleles, the variant alleles are all of those that occur less frequently, e.g, at least 5% less frequently, than the predominant allele).
As used herein, the term "polymoφhic site" refers to the position, in a polymoφhic nucleotide sequence, of the nucleotide that varies among individuals.
As used herein, an "oligonucleotide primer" refers to a polynucleotide molecule (i.e., DNA or RNA) capable of annealing to a polynucleotide template and providing a 3' end to produce an extension product which is complementary to the polynucleotide template. The conditions for initiation and extension usually include the presence of four different deoxyribonucleoside triphosphates and a polymerization-inducing agent such as DNA polymerase or reverse transcriptase, in a suitable buffer ("buffer" includes substituents which are cofactors, or which affect pH, ionic strength, etc.) and at a suitable temperature. The primer according to the invention may be single- or double-stranded. The primer is single-stranded for maximum efficiency in amplification, and the primer and its complement form a double-stranded polynucleotide. "Primers" useful in the present invention are less than or equal to 100 nucleotides in length, e.g, less than or equal to 90, or 80, or 70, or 60, or 50, or 40, or 30, or 20, or 15, or equal to 10 nucleotides in length.
As used herein, the term "polymerase extension" means the template-dependent incoφoration of at least one complementary nucleotide, by a nucleic acid polymerase, onto the 3' end of an annealed primer. Polymerase extension preferably adds more than one nucleotide, preferably up to and including nucleotides corresponding to the full length of the template.
Conditions for polymerase extension vary with the identity of the polymerase. The temperature of polymerase extension is based upon the known activity properties of the enzyme. In general, although the enzymes retain at least partial activity below their optimal extension temperatures, polymerase extension by the most commonly used thermostable polymerases (e.g, Taq polymerase and variants thereof) is perfoπned at 65°C to 75°C, preferably about 68-72°C.
As used herein, the term "primer extension products" refers to nucleic acid molecules generated by the process of polymerase extension.
As used herein, the term "tag sequence," or simply "tag" refers to a nucleotide sequence, preferably a heterologous or artificial nucleotide sequence, that is attached to an oligonucleotide primer via standard phosphodiester linkage (i.e., phosphodiester linkage between the 3 ' OH of the tag and the 5' phosphate of the oligonucleotide) and permits the identification or tracing of polynucleotides into which the "tag" is incoφorated (incoφorated for example, by primer extension or amplification of a primer extension product). A "tag" sequence according to the invention will comprise at least 15, and preferably 20 to 30 nucleotides and will preferably not hybridize under primer extension conditions to a sequence in the genome of the organism being genotyped. A tag sequence according to the invention can be, but is not necessarily, random.
As used herein, the term "specifically corresponds" means that a given nucleic acid tag sequence on an oligonucleotide is only used with a given 3 '-terminal nucleotide, such that the presence of the tag sequence is indicative of the presence of that 3 '-terminal nucleotide. For example, tag sequence "1" would only be used on an oligonucleotide with a 3 '-terminal A, tag sequence "2" would only be used on an oligonucleotide with a 3 '-terminal C, tag sequence "3" would only be used on an oligonucleotide with a 3 '-terminal G and tag sequence "4" would only be used on an oligonucleotide with a 3 '-terminal T. Thus, in a method according to the invention, if a fragment amplifies with a primer specific for tag 2, it is known that the 3 '-terminal nucleotide of the original primer extension primer was a C, and therefore, that the polymoφhic nucleotide is a G in that sample.
As used herein, the term "amplification regimen" refers to a process of specifically amplifying, i.e., increasing the abundance of, a nucleic acid sequence of interest. An amplification regimen according to the invention comprises at least two, and preferably at least 5, 10, 15, 20, 25, 30, 35 or more iterative cycles, where each cycle comprises the steps of: 1) strand separation (e.g, thermal denaturation); 2) oligonucleotide primer annealing to template molecules; and 3) nucleic acid polymerase extension of the annealed primers. Conditions and times necessary for each of these steps are well known in the art. Amplification achieved using an amplification regimen is preferably exponential, but can alternatively be linear. An amplification regimen according to the invention is preferably performed in a thermal cycler, many of which are commercially available.
As used herein, the term "set" means a group of nucleic acid samples, primers or other entities. A set will comprise a known number of, and at least two of such entities.
As used herein, the term "subset" means a group comprised by a set as defined herein, wherein the subset group is less than every member of the set. A subset as used herein can consist of a single entity.
As used herein, the relative terms "upstream" and "downstream" are used to refer to positions on a polynucleotide relative to a polymoφhic site. Generally, "upstream" refers to 5' of the polymoφhic site, and "downstream" refers to 3' of the polymoφhic site. It is understood that the choice of "upstream" and "downstream" in a double-stranded DNA sequence is largely arbitrary, in that one may choose to focus on either strand, and the direction that is "upstream" or "downstream" of the polymoφhic site will change, depending upon which strand is chosen as the "reference" strand. In order to avoid any ambiguity, as used herein to describe a given method, the "reference" strand for the selection of the terms "upstream" and "downstream" will remain the same throughout that method.
As used herein, the term "distinguishably labeled" means that the signal from one labeled oligonucleotide primer or a nucleic acid molecule into which it is incoφorated can be distinguished from the signal from another such labeled primer or nucleic acid molecule. Detectable labels can comprise, for example, a light-absorbing dye, a fluorescent dye, or a radioactive label. Fluorescent dyes are preferred. Generally, a fluorescent signal is distinguishable from another fluorescent signal if the peak emission wavelengths are separated by at least 20 nm. Greater peak separation is preferred, especially where the emission peaks of fluorophores in a given reaction are wide, as opposed to narrow or more abrupt peaks.
As used herein, the term "separating nucleic acid molecules" refers to the process of physically separating nucleic acid molecules in a sample or aliquot on the basis of size and/or charge. Electrophoretic separation is preferred, and capillary electrophoretic separation is most preferred.
As used herein, the term "detecting the incoφoration" refers to the process of determining whether a given labeled oligonucleotide primer has been extended, thereby incoφorating the label into the primer extension or amplification product. Detection can be by any means compatible with the detectable label, but will preferably involve detection of a fluorescent label. Detecting encompasses determination of both the presence and the abundance of label in a primer extension or amplification product. Fluorescence detectors are well known in the art.
As used herein, the tenn "specifically hybridizes" means that under given hybridization conditions a probe or primer hybridizes only to a target sequence in a sample comprising the target sequence. Given hybridization conditions include the conditions for the annealing step in an amplification regimen, i.e., annealing temperature selected on the basis of predicted Tm, and salt conditions suitable for the polymerase enzyme of choice.
As used herein, the term "strand separation" or "separating the strands" means treatment of a nucleic acid sample such that complementary double-stranded molecules are separated into two single strands available for annealing to an oligonucleotide primer. Strand separation according to the invention is achieved by heating the nucleic acid sample above its Tm. Generally, for a sample containing nucleic acid molecules in buffer suitable for a nucleic acid polymerase, heating to 94°C is sufficient to achieve strand separation according to the invention. An exemplary buffer contains 50 mM KC1, 10 mM Tric-HCl (pH 8.8@ 25°C), 0.5 to 3 mM MgCl2, and 0.1% BSA. As used herein, the term "primer annealing" or "re-annealing" means permitting oligonucleotide primers to hybridize to template nucleic acid strands. Conditions for primer annealing vary with the length and sequence of the primer and are based upon the calculated Tm for the primer. Generally, an annealing step in an amplification regimen involves reducing the temperature following the strand separation step to a temperature based on the calculated Tm for the primer sequence, for a time sufficient to permit such annealing. Tm can be readily predicted by one of skill in the art using any of a number of widely available algorithms (e.g, Oligo™ , Primer Design and programs available on the internet, including Primer3 and Oligo Calculator). For most amplification regimens, the annealing temperature is selected to be about 5°C below the predicted Tm, although temperatures closer to and above the Tm (e.g., between 1°C and 5°C below the predicted Tm or between 1°C and 5°C above the predicted Tm) can be used, as can temperatures more than 5°C below or above the predicted Tm (e.g, 6°C below, 8°C below, 10°C below or lower and 6°C above, 8°C above, or 10°C above). Generally, the closer the annealing temperature is to the Tm, the more specific is the annealing. Time of primer annealing depends largely upon the volume of the reaction, with larger volumes requiring longer times, but also depends upon primer and template concentrations, with higher relative concentrations of primer to template requiring less time than lower. Depending upon volume and relative primer/template concentration, primer annealing steps in an amplification regimen can be on the order of 1 second to 5 minutes, but will generally be between 10 seconds and 2 minutes, preferably on the order of 30 seconds to 2 minutes.
As used herein, the term "3' region that hybridizes to a sequence at a known distance upstream of a known polymoφhic site" refers to a sequence of nucleotides, located at the 3' end of an oligonucleotide, that specifically hybridize to a sequence upstream (i.e., 5') of a known polymoφhic site being genotyped in a sample of nucleic acid. The "3' region that hybridizes" will be at least 12 nucleotides long, and preferably at least 15, 18, 21, 24, 27, 30 nucleotides or more. The "region that hybridizes" is selected to be a known distance from the polymoφhic site so as to give rise to an amplification product that is distinctly sized relative to other amplification products in a method according to the invention. The "known distance" can be from 50 to 1000 nucleotides, and is preferably from 50 to 500 nucleotides or 50 to 250 nucleotides.
As used herein, a "region that hybridizes 3 ' of and adjacent to a polymoφhic site" is an oligonucleotide sequence, generally 10 to about 25 nucleotides in length, that specifically hybridizes 3' of a polymoφhic site, such that the penultimate 3' nucleotide of the region is hybridized one nucleotide downstream of the polymoφhic site. The invention makes use of a set of four primers comprising such a region, with the set comprised of oligonucleotides having four different 3' terminal nucleotides, G, A, T or C, only one of which will hybridize to the nucleotide at the polymoφhic site and permit primer extension by a nucleic acid polymerase.
As used herein, the term "variable 3 '-terminal nucleotide" refers to a 3 '-terminal nucleotide of an oligonucleotide that can be any of G, A, T or C.
As used herein, the term "opposite the polymoφhic site" means that a nucleotide, the 3'- terminal nucleotide on an oligonucleotide primer hybridized to a polymoφhism-containing nucleic acid strand, is positioned such that it will form a Watson-Crick hydrogen bonded base pair with the nucleotide at the polymoφhic position if the 3 '-terminal nucleotide is complementary to the nucleotide at the polymoφhic site.
As used herein, the term "complementary" refers to the hierarchy of hydrogen-bonded base pair fonnation preferences between the four deoxyribonucleotides G, A, T, and C, such that A pairs with T and G pairs with C.
As used herein, the phrase "nucleic acid polymerase" refers an enzyme that catalyzes the template-dependent polymerization of nucleoside triphosphates to form primer extension products that are complementary to one of the nucleic acid strands of the template nucleic acid sequence. A nucleic acid polymerase enzyme initiates synthesis at the 3' end of an annealed primer and proceeds in the direction toward the 5' end of the template. Numerous nucleic acid polymerases are known in the art and commercially available. One group of preferred nucleic acid polymerases are thermostable, i.e., they retain function after being subjected to temperatures sufficient to denature annealed strands of complementary nucleic acids.
As used herein, the term "aliquot" refers to a sample of an amplification reaction taken during the cycling regimen. An aliquot is less than the total volume of the reaction, and is preferably 0.1-30 % in volume. In one embodiment of the invention, for each aliquot removed, an equal volume of reaction buffer containing reagents necessary for the reaction (e.g, buffer, salt, nucleotides, and polymerase enzyme) is introduced.
As used herein, the term "conditions that permit the extension of an annealed oligonucleotide such that extension products are generated" refers to the set of conditions including, for example temperature, salt and co-factor concentrations, pH, and enzyme concentration under which a nucleic acid polymerase catalyzes primer extension. Such conditions will vary with the identity of the nucleic acid polymerase being used, but the conditions for a large number of useful polymerase enzymes are well known to those skilled in the art. One exemplary set of conditions is 50 mM KC1, 10 mM Tric-HCl (pH 8.8@ 25°C), 0.5 to 3 mM MgCl2, 200 μM each dNTP, and 0.1 % BS A at 72 °C, under which Taq polymerase catalyzes primer extension.
As used herein, the term "real time" means that the measurement of the accumulation of products in a nucleic acid amplification reaction is at least initiated, and preferably completed during or concurrent with the amplification regimen. Thus, for the measurement process to be considered "real time", at least the initiation of the measurement or detection of amplification products in each aliquot is concurrent with the amplification process. By "initiated" is meant that an aliquot is withdrawn and placed into a separation apparatus, e.g, a capillary electrophoresis capillary, and separation is begun. The completion of the measurement is the detection of labeled species in the separated nucleic acids from the aliquot. Because the time necessary for separation and detection may exceed the time of each individual cycle of the amplification regimen, there may be a lag in the detection of the amplification products of up to 120 minutes beyond the completion of the amplification regimen. Preferably such lag or delay is less than 30 minutes, e.g, 25 minutes, 20 minutes, 15 minutes, 10 minutes, 5 minutes, 4 minutes, 3 minutes, 2 minutes, 1 minute or less, including no lag or delay.
As used herein, the term "capillary electrophoresis" means the electrophoretic separation of nucleic acid molecules in an aliquot from an amplification reaction wherein the separation is performed in a capillary tube. Capillary tubes are available with inner diameters from about 10 to 300 μm, and can range from about 0.2 cm to about 3 m in length, but are preferably in the range of 0.5 cm to 20 cm, more preferably in the range of 0.5 cm to 10 cm. In addition, the use of microfluidic microcapillaries (available, e.g, from Caliper or Agilent Technologies) is specifically contemplated within the meaning of "capillary electrophoresis."
As used herein, the term "modular apparatus" means an apparatus that comprises individual units in which certain processes of the methods according to the invention are performed. The individual units of a modular apparatus can be but are not necessarily physically connected, but it is preferred that the individual units are controlled by a central control device such as a computer. An example of a modular apparatus useful according to the invention has a thermal cycler unit, a sampler unit, and a capillary electrophoresis unit with a fluorescence detector. The modular apparatus useful according to the invention can also comprise a robotic ami to transfer samples from the cycling reaction to the electrophoresis unit.
As used herein, the term "sampling device" refers to a mechanism that withdraws an aliquot from an amplification during the amplification regimen. Sampling devices useful according to the invention will preferably be adapted to minimize contamination of the cycling reaction(s), by, for example, using pipeting tips or needles that are either disposed of after a single sample is withdrawn, or by incorporating one or more steps of washing the needle or tip after each sample is withdrawn. Alternatively, the sampling device can contact the capillary to be used for capillary electrophoresis directly with the amplification reaction in order to load an aliquot into the capillary. Alternatively, the sample device can include a fluidic line (e.g. a tube) connected to the controllable valve which will open at particular cycle. Sampling devices known in the art include, for example, the multipuφose Robbins Scientific Hydra 96 pipettor, which is adapted to sampling to or from 96 well plates. This and others can be readily adapted for use according to the methods of the invention.
As used herein, the term "robotic arm" means a device, preferably controlled by a microprocessor, that physically transfers samples, tubes, or plates containing samples from one location to another. Each location can be a unit in a modular apparatus useful according to the invention. An example of a robotic arm useful according to the invention is the Mitsubishi RV- E2 Robotic Arm. Software for the control of robotic arms is generally available from the manufacturer of the arm.
As used herein, the term "amplified product" refers to polynucleotides which are copies of a portion of a particular polynucleotide sequence and/or its complementary sequence, which correspond in nucleotide sequence to the template polynucleotide sequence and its complementary sequence. An "amplified product," according to the invention, may be DNA or RNA, and it may be double-stranded or single-stranded.
As used herein, the term "distinctly sized amplification product" means an amplification product that is resolvable from amplification products of different sizes. "Different sizes" refers to nucleic acid molecules that differ by at least one nucleotide in length. Generally, distinctly sized amplification products useful according to the invention differ by greater than or equal to more nucleotides than the limit of resolution for the separation process used in a given method according to the invention. For example, when the limit of resolution of separation is one base, distinctly sized amplification products differ by at least one base in length, but can differ by 2 bases, 5 bases, 10 bases, 20 bases, 50 bases, 100 bases or more. When the limit of resolution is, for example, 10 bases, distinctly sized amplification products will differ by at least 10 bases, but can differ by 11 bases, 15 bases, 20 bases, 30 bases, 50 bases, 100 bases or more.
As used herein, the term "profile" or the equivalent terms "amplification curve" and
"amplification plot" mean a mathematical curve representing the signal from a detectable label incoφorated into a nucleic acid sequence of interest at two or more steps in an amplification regimen, plotted as a function of the cycle number from which the samples were withdrawn. The profile is preferably generated by plotting the fluorescence of each band detected after capillary electrophoresis separation of nucleic acids in the individual reaction samples. Most commercially available fluorescence detectors are interfaced with software permitting the generation of curves based on the signal detected.
The number of genes that could be investigated in a single reaction can be estimated based on the measurable difference of the product size ( 1-2 bases) and on the separable size of PCR products (500-1000bρ) and can be as high as 1000, but is preferably 100-200.
As used herein, the term "heat-labile exonuclease" refers to an enzyme that degrades single-stranded nucleic acid molecules or overhanging single strands on partially double stranded nucleic acid molecules and is irreversibly inactivated by incubation at an elevated temperature. The temperature for inactivation will vary with the enzyme and with, for example, buffer conditions and enzyme concentration. Conditions for enzyme inactivation are known to those skilled in the art. A non-limiting example of a heat-labile exonuclease useful according to the invention is Exonuclease I (Exol), from E. coli (commerically available from, e.g. New England Biolabs, Beverly MA). Exol is inactivated by incubation at 80°C for 20 minutes.
As used herein, the term "substantially lacking sequence specific for a gene in the genome of the organism" means that a given primer will not generate a primer extension product when incubated under primer extension conditions with genomic DNA from the organism being investigated with respect to polymoφhisms.
BRIEF DESCRIPTION OF THE FIGURES
Figure 1 shows a schematic diagram of primer extension reactions useful in one embodiment of the invention. SI and S5 are different sequence tags. Figure 2 shows a schematic diagram of an amplification regimen and detection useful in one embodiment of the invention. SI and S5 are tag sequence primers that differ from one another but are identical to SI to S 5 shown in Figure 1.
DETAILED DESCRIPTION OF THE INVENTION
The invention provides methods of determining the genotype of a nucleic acid sample with respect to known single nucleotide polymoφhisms. The methods of the invention employ primer extension reactions that incoφorate sequence tags permitting the simultaneous identification of the specific nucleotides present at a group of SNPs. Tagged fragments are then amplified using sets of primers specific for the tags wherein the downstream primer is labeled. During the amplification regimen, aliquots of the reaction are withdrawn and subjected to size separation and detection of the amplified fragments. The nucleotides present at the polymoφhic sites are identified based on the size and identity of the label attached to the amplified fragments. Because both amplimer size and incoφorated label are detected, the system is well suited for multiplexing. Further, the separation and detection are performed during the amplification reaction, such that a profile of the amplification reaction is generated in real time. The real time aspect provides rapid analysis as well as information regarding the course of the amplification that is useful in identifying and eliminating artifactual signals caused, for example, by interactions between primers.
Generating sequence tagged primer extension products:
As a first step, the invention requires the generation of sequence-tagged primer extension products. A critical aspect of this step is that the tag on any particular extension product specifically corresponds with the identity of the nucleotide at the polymoφhic site. In this step, the tag is incoφorated by the extension of a primer with the following general structure:
5'- Tagc - target complement - Vc-3'
wherein "Tagc" is the tag sequence that corresponds with the identity of the nucleotide at the 3' terminus of the primer, "target complement" is the 3' region of the primer that specifically hybridizes adjacent to the known SNP, arid Vc is a variable 3' terminal nucleotide that corresponds with the identity of the Tagc sequence. The Tagc sequence is preferably 20 to 30 nucleotides in length and preferably does not hybridize under primer extension conditions to a sequence in the genome of the organism being genotyped or to any of the other primers used in a given reaction. The "target complement" is long enough to provide specific hybridization between the primer and the sequence adjacent to a known SNP, and will generally be about 10 to 25 nucleotides in length. Vc is selected from dG, dA, dT and dC, and is positioned so that it is opposite the known polymoφhic site when the primer is hybridized to the nucleic acid sample being interrogated. Vc will base pair with the nucleotide at the polymoφhic site only if it is complementary to the nucleotide at that site. Because a nucleic acid polymerase, e.g, Taq polymerase, will only extend a primer if the 3 '-terminal nucleotide is base paired with the adjacent nucleotide on the template strand, the extension of a primer with a known 3 '-terminal nucleotide opposite the polymoφhic site identifies the nucleotide present at the polymoφhic site as the complement of the 3 ' -terminal nucleotide.
A set of downstream primer extension primers useful for the identification of an SNP will include four different tag sequences, one each to correspond to a 3 '-terminal dG, dA, dT or dC. Thus, if the tags are referred to as Tags 1-4, for example, Tag 1 would be used on the primer terminating in a 3' dG, Tag 2 would be used on the primer terminating in a 3' dA, Tag 3 would be used on the primer terminating in a 3' dT, and Tag 4 would be used on the primer terminating in a 3' dC. A major advantage of the methods disclosed herein is that one can use the same set of four downstream Tagc sequences in assays for multiple SNPs, because the resulting amplification products will differ in size. This limits the possibilities for non-template directed inteφrimer interactions in the amplification step that tend to interfere with multiplex amplifications.
Sequence-tagged upstream primers are used to generate the opposite strand of a given SNP-containing sequence. These primers will have the general structure:
5 '-Tag - target complement - 3'
wherein "Tag" refers to a sequence tag different from each of those used in a downstream set of primer extension primers, and "target complement" refers to a sequence complementary to a region upstream of the known SNP. The "Tag" sequence on the upstream primer is preferably 20 to 30 nucleotides in length and preferably does not hybridize under primer extension conditions to a sequence in the genome of the organism being genotyped, or to any of the other primers being used in a given reaction. The "target complement" is long enough to provide specific hybridization under primer extension conditions between the primer and a sequence upstream of a known SNP, and will generally be about 10 to 25 nucleotides in length. The distance upstream will generally be at least 50 nucleotides, but can be 50 to 1000 nucleotides or more, preferably 50 to 500, or 50 to 250 nucleotides upstream of the polymoφhic site. The distance of the upstream primer sequence from the polymoφhic site determines the size or length of the later amplification products. The sizes of the later amplification products must be selected so as to differ by more than the resolution limit of the system used for size separation. Thus, if the limit of resolution of separation is one base, the sizes of the amplification products should be selected to differ by at least one base in length, and preferably more (e.g, at least 5, 10, 15 bases or more). When the limit of resolution is, for example, 10 bases, sizes of the amplification products should differ by at least 10 bases, and preferably more (e.g, at least 15, 20, 25, 30 bases or more).
The terms "upstream" and "downstream" are used herein in order to facilitate the description of the invention. However, it is recognized that because of the double-stranded' nature of DNA, a polymoφhism could be approached with SNP-specific primers from either side, that is, from upstream or downstream, by hybridization of the primer to one strand as opposed to the other. The invention specifically contemplates the interrogation of SNPs on either strand of the genomic DNA.
In order to generate sequence-tagged primer extension products according to the invention, a nucleic acid sample is denatured, preferably by heat, e.g, to 95°C for 2 minutes or more, and allowed to re-anneal in the presence of an upstream extension primer and a set of downstream primer extension primers for each SNP to be interrogated in the reaction. The denaturing and annealing is best performed in a buffer compatible with the nucleic acid polymerase to be used for the primer extension reaction, e.g, IX Taq polymerase buffer. Re- annealing is performed at a temperature below the Tm of the primers, generally between about 20°C and 60°C, although lower or higher temperatures may be suitable for some primers. Primers should be present at about 15 to 500 nM for each primer. Optimal primer concentrations can be determined empirically by one of skill in the art with a minimum of experimentation, for example by setting up test reactions in which the primers are varied over the 15 to 500 nM range and analyzing the results with respect to the relative resolution, yield and specificity of the extension or amplification reactions.
Following annealing in the presence of the primers, polymerization is perforaied using a nucleic acid polymerase. Numerous polymerases sufficient for this step are known and can be selected by one skilled in the art. Among the most commonly used enzymes are the thermostable Taq polymerase and other thermostable polymerases, e.g, Pfu polymerase. Primer extension is performed under standard conditions for the enzyme chosen, e.g, 50 mM KC1, 10 mM Tric-HCl (pH 8.8@ 25°C), 0.5 to 3 mM MgCl2, and 0.1% BSA and 100 μM each dNTP at 72°C for two minutes.
The first round of primer extension results in a population in which one strand has an upstream primer and tag sequence incoφorated and the other strand has a downstream primer and tag sequence incoφorated. The downstream primer incoφorated for each SNP is the one in which the 3 '-terminal nucleotide was complementary to the nucleotide at the polymoφhic site on the target DNA. The incoφoration of that downstream primer necessarily incoφorates the tag sequence associated with or corresponding to that 3 '-terminal nucleotide. In order to generate a population in which molecules representing each strand carry both an upstream tag or its complement and a downstream tag or its complement, the products of the first primer extension reaction are subjected to another round of denaturing, re-annealing in the presence of the same primers, and polymerase extension of those primers.
Following the second round of primer extension, non-extended primers are removed.
Any method of primer removal can be used, e.g, electrophoresis or column chromatography, but it is preferred that a heat labile exonuclease specific for single-stranded DNA be used. The use of a heat-labile exonuclease avoids the need for time-consuming separation and purification procedures and the possibility for contamination or sample loss. Heat labile exonucleases useful according to the invention include, for example E. coli Exonuclease I (Exol), and Exonuclease VII (ExoVII) . Exol, for example, is active at 37°C but is inactivated by incubation for 20 minutes at 80°C.
The primers used for primer extension are removed so that new primers, corresponding to the incoφorated upstream and downstream tag sequences, can be used to amplify the primer extension products. Following the removal of the first primers, a set of primers comprising an upstream tag sequence primer and four downstream tag sequence primers is added. Each of the four downstream tag sequence primers is distinguishably labeled (e.g, end labeled) with a fluorescent dye. The mixture with the new primers added is then subjected to an amplification regimen comprising cycles of thermal denaturation, re-annealing and polymerase extension. The amplification regimen should comprise at least two cycles, but will preferably comprise 2 to 35 cycles, more preferably 10 to 30 cycles, and more preferably 15 to 25 cycles. During the cycling regimen, following at least one of the cycles of denaturation, primer annealing and primer extension in this aspect of the invention, a sample or aliquot of the reaction is withdrawn from the tube or reaction vessel, and nucleic acids in the aliquot are separated and detected. The separation and detection are performed concurrently with the cycling regimen, such that a curve representing product abundance as a function of cycle number can be generated while the cycling occurs. As used herein, the term "concurrently" means that the separation is at least initiated while the cycling regimen is proceeding. Depending upon the separation technology used (e.g, capillary electrophoresis) and the number and size of species to be separated in a given reaction, the separation will most often require on the order of 1-120 minutes per aliquot. Thus, when separation steps take longer than the duration of each cycle, and when samples are withdrawn after, for example, every cycle, the separation steps will be completed after the completion of the full cycling regimen. However, as used herein, this situation is still considered to be "concurrent" separation, as long as the separation of each sample was initiated during the cycling regimen. Concurrent separation is most preferably performed through use of a robotic sampler that deposits the samples to the separation apparatus immediately after the samples are withdrawn from the cycling reaction.
In the manner described above, the identity of the nucleotide at a polymorphic site is determined by detection of the fluorescent signals on the size-separated amplification products. Because each of the four downstream tag primers is labeled with a distinguishable fluorescent label, and because the tag on a given primer corresponds to the identity of the 3 '-terminal nucleotide of the original downstream primer extension primer, the incoφoration and detection of that fluorescently labeled tag identifies the nucleotide at the polymoφhic site.
In a preferred aspect, the original primer extension reactions include primer sets that recognize more than one SNP. In this aspect, each different polymoφhism will be represented by a distinctly sized amplification product. For example, one can include additional upstream primers, each comprising the same tag sequence and varying in the 3' region that hybridizes at a distinct distance upstream of an additional known SNP. In concert with the additional upstream primer, each additional SNP to be interrogated requires a set of four downstream primer extension primers, each member of the set comprising in 5' to 3' order: a) a tag sequence that corresponds to the 3 ' terminal nucleotide of that primer, wherein the tag sequence is the same tag sequence that corresponds to that 3 '-terminal nucleotide on the downstream primers used for other SNPs being interrogated in the same series of reactions; b) a region sufficient to direct specific hybridization of the primer downstream of and adjacent to a known SNP; and c) a variable 3 '-terminal nucleotide that corresponds to the tag sequence on that primer, wherein when the primer is hybridized to its genomic target sequence, the 3 '-terminal nucleotide is opposite the polymoφhic site and can base pair with the nucleotide at that site if it is complementary. Following two primer extension reactions and the removal of non-incoφorated primers as described above, a single amplification primer set is used, identical to that used when a single SNP is interrogated. That is, the amplification primer set will comprise an upstream primer comprising the upstream tag and a set of four distinguishably labeled primers comprising the four downstream tags on the primer extension primers, where the labels correspond to the tags that correspond to the nucleotides opposite the polymorphic site. The same amplification primer set can be used for each SNP interrogated because the incorporated tags are common between the sets. That is, all upstream primers have the same tag sequence, and all downstream primer extension primer sets have the same tag sequences corresponding to the same 3 ' terminal nucleotides. Each SNP interrogated will have a distinct size when separated, and the identity of the label incoφorated into a molecule of that size positively identifies the nucleotide present at that polymoφhic site. The ability to amplify and detect multiple SNPs with a single set of five amplification primers has the advantage of avoiding primer interaction problems prevalent when large numbers of primers are used for amplification. In addition, the effect of variations in primer annealing efficiency will be largely negated because all SNPs interrogated with a given amplification primer set will be affected by such variations to the same degree.
Further multiplexing can be achieved by using more than one set of five tag sequences. The additional sets will comprise tags distinct from those used in other sets. Care should be taken to avoid tags with complementarity to other tags to be used simultaneously. As above, each set will comprise upstream tags selected so that the amplification products are distinctly sized, and downstream tags in which the respective tags correspond to the 3 '-terminal nucleotides of the primer extension primers. For the amplification, the downstream primers can be labeled with the same corresponding fluorescent labels as the other sets, or, preferably with a different set of distinguishable fluorescent labels. Following size separation, the amplified SNP- containing fragments are identified by size, and the identity of the nucleotide at the polymoφhic site is identified by the label incoφorated, as described above. General Considerations for Primer Design
Oligonucleotide primers are generally 5 to 100 nucleotides in length, preferably from 17 to 45 nucleotides, although primers of different lengths are of use. Primers for primer extension reactions are preferably 10 to 60 nucleotides long, while primers for amplification are preferably about 17-25 nucleotides in length. Primers useful according to the invention can be designed to have a particular melting temperature (Tm) by the method of melting temperature estimation. Commercial programs, including Oligo™ , Primer Design and programs available on the internet, including PrimerS and Oligo Calculator can be used to calculate the Tm of a polynucleotide sequence useful according to the invention. Preferably, the Tm of an amplification primer useful according to the invention (e.g, a tag sequence), as calculated for example by Oligo Calculator, is between about 45°C and 653C and more preferably between about 50°C and 603C.
The Tm of a polynucleotide affects its hybridization to another polynucleotide (e.g, the annealing of an oligonucleotide primer to a template polynucleotide). In the methods of the invention, it is preferred that the oligonucleotide primers used in various steps selectively hybridize to a target template or to polynucleotides derived from the target template. Typically, selective hybridization occurs when two polynucleotide sequences are substantially complementary (at least about 65% complementary over a stretch of at least 14 to 25 nucleotides, preferably at least about 75%, more preferably at least about 90% complementary). See Kanehisa, M, 1984, Polynucleotide Res. 12: 203, incoφorated herein by reference. As a result, it is expected that a certain degree of mismatch at the priming site is tolerated. Such mismatch may be small, such as a mono-, di- or tri-nucleotides. Alternatively, a region of mismatch may encompass loops, which are defined as regions in which there exists a mismatch in an uninterrupted series of four or more nucleotides.
Numerous factors influence the efficiency and selectivity of hybridization of the primer to a second polynucleotide molecule. These factors, which include primer length, nucleotide sequence and/or composition, hybridization temperature, buffer composition and potential for steric hindrance in the region to which the primer is required to hybridize, will be considered when designing oligonucleotide primers according to the invention.
A positive correlation exists between primer length and both the efficiency and accuracy with which a primer will anneal to a target sequence. In particular, longer sequences have a higher melting temperature (TM) than do shorter ones, and are less likely to be repeated within a given target sequence, thereby minimizing promiscuous hybridization. Primer sequences with a high G-C content or that comprise palindromic sequences tend to self-hybridize, as do their intended target sites, since unimolecular, rather than bimolecular, hybridization kinetics are generally favored in solution. However, it is also important to design a primer that contains sufficient numbers of G-C nucleotide pairings since each G-C pair is bound by three hydrogen bonds, rather than the two that are found when A and T bases pair to bind the target sequence, and therefore forms a tighter, stronger bond. Hybridization temperature varies inversely with primer annealing efficiency, as does the concentration of organic solvents, e.g. formamide, that might be included in a priming reaction or hybridization mixture, while increases in salt concentration facilitate binding. Under stringent annealing conditions, longer hybridization probes or synthesis primers hybridize more efficiently than do shorter ones, which are sufficient under more permissive conditions. Preferably, stringent hybridization is performed in a suitable buffer (for example, IX Taq Polymerase Buffer, or other buffer suitable for enzymes used for primer extension and amplification) under conditions that allow the polynucleotide sequence to hybridize to the oligonucleotide primers. Stringent hybridization conditions can vary (for example from salt concentrations of less than about 1M, more usually less than about 500 mM and preferably less than about 200 mM) and hybridization temperatures can vary (for example, from as low as 03C to greater than 223C, greater than about 303C, and (most often) in excess of about 373C) depending upon the lengths and/or the polynucleotide composition or the oligonucleotide primers. Longer fragments may require higher hybridization temperatures for specific hybridization. As several factors affect the stringency of hybridization, the combination of parameters is more important than the absolute measure of a single factor.
Unlike the design of primers made to recognize a sequence anywhere on a given gene, primers designed to hybridize near a known SNP are limited with respect to the modifications one can make to manipulate Tm. For example, where one would normally be able to shift up- or downstream on a sequence to find a region with a more favorable GC content, when a primer is designed to hybridize adjacent to a SNP, one cannot move the primer to another location. In this situation, then, the primary means of manipulating Tm is to vary the length of the complementary sequence in the primer. Sequence tags useful according to the invention:
Tags useful according to the invention are preferably heterologous or artificial nucleotide sequences of at least 15, and preferably 20 to 30 nucleotides in length. A tag will preferably not hybridize under PCR annealing conditions to a sequence in the genome of the organism being genotyped. A tag sequence according to the invention can be, but is not necessarily random. One can determine whether a potential tag sequence hybridizes under PCR annealing conditions to a sequence in the genome of an organism by using the tag sequence as a labeled primer in a primer extension reaction with genomic DNA from the organism of interest as template. The labeled primer is annealed to the genomic DNA at the annealing temperature one plans to use for the amplification steps of the method of the invention, and then incubated with thermostable polymerase under extension conditions. The reaction products are then electrophoretically separated alongside labeled probe alone. If the labeled tag appears in a band or bands larger than the tag primer, the tag primer hybridized under PCR annealing conditions to a sequence in the genome of the organism being genotyped. Care should also be taken to avoid tags with complementarity to other tags intended for use in the same reaction.
Labeling of oligonucleotide primers
Oligonucleotide primers useful according to the invention can be labeled, as described below, by incoφorating moieties detectable by spectroscopic, photochemical, biochemical, immunochemical, enzymatic or chemical means. The method of linking or conjugating the label to the oligonucleotide primer depends, of course, on the type of label(s) used and the position of the label on the primer (i.e., 3 '-terminal, 5 '-terminal or body-labeled).
While fluorescent dyes are preferred, a variety of labels that would be appropriate for use in the invention, as well as methods for their inclusion in the primer, are known in the art and include, but are not limited to, enzymes (e.g, alkaline phosphatase and horseradish peroxidase) and enzyme substrates, radioactive atoms, chromophores, fluorescence quenchers, chemiluminescent labels, and electrochemiluminescent labels, such as Origen™ (Igen), that may interact with each other to enhance, alter, or diminish a signal. Of course, if a labeled molecule is used in a PCR based amplification assay involving thermal cycling, the label must be able to survive the temperature cycling required in this automated process. Ideally, four distinguishable labels that can be detected using similar equipment, methods and/or substrates are preferred. Fluorophores for use as labels in constructing labeled primers of the invention include, but are not limited to rhodamine and derivatives (such as Texas Red), fluorescein and derivatives (such as 5-bromomethyl fluorescein), Cy5, Cy3, JOE, FAM, Oregon Green™, Lucifer Yellow, IAEDANS, 7-Me2N-coumarin-4-acetate, 7-OH-4-CH3-coumarin-3-acetate, 7-NH2-4-CH3- coumarin-3 -acetate (AMCA), monobromobimane, pyrene frisulfonates, such as Cascade Blue, and monobromorimethyl-ammoniobimane. In general, fluorophores with wide Stokes shifts are preferred, to allow using fluorimeters with filters rather than a monochromometer and to increase the efficiency of detection.
The labels can be attached to the oligonucleotide directly or indirectly by a variety of techniques. Depending on the precise type of label or tag used, the label can be located at the 5' end of the primer or located internally in the primer, or attached to spacer arms of various sizes and compositions to facilitate signal interactions. 5' end labeling is preferred. Using commercially available phosphoramidite reagents, one can produce oligomers containing functional groups (e.g, thiols or primary amines) at the 5'- terminus via an appropriately protected phosphoramidite, and can label them using protocols described in, for example, PCR Protocols: A Guide to Methods and Applications, frmis et al, eds. Academic Press, Ind, 1990.
Methods for introducing oligonucleotide functionalizing reagents to introduce one or more sulfhydryl, amino or hydroxyl moieties into the oligonucleotide primer sequence, typically at the 5' terminus, are described in U.S. Patent No. 4,914,210. A 5' phosphate group can be introduced as a radioisotope by using polynucleotide kinase and gamma-32P-ATP or gamma-33P- ATP to provide a reporter group. Biotin can be added to the 5' end by reacting an aminothymidine residue, or a 6-amino hexyl residue, introduced during synthesis, with an N- hydroxysuccinimide ester of biotin.
Amplification
PCR methods are well-known to those skilled in the art, such as those described in Mullis and Faloona, 1987, Methods Enzymol, 155: 335, Saiki et al, 1985. Science 230:1350. andU.S. Patent Nos. 4,683,202, 4,683,195 and 4,800,159, each of which is incoφorated herein by reference. In its simplest form, PCR is an in vitro method for the enzymatic synthesis of specific DNA sequences, using two oligonucleotide primers that hybridize to opposite strands and flank the region of interest in the target DNA. A repetitive series of reaction steps involving template denaturation, primer annealing and the extension of the annealed primers by DNA polymerase results in the exponential accumulation of a specific fragment whose termini are defined by the 5' ends of the primers. PCR is reported to be capable of producing a selective enrichment of a specific DNA sequence by a factor of 109.
The length and temperature of each step of a PCR cycle, as well as the number of cycles, are adjusted according to the stringency requirements in effect. Annealing temperature and timing are determined both by the efficiency with which a primer is expected to anneal to a template and the degree of mismatch that is to be tolerated. The ability to optimize the stringency of primer annealing conditions is well within the knowledge of one of skill in the art. An annealing temperature between 20°C and 72°C is most commonly used. Initial denaturation of the template molecules is normally achieved by incubation at 92°C to 99°C for 4 minutes, followed by 20-40 cycles consisting of denaturation (94°C for 15 seconds to 1 minute), annealing (temperature based on Tm as discussed above, usually about 5°C below the Tra of the oligonucleotide in the reaction with the lowest Tm; usually 1-2 minutes), and extension (usually 72°C for 1-3 minutes).
Sampling
Sampling during the amplification regimen can be performed at any frequency or in any pattern desired. It is preferred that sampling occurs after each cycle in the regimen, although less frequent sampling can also be used, for example, every other cycle, every third cycle, every fourth cycle, etc. While a uniform sample interval will most often be desired, there is no requirement that sampling be performed at uniform intervals. As just one example, the sampling routine may involve sampling after every cycle for the first five cycles, and then sampling after every other cycle.
Sampling can be as simple as manually pipetting an aliquot from the reaction, but is preferably automated such that the aliquot is automatically withdrawn at predetermined sampling intervals. It is preferred that the reaction mixture is replenished at each withdrawal with equal volumes of fresh components such as dNTPs, primers and DNA polymerase. For this and other aspects of the invention, it is preferred, although not necessary that the cycling be performed in a microtiter or multiwell plate format. This format, which uses plates comprising multiple reaction wells, not only increases the throughput of the assay process, but is also well adapted for automated sampling steps due to the modular nature of the plates and the uniform grid layout of the wells on the plates. Common microtiter plate designs useful according to the invention have, for example 12, 24, 48, 96, 384 or more wells, although any number of wells that physically fit on the plate and accommodate the desired reaction volume (usually 10-100 μl) can be used according to the invention. Generally, the 96 or 384 well plate format is preferred.
An automated sampling process can be readily executed as a programmed routine and avoids both human error in sampling (i.e., error in sample size and tracking of sample identity) and the possibility of contamination from the person sampling. Robotic samplers capable of withdrawing aliquots from thermal cyclers are available in the art. For example, the Mitsubishi RV-E2 Robotic Arm can be used in conjunction with a SciClone™ Liquid Handler or a Robbins Scientific Hydra 96 pipettor.
The robotic sampler useful according to the invention can be integrated with the thermal cycler, or the sampler and cycler can be modular in design. When the cycler and sampler are integrated, thermal cycling and sampling occur in the same location, with samples being withdrawn at programmed intervals by a robotic sampler. When the cycler and sampler are modular in design, the cycler and sampler are separate modules. In one embodiment, the assay plate is physically moved, e.g, by a robotic arm, from the cycler to the sampler and back to the cycler.
The volume of an aliquot removed at the sampling step can vary, depending, for example, upon the total volume of the amplification reaction, the sensitivity of product detection, and the type of separation used. Amplification volumes can vary from several microliters to several hundred microliters (e.g, 5 μl, 10 μl, 20 μl, 40 μl, 60 μl, 80 μl, 100 μl, 120 μl, 150 μl, or 200 μl or more), preferably in the range of 10-150 μl, more preferably in the range of 10-100 μl. Aliquot volumes can vary from 0.1 to 30 % of the reaction mixture.
Separation of nucleic acids
Separation of nucleic acids according to the invention can be achieved by any means suitable for separation of nucleic acids, including, for example, electrophoresis, HPLC or mass spectrometry. Due to its speed and resolution, separation is preferably performed by capillary electrophoresis (CE).
CE is an efficient analytical separation technique for the analysis of minute amounts of sample. CE separations are performed in a narrow diameter capillary tube, which is filled with an electrically conductive medium termed the "carrier electrolyte." An electric field is applied between the two ends of the capillary tube, and species in the sample move from one electrode toward the other electrode at a rate which is dependent on the electrophoretic mobility of each species, as well as on the rate of fluid movement in the tube. CE may be performed using gels or liquids, such as buffers, in the capillary. In one liquid mode, known as "free zone electrophoresis," separations are based on differences in the free solution mobility of sample species. In another liquid mode, micelles are used to effect separations based on differences in hydrophobicity. This is known as Micellar Electrokinetic Capillary Chromatography (MECC).
CE separates nucleic acid molecules on the basis of charge, which effectively results in their separation by size or number of nucleotides. When a number of fragments are produced, they will pass the fluorescence detector near the end of the capillary in ascending order of size. That is, smaller fragments will migrate ahead of larger ones and be detected first.
CE offers significant advantages of over conventional electrophoresis, primarily in the speed of separation, small size of the required sample (on the order of 1-50 nl), and high resolution. For example, separation speeds using CE can be 10 to 20 times faster than conventional gel electrophoresis, and no post-run staining is necessary. CE provides high resolution, separating molecules in the range of about 10-1,000 base pairs differing by as little as a single base pair. High resolution is possible in part because the large surface area of the capillary efficiently dissipates heat, permitting the use of high voltages. In addition, band broadening is minimized due to the narrow inner diameter of the capillary. In free-zone electrophoresis, the phenomenon of electroosmosis, or electroosmotic flow (EOF) occurs. This is a bulk flow of liquid that affects all of the sample molecules regardless of charge. Under certain conditions EOF can contribute to improved resolution and separation speed in free-zone CE.
CE can be performed by methods well known in the art, for example, as disclosed in U.S. Patent Nos. 6,217,731; 6,001,230; and 5,963,456, which are incoφorated herein by reference. High throughput CE equipment is available commercially, for example, the HTS9610 High Throughput Analysis System and SCE 9610 fully automated 96-capillary electrophoresis genetic analysis system from Spectrumedix Coφoration (State College, PA). Others include the P/ACE 5000 series from Beckman Instruments Inc (Fullerton, CA) and the ABI PRISM 3100 genetic analyzer (Applied Biosystems, Foster City, CA). Each of these devices comprises a fluorescence detector that monitors the emission of light by molecules in the sample near the end of the CE column. The standard fluorescence detectors can distinguish numerous different wavelengths of fluorescence emission, providing the ability to detect multiple fluorescently labeled species in a single CE run from an amplification sample.
Another means of increasing the throughput of the CE separation is to use a plurality of capillaries, or preferably an array of capillaries. Capillary Array Electrophoresis (CAE) devices have been developed with 96 capillary capacity (e.g, the MegaBACE instrument from Molecular Dynamics) and higher, up to and including even 1000 capillaries. In order to avoid problems with the detection of fluorescence from DNA caused by light scattering between the closely juxtaposed multiple capillaries, a confocal fluorescence scanner can be used (Quesada et al, 1991, Biotechniques 10:616-25).
The apparatus for separation (and detection) can be separate from or integrated with the apparatus used for thermal cycling and sampling. Because according to the invention the separation step is initiated concurrently with the cycling regimen, samples are preferably taken directly from the amplification reaction and placed into the separation apparatus so that separation proceeds concurrently with amplification. Thus, while it is not necessary, it is preferred that the separation apparatus is integral with the thermal cycling and sampling apparatus. In one embodiment, this apparatus is modular, comprising a thermal cycling module and a separation/detection module, with a robotic sampler that withdraws sample from the thermal cycling reaction and places it into the separation/detection apparatus.
Detection
Amplification product detection methods useful according to the invention measure the intensity of fluorescence emitted by labeled primers when they are irradiated with light within the excitation spectrum of the fluorescent label. Fluorescence detection technology is highly developed and very sensitive, with documented detection down to a single molecule in some instances. High sensitivity fluorescence detection is a standard aspect of most commercially- available plate readers, microarray detection set-ups and CE apparatuses. For CE equipment, fiber optic transmission of excitation and emission signals is often employed. Spectrumedix, Applied Biosystems, Beckman Coulter and Agilent each sell CE equipment with fluorescence detectors sufficient for the fluorescence detection necessary for the methods described herein.
The fluorescence signals from two or more different fluorescent labels can be distinguished from each other if the peak wavelengths of emission are each separated by 20 nm or more in the spectrum. Generally the practitioner will select fluorophores with greater separation between peak wavelengths, particularly where the selected fluorophores have broad emission wavelength peaks. It follows that the more different fluorophores one wishes to include and detect concurrently in a sample, the narrower should be their emission peaks.
EXAMPLES
Example 1. Detection of single nucleotide differences.
Leber's hereditary optic neuropathy (LHON) is associated with the presence of several point mutations in mitochondrial DNA, at positions 3460, 11778 and 14459.
Mutant: SNP region (Polymoφhic site shown in BOLD, underline)
3460 5'-CGG GCTACT ACAACC CTT CGC TGA CGC CAT AAA-3'
(SEQ ID NO: 1)
11778 5'-TCAAAC TAG GAA CGC ACT CAC AGT CGC ATC ATA-3'
(SEQ ID NO: 2)
14459 5'-CTC AGG ATA CTC CTC AAT AGC CAT CGC TGT AGT-3'
(SEQ IDNO: 3)
The genotype of an individual with respect to SNPs in human mitochondrial DNA associated with Leber's hereditary optic neuropathy (LHON) can be determined as follows.
Primer Extension:
Primers:
a) Upstream primers.
The upstream primers are as follows:
Mutant Upstream primer (tag sequences are in lower case)
3460 5'-gttacaagat tctcacacgc taagg-TTC ATA GTA GAA GAG CGA TGG-3'
(SEQ ID NO: 4)
11778 5'-gttacaagat tctcacacgc taagg-AAA AAG CTA TTA GTG GGA GTA-3' (SEQ ID NO: 5) 14459 5 '-gttacaagat tctcacacgc taagg-TCG GGT GTG TTA TTA TTC TGA-3 '
(SEQ ID NO: 6)
b) Downstream primers.
The downstream primers are as follows:
Mutant Downstream Primer
3460 G-primer: 5 '-agttggcgaa gcagtcgcta gaagaCGG GCT ACT ACA ACC CTT CGC TGA CG-3' (SEQ ID NO: 7)
A-primer. 5'-gatgctggtg tggctggtgt tcccgCGG GCT ACT ACA ACC CTT CGC TGA CA-3' (SEQ ID NO: 8)
T-primer: 5'-ggttggttgc acactggaga tattggCGG GCT ACT ACA ACC CTT CGC TGA CT-3' (SEQ ID NO: 9)
C-primer: 5'-ctggagcatc tggaaaagta gtaccCGG GCT ACT ACA ACC CTT CGC TGA CC-3' (SEQ ID NO: 10)
11778 G-primer: 5 '-agttggcgaa gcagtcgcta gaagaTCA AAC TAG GAA CGC ACT CAC AGT CG-3' (SEQ ID NO: 11)
A-primer: 5'-gatgctggtg tggctggtgt tcccgTCA AAC TAC GAA CGC ACT CAC AGT CA-3' (SEQ ID NO: 12) T-primer: 5 '-ggttggttgc acactggaga tattggTC A AAC TAC GAA CGC ACT CAC AGT
CT-3' (SEQ ID NO: 13)
C-primer: 5'-ctggagcatc tggaaaagta gtaccTCA AAC TAC GAA CGC ACT CAC AGT CC-3' (SEQ ID NO: 14)
14459 G-primer: 5 '-agttggcgaa gcagtcgcta gaagaCTC AGG ATA CTC CTC AAT AGC CAT CG-3' (SEQ ID NO: 15)
A-primer: 5'-gatgctggtg tggctggtgt tcccgCTC AGG ATA CTC CTC AAT AGC CAT CA-3' (SEQ ID NO: 16)
T-primer: 5'-ggttggttgc acactggaga tattggCTC AGG ATA CTC CTC AAT AGC CAT CT-3' (SEQ ID NO: 17) C-primer: 5'-ctggagcatc tggaaaagta gtaccCTC AGG ATA CTC CTC AAT AGC CAT
CC-3' (SEQ ID NO: 18) The full set of 5 primer extension primers for each polymoφhic site (40 pmol each, 15 primers in total) is mixed with 1 μg of template genomic DNA from the individual to be tested, in IX Pfu buffer (20 mM Tris-HCl, pH 8.8, 10 mM KC1, 10 mM (NH4)2SO4, 2 mM MgSO4, 0.1% Triton-X-100 and 0.1 mg/ml nuclease-free BSA) in a total volume of 50 μl. The mixture is heated to 94°C for 2 minutes and slowly cooled to room temperature, to permit primer annealing. 1 μl (2.5 U/μl) of cloned Pfu polymerase plus 1.25 μl of each dNTP (final concentration 200 μM) is added, and the sample is incubated at 72°C for 3 minutes. The sample is then cycled to 94°C for 2 minutes, then 50°C for 1 minute, and 72°C for 3 minutes to generate a population of primer extension products with an upstream primer or its complement and a downstream primer or its complement.
Primer extension primers are removed by the addition of 20 U of E. coli Exonuclease I (Exol; New England Biolabs) and incubation at 37°C for 20 minutes. Exol is then inactivated by incubation at 80°C for 20 minutes.
Amplification:
After removal of primer extension primers, the 5 amplification primers (40 pmol of each primer in IX Pfu buffer, final volume 75 μl) are added as follows:
a) Upstream Primer: 5'-gttacaagat tctcacacgc taagg-3' (SEQ ID NO: 19
b) Downstream primers: (distinguishably labeled)
G-primer: 5'-R6G-agttggcgaa gcagtcgcta gaaga-3' (SEQ ID NO: 20) A-primer: 5 '-FAM-gatgctggtg tggctggtgt tcccg-3' (SEQ ID NO: 21)
T-primer: 5'-ROX-ggttggttgc acactggaga tattgg-3' (SEQ ID NO: 22)
C-primer: 5 '-JOE ctggagcatc tggaaaagta gtacc-3' (SEQ ID NO: 23)
Amplification is performed by adding 1 μl of fresh, cloned Pfu polymerase and cycling the reaction as follows: 35 cycles of 94°C for 45 sec, 50°C for 45 sec, and 72°C for 2 min. After each cycle, or at any chosen interval, an aliquot (0.5 μl) is withdrawn and loaded onto a prepared capillary electrophoresis apparatus. Separation is initiated and conducted during the amplification regimen. Amplified primer extension products are detected by fluorescence after separation over the length of the capillary. The signal strength of each fragment can be plotted for each cycle, to generate an amplification profile.
Amplified products are:
Mutant Product size Wild-type polymoφhic Mutant polymoφhic nucleotide nucleotide
3460 249 G A (detected by ROX dye on 249 bp product)
11778 350 G A (detected by ROX dye on 350 bp product)
14459 456 G A (detected by ROX dye on 456 bp product)
The method detailed in this example can be further multiplexed by including an additional upstream primer extension primer for each additional SNP, having the same upstream tag and a 3' region specific for a different SNP-containing fragment of a distinct size from those already included. Each additional SNP interrogated must also have its own set of 4 downstream primers carrying the same set of 4 downstream primer tags, a 3' region that specifically hybridizes adjacent to the SNP, and a variable 3'-tenninal nucleotide that corresponds to the tag sequence.
Further multiplexing can be achieved by including new primer sets with a different set of upstream and downstream tags as described herein above.
OTHER EMBODIMENTS
All patents, patent applications, and published references cited herein are hereby incoφorated by reference in their entirety. While this invention has been particularly shown and described with references to preferred embodiments thereof, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the scope of the invention encompassed by the appended claims.

Claims

1. A method of determining for a given nucleic acid sample, the identity of the nucleotide at a known polymoφhic site, said method comprising:
a) subjecting to an amplification regimen a population of primer extension products generated from a nucleic acid sample, each primer extension product comprising a tag sequence, which tag sequence specifically corresponds to the presence of one specific nucleotide at a known polymoφhic site, wherein said amplification regimen is performed using an upstream amplification primer and a set of distinguishably labeled downstream amplification primers, each member of said set of downstream amplification primers comprising a said tag sequence comprised by a member of said population of primer extension products and a distinguishable label, wherein each distinguishable label specifically corresponds to the presence of a specific nucleotide at said polymorphic site; and
b) detecting incoφoration of a distinguishable label into a nucleic acid molecule, thereby to determine the identity of the nucleotide at said polymoφhic site.
2. The method of claim 1 wherein said distinguishable label is a fluorescent label.
3. The method of claim 2 wherein said step (b) comprises separating nucleic acid molecules made during said amplification regimen by size and/or by charge.
4. The method of claim 3 wherein said separating comprises capillary electrophoresis.
5. The method of claim 1 wherein said amplification regimen comprising at least two amplification reaction cycles, wherein each cycle comprises the steps of: 1) nucleic acid strand separation; 2) oligonucleotide primer annealing; and 3) polymerase extension of annealed primers.
6. The method of claim 5 further comprising the steps, during said amplification regimen and after at least one of said reaction cycles, of removing an aliquot of said amplification reaction, separating nucleic acid molecules by size and/or by charge, and detecting the incoφoration of a said distinguishable label, wherein said detecting determines the identity of the nucleotide at said polymoφhic site.
7. The method of claim 6 wherein said removing, separating and detecting are performed after each cycle in said regimen.
8. The method of claim 6 wherein said separating comprises capillary electrophoresis.
9. The method of claim 1 wherein steps (a) and (b) are performed in a modular apparatus comprising a thermal cycler, a sampling device, a capillary electrophoresis device and a fluorescence detector.
10. The method of claim 1 wherein said tag sequence comprises 15 to 40 nucleotides.
11. The method of claim 1 wherein said set of distinguishably labeled downstream amplification primers consists of: a primer that comprises a tag sequence that specifically corresponds to the presence of A at the polymoφhic site; a primer that comprises a tag sequence that specifically corresponds to the presence of C at the polymoφhic site; a primer that comprises a tag sequence that specifically corresponds to the presence of G at the polymoφhic site; and a primer that comprises a tag sequence that specifically corresponds to the presence of T at the polymoφhic site.
12. The method of claim 1 wherein said set of distinguishably labeled downstream amplification primers consists of a pair of oligonucleotides, one comprising a tag sequence that specifically corresponds to a first allele of the polymoφhic site and one comprising a tag sequence that specifically corresponds to a second allele of the polymoφhic site.
13. The method of claim 1, further comprising the step, before step (a), of removing primers not incoφorated when said population of primer extension products was made.
14. The method of claim 13 wherein said step of removing comprises degrading said primers not incoφorated when said population of primer extension products was made.
15. The method of claim 14, wherein said degrading is performed using a heat labile exonuclease.
16. The method of claim 15 wherein said heat labile exonuclease is selected from the group consisting of Exonuclease I and Exonuclease VII.
17. The method of claim 16 wherein said heat labile exonuclease is thermally inactivated before continuing to step (a).
18. A method of determining, for a given nucleic acid sample, the identities of the nucleotides at a set of known polymoφhic sites to be interrogated, said method comprising:
a) subjecting to an amplification regimen, a population of primer extension products generated from a nucleic acid sample, each primer extension product comprising a member of a set of tag sequences, which tag sequence specifically corresponds to the presence of one specific nucleotide at a known polymoφhic site, wherein said amplification regimen is performed using one upstream amplification primer for each sequence comprising a known polymoφhic site to be interrogated, and a set of distinguishably labeled downstream amplification primers, each member of said set of downstream amplification primers comprising a said tag sequence comprised by a member of said population of primer extension products and a distinguishable label that specifically corresponds to the presence of a specific nucleotide at said polymoφhic site, and wherein said upstream amplification primers are selected such that each polymoφhic site of said set of known polymoφhic sites to be interrogated corresponds to a distinctly sized amplification product;
b) detecting incoφoration of a distinguishable label in distinctly sized amplification products, thereby to determine the identity of the nucleotide at each said polymoφhic site.
19. The method of claim 18 wherein said distinguishable label is a fluorescent label.
20. The method of claim 18 wherein said step (b) comprises separating nucleic acid molecules made during said amplification regimen by size and/or by charge.
21. The method of claim 20 wherein said separating comprises capillary electrophoresis.
22. The method of claim 18 wherein said amplification regimen comprising at least two amplification reaction cycles, wherein each cycle comprises the steps of: 1) nucleic acid strand separation; 2) oligonucleotide primer annealing; and 3) polymerase extension of annealed primers.
23. The method of claim 22 further comprising the steps, during said amplification regimen and after at least one of said reaction cycles, of removing an aliquot of said amplification reaction, separating nucleic acid molecules by size and or by charge, and detecting the incoφoration of a said distinguishable label, wherein said detecting determines the identity of the nucleotide at said polymoφhic site.
24. The method of claim 23 wherein said removing, separating and detecting are performed after each cycle in said regimen.
25. The method of claim 23 wherein said separating comprises capillary electrophoresis.
26. The method of claim 18 wherein steps (a) and (b) are performed in a modular apparatus comprising a thermal cycler, a sampling device, a capillary electrophoresis device and a fluorescent detector.
27. The method of claim 18 wherein said tag sequence comprises 15 to 40 nucleotides.
28. The method of claim 18 wherein said set of distinguishably labeled downstream amplification primers consists of: a subset that comprises a tag sequence that specifically corresponds to the presence of A at the polymoφhic site; a subset that comprises a tag sequence that specifically corresponds to the presence of C at the polymoφhic site; a subset that comprises a tag sequence that specifically corresponds to the presence of G at the polymoφhic site; and a subset that comprises a tag sequence that specifically corresponds to the presence of T at the polymoφhic site.
29. The method of claim 18, further comprising the step, before step (a), of removing primers not incoφorated when said population of primer extension products was made.
30. The method of claim 29 wherein said step of removing comprises degrading said primers not incoφorated when said population of primer extension products was made.
31. The method of claim 30, wherein said degrading is performed using a heat labile exonuclease.
32. The method of claim 31 wherein said heat labile exonuclease is selected from the group consisting of Exonuclease I and Exonuclease VII.
33. The method of claim 32 wherein said heat labile exonuclease is thermally inactivated before continuing to step (a).
34. A method of determining, for a given nucleic acid sample, the identities of the nucleotides at a set of known polymoφhic sites to be interrogated, said method comprising: a) subjecting to an amplification regimen, a population of primer extension products generated from a nucleic acid sample, each primer extension product comprising a first tag sequence or its complement and a member of a set of second tag sequences or its complement, the presence of which second tag sequence or its complement specifically corresponds to the presence of one specific nucleotide at a known polymoφhic site, wherein for each polymoφhic site in said set of polymoφhic sites, said first tag sequence is located at a distinct distance 5' of said polymoφhic site, relative to the distance of said first tag sequence from a polymoφhic site on molecules in said sample containing other polymoφhic sites, wherein said amplification regimen is performed using an upstream amplification primer comprising said first tag sequence, and a set of distinguishably labeled downstream amplification primers, each member of said set of downstream amplification primers comprising a said tag sequence comprised by a member of said population of primer extension products and a distinguishable label that specifically corresponds to the presence of a specific nucleotide at said polymoφhic site, and wherein said upstream amplification primers are selected such that each polymoφhic site of said set of known polymoφhic sites to be interrogated corresponds to a distinctly sized amplification product;
b) detecting incoφoration of a distinguishable label in distinctly sized amplification products, thereby to determine the identity of the nucleotide at each said polymoφhic site.
35. The method of claim 34 wherein said distinguishable label is a fluorescent label.
36. The method of claim 34 wherein said step (b) comprises separating nucleic acid molecules made during said amplification regimen by size and/or by charge.
37. The method of claim 36 wherein said separating comprises capillary electrophoresis.
38. The method of claim 34 wherein said amplification regimen comprising at least two amplification reaction cycles, wherein each cycle comprises the steps of: 1) nucleic acid strand separation; 2) oligonucleotide primer annealing; and 3) polymerase extension of annealed primers.
39. The method of claim 38 further comprising the steps, during said amplification regimen and after at least one of said reaction cycles, of removing an aliquot of said amplification reaction, separating nucleic acid molecules by size and or by charge, and detecting the incoφoration of a said distinguishable label, wherein said detecting determines the identity of the nucleotide at said polymoφhic site.
40. The method of claim 39 wherein said removing, separating and detecting are performed after each cycle in said regimen.
41. The method of claim 39 wherein said separating comprises capillary electrophoresis.
42. The method of claim 34 wherein steps (a) and (b) are performed in a modular apparatus comprising a thermal cycler, a sampling device, a capillary electrophoresis device and a fluorescent detector.
43. The method of claim 34 wherein said tag sequence comprises 15 to 40 nucleotides.
44. The method of claim 34 wherein said set of distinguishably labeled downstream amplification primers consists of: a subset that comprises a tag sequence that specifically corresponds to the presence of A at the polymoφhic site; a subset that comprises a tag sequence that specifically corresponds to the presence of C at the polymoφhic site; a subset that comprises a tag sequence that specifically corresponds to the presence of G at the polymoφhic site; and a subset that comprises a tag sequence that specifically corresponds to the presence of T at the polymoφhic site.
45. The method of claim 34, further comprising the step, before step (a), of removing primers not incoφorated when said population of primer extension products was made.
46. The method of claim 45 wherein said step of removing comprises degrading said primers not incoφorated when said population of primer extension products was made.
47. The method of claim 46, wherein said degrading is performed using a heat labile exonuclease.
48. The method of claim 47 wherein said heat labile exonuclease is selected from the group consisting of Exonuclease I and Exonuclease VII.
49. The method of claim 48 wherein said heat labile exonuclease is thermally inactivated before continuing to step (a).
50. A method of determining the identity of a single nucleotide at a known polymoφhic site, said method comprising:
I) providing a nucleic acid sample comprising said polymoφhic site; II) separating the strands of said nucleic acid sample and re-annealing in the presence of:
a) a first oligonucleotide primer comprising a 3 ' region that hybridizes to a sequence at a known distance upstream of said known polymoφhic site, said first oligonucleotide primer comprising a first sequence tag located 5' of said 3' region; and
b) a set of second oligonucleotide primers, wherein each member of said set comprises:
i) a region that hybridizes 3' of and adjacent to said polymoφhic site;
ii) a variable 3 ' terminal nucleotide, wherein, when said member is hybridized to said known sequence, said 3' terminal nucleotide is opposite said polymoφhic site, and wherein, if and only if said 3 ' terminal nucleotide is complementary to the nucleotide at said polymoφhic site, said 3' terminal nucleotide base pairs with said nucleotide at said polymoφhic site; and
iii) a tag sequence that corresponds to said varaible 3 '-terminal nucleotide of (ii), said tag sequence located 5' of the region of (i) on said member;
III) contacting the annealed oligonucleotides resulting from step (II) with a nucleic acid polymerase under conditions that permit the extension of an annealed oligonucleotide such that extension products are generated, wherein the primer extension product from the first oligonucleotide primer, when separated from its complement, can serve as a template for the synthesis of the extension product of a member of the set of second oligonucleotide primers, and vice versa;
TV) repeating strand separating and contacting steps (II) and (III) two times, such that a population of nucleic acid molecules is generated that comprises both a sequence identical to or complementary to said first oligonucleotide and a sequence identical to or complementary to one of the members of said second set of oligonucleotides;
V) contacting the population generated in step (IV) with a heat-labile exonuclease under conditions permitting the degradation of non-annealed oligonucleotide primers, such that said primers are degraded;
VI) thermally inactivating said heat-labile exonuclease; VJJ) subjecting said population of nucleic acid molecules to an amplification regimen, wherein said amplification regimen is performed using an upstream amplification primer comprising the first sequence tag comprised by said first oligonucleotide primer, and a set of downstream amplification primers, each member of said set of downstream amplification primers comprising a tag comprised by a member of said set of second oligonucleotide primers and a distinguishable label; and
VIII) detecting incoφoration of at least one distinguishable label, thereby determining the identity of the nucleotide at said known polymorphic site.
51. The method of claim 50 wherein said distinguishable label is a fluorescent label.
52. The method of claim 50 wherein said step (VIII) comprises separating nucleic acid molecules made during said amplification regimen by size and/or by charge.
53. The method of claim 50 wherein said separating comprises capillary electrophoresis.
54. The method of claim 50 wherein said amplification regimen comprising at least two amplification reaction cycles, wherein each cycle comprises the steps of: 1) nucleic acid strand separation; 2) oligonucleotide primer annealing; and 3) polymerase extension of annealed primers.
55. The method of claim 54 further comprising the steps, during said amplification regimen and after at least one of said reaction cycles, of removing an aliquot of said amplification reaction, separating nucleic acid molecules by size and/or by charge, and detecting the incoφoration of a said distinguishable label, wherein said detecting determines the identity of the nucleotide at said polymoφhic site.
56. The method of claim 55 wherein said removing, separating and detecting are performed after each cycle in said regimen.
57. The method of claim 50 wherein steps I- VIII are performed in a modular apparatus comprising a thermal cycler, a sampling device, a capillary electrophoresis device and a fluorescent detector.
58. The method of claim 50 wherein said tag sequences each comprise 15 to 40 nucleotides.
59. The method of claim 50 wherein said 3 ' region that hybridizes to a sequence at a known distance upstream of said known polymorphic site comprises 10-30 nucleotides.
60. The method of claim 50 wherein said region that hybridizes 3 ' of and adjacent to said polymoφhic site comprises 10-30 nucleotides.
61. The method of claim 50 wherein said set of downstream amplification primers consists of: a subset that comprises a tag sequence that specifically corresponds to the presence of A at the polymoφhic site; a subset that comprises a tag sequence that specifically corresponds to the presence of C at the polymoφhic site; a subset that comprises a tag sequence that specifically corresponds to the presence of G at the polymoφhic site; and a subset that comprises a tag sequence that specifically corresponds to the presence of T at the polymoφhic site.
62. A method of determining the identities of single nucleotides present at a group of known polymoφhic sites, said method comprising:
I) providing a nucleic acid sample comprising said group of polymoφhic sites;
II) separating the strands of said nucleic acid sample and re-annealing in the presence of:
a) a set of first oligonucleotide primers each comprising a 3 ' region that hybridizes to a sequence at a known distance upstream of a known polymoφhic site, each member of said set of first oligonucleotide primers comprising a common sequence tag located 5' of said 3' region, and each member of said set of first oligonucleotide primers selected such that a distinctly sized amplification product is generated for each polymoφhic site in said group of known polymorphic sites; and
b) a set of downstream amplification primers comprising, in 5' to 3' order:
i) a sequence tag selected from the group consisting of a tag specifically corresponding to G as the 3 '-terminal nucleotide of said primer; a tag specifically corresponding to A as the 3 '-terminal nucleotide of said primer; a tag specifically corresponding to T as the 3'- terminal nucleotide of said primer; and a tag specifically corresponding to C as the 3 '-terminal nucleotide of said primer;
ii) a region that specifically hybridizes to a sequence adjacent to and 3' of a polymoφhic site in said group of polymoφhic sites, wherein said set of downstream amplification primers comprises a subset of primers comprising a region that specifically hybridizes adjacent to said polymoφhic site for each polymoφhic site in said group of polymoφhic sites; and
iii) a 3' terminal nucleotide selected from G, A, T or C, wherein said terminal nucleotide specifically corresponds to the sequence tag described in (i) on that downstream amplification primer, and wherein when said downstream amplification primer is hybridized to said sequence adjacent to and 3' of a polymoφhic site, said 3' terminal nucleotide is opposite said polymoφhic site;
III) contacting the annealed oligonucleotides resulting from step (II) with a nucleic acid polymerase under conditions that permit the extension of an annealed oligonucleotide such that extension products are generated, wherein the primer extension product from the first oligonucleotide primer, when separated from its complement, can serve as a template for the synthesis of the extension product of as member of the set of second oligonucleotide primers, and vice versa;
IN) repeating strand separating and contacting steps (II) and (III) two times, such that a reaction mixture comprising a population of nucleic acid molecules is generated that comprises both a sequence identical to or complementary to said first oligonucleotide and a sequence identical to or complementary to a member of said set of downstream amplification primers;
V) contacting the population gnerated in step (IV) with a heat-labile exonuclease under conditions peraaitting the degradation of non-annealed oligonucleotide primers, such that non- annealed primers are degraded;
VI) thermally inactivating said heat-labile exonuclease;
VII) subjecting said population of nucleic acid molecules to an amplification regimen, wherein said amplification regimen is performed using an upstream amplification primer comprising the common sequence tag comprised by said first oligonucleotide primer, and a set of downstream amplification primers, each member of said set of downstream amplification primers comprising a tag comprised by a member of said set of second oligonucleotide primers and a distinguishable label; and
VIII) detecting incoφoration of at least one distinguishable label, thereby determining the identities of the nucleotides present at said known polymoφhic sites.
63. The method of claim 62 wherein said distinguishable label is a fluorescent label.
64. The method of claim 62 wherein said step (VTfi) comprises separating nucleic acid molecules made during said amplification regimen by size and/or by charge.
65. The method of claim 64 wherein said separating comprises capillary electrophoresis.
66. The method of claim 62 wherein said amplification regimen comprising at least two amplification reaction cycles, wherein each cycle comprises the steps of: 1) nucleic acid strand separation; 2) oligonucleotide primer annealing; and 3) polymerase extension of annealed primers.
67. The method of claim 66 further comprising the steps, during said amplification regimen and after at least one of said reaction cycles, of removing an aliquot of said amplification reaction, separating nucleic acid molecules by size and/or by charge, and detecting the incoφoration of a said distinguishable label, wherein said detecting deteraiines the identity of the nucleotide at said polymoφhic site.
68. The method of claim 67 wherein said removing, separating and detecting are performed after each cycle in said regimen.
69. The method of claim 62 wherein steps I- VIII are performed in a modular apparatus comprising a thermal cycler, a sampling device, a capillary electrophoresis device and a fluorescent detector.
70. The method of claim 62 wherein said tag sequences each comprise 15 to 40 nucleotides.
71. The method of claim 62 wherein said 3 ' region that hybridizes to a sequence at a known distance upstream of said known polymoφhic site comprises 10-30 nucleotides.
72. The method of claim 62 wherein said region that hybridizes 3 ' of and adjacent to said polymoφhic site comprises 10-30 nucleotides.
73. The method of claim 62 wherein said set of distinguishably labeled downstream amplification primers consists of: a subset that comprises a tag sequence that specifically corresponds to the presence of A at the polymoφhic site; a subset that comprises a tag sequence that specifically corresponds to the presence of C at the polymoφhic site; a subset that comprises a tag sequence that specifically corresponds to the presence of G at the polymoφhic site; and a subset that comprises a tag sequence that specifically corresponds to the presence of T at the polymoφhic site.
74. A kit for the detemiination of the nucleotide present at a polymoφhic site present on a nucleic acid sample, said kit comprising a set of upstream primers comprising:
a) a first primer comprising a 5 '-tag sequence and 3' sequence sufficient to specifically hybridize at a known distance upstream of a known polymoφhic site; and
b) a set of 4 downstream second primers, comprising in 5' to 3' order:
i) a sequence tag selected from the group consisting of a tag specifically corresponding to G as the 3 '-terminal nucleotide of said primer; a tag specifically corresponding to A as the 3'- terminal nucleotide of said primer; a tag specifically corresponding to T as the 3 '-terminal nucleotide of said primer; and a tag specifically corresponding to C as the 3 '-terminal nucleotide of said primer;
ii) a region that specifically hybridizes to a sequence adjacent to and 3 ' of a polymoφhic site in said group of polymoφhic sites, wherein said set of downstream amplification primers comprises a subset of primers comprising a region that specifically hybridizes adjacent to said polymorphic site for each polymoφhic site in said group of polymoφhic sites; and
iii) a 3 ' terminal nucleotide selected from G, A, T or C, wherein said terminal nucleotide specifically corresponds to the sequence tag described in (i) on that downstream amplification primer, and wherein when said downstream amplification primer is hybridized to said sequence adjacent to and 3 ' of a polymoφhic site, said 3 ' terminal nucleotide is opposite said polymoφhic site.
75. The kit of claim 74, further comprising a set of 5 primers lacking sequence specific for a gene in the genome of the organism being examined for polymoφhisms, said primers comprising a primer comprising the tag sequence of said first primer and a set of four distinguishably labeled primers comprising the tag sequences of said set of four downstream second primers.
PCT/US2003/019576 2002-06-28 2003-06-20 Methods of detecting sequence differences WO2004003136A2 (en)

Priority Applications (8)

Application Number Priority Date Filing Date Title
AU2003243700A AU2003243700B2 (en) 2002-06-28 2003-06-20 Methods of detecting sequence differences
EP03761972A EP1534859B1 (en) 2002-06-28 2003-06-20 Methods of detecting sequence differences
DE60335116T DE60335116D1 (en) 2002-06-28 2003-06-20 METHOD FOR DETECTING SEQUENCE DIFFERENCES
CN03815401.3A CN1665938B (en) 2002-06-28 2003-06-20 Methods of detecting sequence differences
JP2004517727A JP4617403B2 (en) 2002-06-28 2003-06-20 How to detect sequence differences
AT03761972T ATE489477T1 (en) 2002-06-28 2003-06-20 METHOD FOR DETECTING SEQUENCE DIFFERENCES
CA2490864A CA2490864C (en) 2002-06-28 2003-06-20 Methods of detecting sequence differences
AU2009200987A AU2009200987A1 (en) 2002-06-28 2009-03-12 Methods of Detecting Sequence Differences

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US39233102P 2002-06-28 2002-06-28
US60/392,331 2002-06-28

Publications (2)

Publication Number Publication Date
WO2004003136A2 true WO2004003136A2 (en) 2004-01-08
WO2004003136A3 WO2004003136A3 (en) 2004-08-19

Family

ID=30000848

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2003/019576 WO2004003136A2 (en) 2002-06-28 2003-06-20 Methods of detecting sequence differences

Country Status (10)

Country Link
US (2) US20040096870A1 (en)
EP (1) EP1534859B1 (en)
JP (1) JP4617403B2 (en)
KR (1) KR101038137B1 (en)
CN (1) CN1665938B (en)
AT (1) ATE489477T1 (en)
AU (2) AU2003243700B2 (en)
CA (1) CA2490864C (en)
DE (1) DE60335116D1 (en)
WO (1) WO2004003136A2 (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100400673C (en) * 2004-03-18 2008-07-09 华晶基因技术有限公司 Method for detecting mononucleotide polymorphism and point mutation
EP1947196A1 (en) * 2005-11-08 2008-07-23 Olympus Corporation Method of amplifying multiple nucleic acid sequences for differentiation
US7615620B2 (en) 2005-05-28 2009-11-10 Kbiosciences Limited Detection system for PCR assay
WO2010146339A1 (en) * 2009-06-15 2010-12-23 Bg Research Ltd Nucleic acid detection
ITRM20100527A1 (en) * 2010-10-08 2012-04-09 Bios Internat S R L FLUORESCENT METHOD FOR SCREENING OF HLA-B * 57: 01
WO2016026574A1 (en) * 2014-08-19 2016-02-25 Arcticzymes As Thermolabile exonucleases
US10011871B2 (en) 2012-02-17 2018-07-03 Fred Hutchinson Cancer Research Center Compositions and methods for accurately identifying mutations
US11584958B2 (en) 2017-03-31 2023-02-21 Grail, Llc Library preparation and use thereof for sequencing based error correction and/or variant identification

Families Citing this family (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040091905A1 (en) * 2002-07-01 2004-05-13 Baochuan Guo Method for detecting mutated polynucleotides within a large population of wild-type polynucleotides
US20080020386A1 (en) * 2006-03-29 2008-01-24 Medigen Biotechnology Corporation Methods and apparatus for genotyping
FI20075124A0 (en) * 2007-02-21 2007-02-21 Valtion Teknillinen Method and test kit for detection of nucleotide variations
EP2705159B1 (en) * 2011-05-06 2018-03-21 Qiagen GmbH Methods for sequencing, amplification and detection of nucleic acids comprising internally labelled primer
EP2707507B1 (en) * 2011-05-09 2017-11-01 Fluidigm Corporation Probe based nucleic acid detection
EP2817627A2 (en) 2012-02-21 2014-12-31 Oslo Universitetssykehus HF Methods and biomarkers for detection and prognosis of cervical cancer
US9938578B2 (en) * 2012-09-24 2018-04-10 iRepertoire, Inc. Multiplex pyrosequencing using non-interfering noise cancelling polynucleotide identification tags
CA2933387C (en) 2013-12-11 2023-05-02 The Regents Of The University Of California Methods for labeling dna fragments to reconstruct physical linkage and phase
AU2015296029B2 (en) * 2014-08-01 2022-01-27 Dovetail Genomics, Llc Tagging nucleic acids for sequence assembly
SG11201706730XA (en) 2015-02-17 2017-09-28 Dovetail Genomics Llc Nucleic acid sequence assembly
GB2554572B (en) 2015-03-26 2021-06-23 Dovetail Genomics Llc Physical linkage preservation in DNA storage
AU2016341198B2 (en) 2015-10-19 2023-03-09 Dovetail Genomics, Llc Methods for genome assembly, haplotype phasing, and target independent nucleic acid detection
KR20180116377A (en) 2016-02-23 2018-10-24 더브테일 제노믹스 엘엘씨 Generation and Hexagonal Paging of a Paged Read Set for Genome Assemblies
IL262946B2 (en) 2016-05-13 2023-03-01 Dovetail Genomics Llc Recovering long-range linkage information from preserved samples

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4683195A (en) 1986-01-30 1987-07-28 Cetus Corporation Process for amplifying, detecting, and/or-cloning nucleic acid sequences
US4683202A (en) 1985-03-28 1987-07-28 Cetus Corporation Process for amplifying nucleic acid sequences
US4800159A (en) 1986-02-07 1989-01-24 Cetus Corporation Process for amplifying, detecting, and/or cloning nucleic acid sequences
US4914210A (en) 1987-10-02 1990-04-03 Cetus Corporation Oligonucleotide functionalizing reagents
US5963456A (en) 1992-07-17 1999-10-05 Beckman Instruments, Inc. Method and apparatus for displaying capillary electrophoresis data
US6001230A (en) 1988-04-29 1999-12-14 Beckman Coulter, Inc. Automated capillary electrophoresis apparatus
US6156178A (en) 1999-07-13 2000-12-05 Molecular Dynamics, Inc. Increased throughput analysis of small compounds using multiple temporally spaced injections
US6217731B1 (en) 1998-10-21 2001-04-17 Spectrumedix Corporation Method and apparatus for monitoring and displaying the status of a parallel capillary electrophoresis device
US6270973B1 (en) 1998-03-13 2001-08-07 Promega Corporation Multiplex method for nucleic acid detection
US6312893B1 (en) 1996-01-23 2001-11-06 Qiagen Genomics, Inc. Methods and compositions for determining the sequence of nucleic acid molecules
US6322980B1 (en) 1999-04-30 2001-11-27 Aclara Biosciences, Inc. Single nucleotide detection using degradation of a fluorescent sequence

Family Cites Families (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5521301A (en) * 1988-12-12 1996-05-28 City Of Hope Genotyping of multiple allele systems
US5639611A (en) * 1988-12-12 1997-06-17 City Of Hope Allele specific polymerase chain reaction
US5856092A (en) * 1989-02-13 1999-01-05 Geneco Pty Ltd Detection of a nucleic acid sequence or a change therein
US6013431A (en) * 1990-02-16 2000-01-11 Molecular Tool, Inc. Method for determining specific nucleotide variations by primer extension in the presence of mixture of labeled nucleotides and terminators
US5210015A (en) * 1990-08-06 1993-05-11 Hoffman-La Roche Inc. Homogeneous assay system using the nuclease activity of a nucleic acid polymerase
US5846710A (en) * 1990-11-02 1998-12-08 St. Louis University Method for the detection of genetic diseases and gene sequence variations by single nucleotide primer extension
US6004744A (en) * 1991-03-05 1999-12-21 Molecular Tool, Inc. Method for determining nucleotide identity through extension of immobilized primer
US5888819A (en) * 1991-03-05 1999-03-30 Molecular Tool, Inc. Method for determining nucleotide identity through primer extension
US5981176A (en) * 1992-06-17 1999-11-09 City Of Hope Method of detecting and discriminating between nucleic acid sequences
US5710028A (en) * 1992-07-02 1998-01-20 Eyal; Nurit Method of quick screening and identification of specific DNA sequences by single nucleotide primer extension and kits therefor
US5538848A (en) * 1994-11-16 1996-07-23 Applied Biosystems Division, Perkin-Elmer Corp. Method for detecting nucleic acid amplification using self-quenching fluorescence probe
US5512441A (en) * 1994-11-15 1996-04-30 American Health Foundation Quantative method for early detection of mutant alleles and diagnostic kits for carrying out the method
US5710628A (en) * 1994-12-12 1998-01-20 Visible Genetics Inc. Automated electrophoresis and fluorescence detection apparatus and method
US6015675A (en) * 1995-06-06 2000-01-18 Baylor College Of Medicine Mutation detection by competitive oligonucleotide priming
JP4000605B2 (en) * 1996-07-24 2007-10-31 株式会社日立製作所 DNA sample preparation device and electrophoretic analyzer using the same
US5863736A (en) * 1997-05-23 1999-01-26 Becton, Dickinson And Company Method, apparatus and computer program products for determining quantities of nucleic acid sequences in samples
US6207031B1 (en) * 1997-09-15 2001-03-27 Whitehead Institute For Biomedical Research Methods and apparatus for processing a sample of biomolecular analyte using a microfabricated device
DE69840416D1 (en) * 1997-10-28 2009-02-12 Los Alamos Nat Security Llc Identification of DNA polymorphisms by flow cytometry
US6074831A (en) * 1998-07-09 2000-06-13 Agilent Technologies, Inc. Partitioning of polymorphic DNAs
US7115365B2 (en) * 1998-12-07 2006-10-03 Olympus Optical Co., Ltd. Method of analyzing a target nucleic acid
US6287778B1 (en) * 1999-10-19 2001-09-11 Affymetrix, Inc. Allele detection using primer extension with sequence-coded identity tags
US6618679B2 (en) * 2000-01-28 2003-09-09 Althea Technologies, Inc. Methods for analysis of gene expression
JP2003530893A (en) * 2000-04-25 2003-10-21 ディーエヌエー サイエンシーズ インコーポレーテッド Detection of nucleotide sequence variation by proofreading activity of polymerase
US6482615B2 (en) * 2001-03-02 2002-11-19 Integrated Genetic Devices Ltd. Method and apparatus for effecting rapid thermal cycling of samples in microtiter plate size
WO2003002752A2 (en) * 2001-06-29 2003-01-09 Rubicon Genomics Inc. Methods of using nick translate libraries for snp analysis
WO2003020983A1 (en) * 2001-08-30 2003-03-13 Virginia Commonwealth University Allele specific pcr for genotyping
WO2003060159A2 (en) * 2002-01-15 2003-07-24 Matforsk Methods of nucleic acid amplification

Patent Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4683202A (en) 1985-03-28 1987-07-28 Cetus Corporation Process for amplifying nucleic acid sequences
US4683202B1 (en) 1985-03-28 1990-11-27 Cetus Corp
US4683195B1 (en) 1986-01-30 1990-11-27 Cetus Corp
US4683195A (en) 1986-01-30 1987-07-28 Cetus Corporation Process for amplifying, detecting, and/or-cloning nucleic acid sequences
US4800159A (en) 1986-02-07 1989-01-24 Cetus Corporation Process for amplifying, detecting, and/or cloning nucleic acid sequences
US4914210A (en) 1987-10-02 1990-04-03 Cetus Corporation Oligonucleotide functionalizing reagents
US6001230A (en) 1988-04-29 1999-12-14 Beckman Coulter, Inc. Automated capillary electrophoresis apparatus
US5963456A (en) 1992-07-17 1999-10-05 Beckman Instruments, Inc. Method and apparatus for displaying capillary electrophoresis data
US6312893B1 (en) 1996-01-23 2001-11-06 Qiagen Genomics, Inc. Methods and compositions for determining the sequence of nucleic acid molecules
US6270973B1 (en) 1998-03-13 2001-08-07 Promega Corporation Multiplex method for nucleic acid detection
US6217731B1 (en) 1998-10-21 2001-04-17 Spectrumedix Corporation Method and apparatus for monitoring and displaying the status of a parallel capillary electrophoresis device
US6322980B1 (en) 1999-04-30 2001-11-27 Aclara Biosciences, Inc. Single nucleotide detection using degradation of a fluorescent sequence
US6156178A (en) 1999-07-13 2000-12-05 Molecular Dynamics, Inc. Increased throughput analysis of small compounds using multiple temporally spaced injections

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
"PCR Protocols: A Guide to Methods and Applications", 1990, ACADEMIC PRESS, IND.
KANEHISA, M., POLYNUCLEOTIDE RES, vol. 12, 1984, pages 203
MULLIS; FALOONA, METHODS ENZYMOL., vol. 155, 1987, pages 335
SAIKI ET AL., SCIENCE, vol. 230, 1985, pages 1350

Cited By (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100400673C (en) * 2004-03-18 2008-07-09 华晶基因技术有限公司 Method for detecting mononucleotide polymorphism and point mutation
US7615620B2 (en) 2005-05-28 2009-11-10 Kbiosciences Limited Detection system for PCR assay
EP1726664B1 (en) * 2005-05-28 2010-01-27 KBiosciences Ltd. Detection system for PCR assay
EP1947196A1 (en) * 2005-11-08 2008-07-23 Olympus Corporation Method of amplifying multiple nucleic acid sequences for differentiation
EP1947196A4 (en) * 2005-11-08 2009-09-09 Olympus Corp Method of amplifying multiple nucleic acid sequences for differentiation
WO2010146339A1 (en) * 2009-06-15 2010-12-23 Bg Research Ltd Nucleic acid detection
US20120183965A1 (en) * 2009-06-15 2012-07-19 David Ward Nucleic acid detection
ITRM20100527A1 (en) * 2010-10-08 2012-04-09 Bios Internat S R L FLUORESCENT METHOD FOR SCREENING OF HLA-B * 57: 01
US10450606B2 (en) 2012-02-17 2019-10-22 Fred Hutchinson Cancer Research Center Compositions and methods for accurately identifying mutations
US11441180B2 (en) 2012-02-17 2022-09-13 Fred Hutchinson Cancer Center Compositions and methods for accurately identifying mutations
US10011871B2 (en) 2012-02-17 2018-07-03 Fred Hutchinson Cancer Research Center Compositions and methods for accurately identifying mutations
WO2016026574A1 (en) * 2014-08-19 2016-02-25 Arcticzymes As Thermolabile exonucleases
US10415082B2 (en) 2014-08-19 2019-09-17 Arcticzymes As Thermolabile exonucleases
US10087483B2 (en) 2014-08-19 2018-10-02 Arcticzymes As Thermolabile exonucleases
US10787702B2 (en) 2014-08-19 2020-09-29 Arcticzymes As Thermolabile exonucleases
KR102432501B1 (en) 2014-08-19 2022-08-12 아크틱자임즈 에이에스 Thermolabile exonucleases
KR20180040736A (en) * 2014-08-19 2018-04-20 아크틱자임즈 에이에스 Thermolabile exonucleases
US11584958B2 (en) 2017-03-31 2023-02-21 Grail, Llc Library preparation and use thereof for sequencing based error correction and/or variant identification

Also Published As

Publication number Publication date
CA2490864A1 (en) 2004-01-08
KR20050034651A (en) 2005-04-14
JP2005531314A (en) 2005-10-20
AU2003243700A1 (en) 2004-01-19
AU2009200987A1 (en) 2009-04-09
CN1665938A (en) 2005-09-07
CA2490864C (en) 2012-09-11
US20110300537A1 (en) 2011-12-08
AU2003243700B2 (en) 2009-04-30
WO2004003136A3 (en) 2004-08-19
JP4617403B2 (en) 2011-01-26
EP1534859B1 (en) 2010-11-24
DE60335116D1 (en) 2011-01-05
KR101038137B1 (en) 2011-05-31
EP1534859A4 (en) 2007-09-05
US20040096870A1 (en) 2004-05-20
EP1534859A2 (en) 2005-06-01
ATE489477T1 (en) 2010-12-15
CN1665938B (en) 2012-10-10

Similar Documents

Publication Publication Date Title
US20110300537A1 (en) Methods of detecting sequence differences
EP1706488B1 (en) Method of performing pcr amplification on a microarray
EP1362928B1 (en) Genotyping by amplicon melting curve analysis
JP6081366B2 (en) MPCR method for analyzing repeat sequences
Wittwer et al. Nucleic acid techniques
Pierce et al. Linear-after-the-exponential polymerase chain reaction and allied technologies: Real-time detection strategies for rapid, reliable diagnosis from single cells
AU2003247715B8 (en) Methods and compositions for analyzing compromised samples using single nucleotide polymorphism panels
Bannai et al. Single-nucleotide-polymorphism genotyping for whole-genome-amplified samples using automated fluorescence correlation spectroscopy
De Silva et al. Monitoring hybridization during polymerase chain reaction
US20030235827A1 (en) Methods and compositions for monitoring primer extension and polymorphism detection reactions
US20030077584A1 (en) Methods and compositons for bi-directional polymorphism detection
US20220340981A1 (en) Probe and method for str-genotyping
US7101673B2 (en) Method for detecting a nucleic acid comprising asymmetrical amplification
US20020058271A1 (en) Process for detecting a nucleic acid target
US20040185477A1 (en) Methods of detecting differences in genomic sequence representation
Eggerding Oligonucleotide Ligation Assay

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NI NO NZ OM PH PL PT RO RU SC SD SE SG SK SL TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
WWE Wipo information: entry into national phase

Ref document number: 2490864

Country of ref document: CA

Ref document number: 2003243700

Country of ref document: AU

WWE Wipo information: entry into national phase

Ref document number: 758/MUMNP/2004

Country of ref document: IN

Ref document number: 2004517727

Country of ref document: JP

WWE Wipo information: entry into national phase

Ref document number: 1020047021365

Country of ref document: KR

Ref document number: 20038154013

Country of ref document: CN

WWE Wipo information: entry into national phase

Ref document number: 2003761972

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 1020047021365

Country of ref document: KR

WWP Wipo information: published in national office

Ref document number: 2003761972

Country of ref document: EP