Recent developments in primer design for DNA polymorphism and mRNA profiling in higher plants
© Yang et al; licensee BioMed Central Ltd. 2006
Received: 14 January 2006
Accepted: 01 March 2006
Published: 01 March 2006
Skip to main content
© Yang et al; licensee BioMed Central Ltd. 2006
Received: 14 January 2006
Accepted: 01 March 2006
Published: 01 March 2006
Primer design is a critical step in the application of PCR-based technologies in gene expression and genetic diversity analysis. As more plant genomes have been sequenced in recent years, the emphasis of primer design strategy has shifted to genome-wide and high-throughput direction. This paper summarizes recent advances in primer design for profiling of DNA polymorphism and mRNA in higher plants, as well as new primer systems developed for animals that can be adapted for plants.
mRNA profiling is very important for identifying new genes, determining gene function and elucidating genetic networks. Differential display [1, 2], cDNA amplified fragment length polymorphism (cDNA-AFLP) [3–5], and microarray  technologies have been employed extensively for profiling of plant mRNAs. DNA polymorphism profiling is essential for gene mapping, marker-assisted selection of crop plants, and molecular diversity studies. PCR-based techniques such as AFLP and microsatellites or simple sequence repeats (SSRs) have also played important roles in plant DNA profiling. Primers are essential components of PCR-based systems as well as modern microarray systems which utilize appropriate probes that are obtained by PCR amplification. This paper summarizes recent advances in primer design for profiling of DNA polymorphism and mRNA in higher plants, as well as new primer systems developed for animals that can be adapted for plants.
Free programs for PCR primer design.
General primer design
Primo Pro 3.4
Primer design based upon multi-alignments
Primer design for single-nucleotide polymorphism (SNPs)
Primo SNP 3.4
Primer design for Simple Sequence Repeat (SSR)
Primer design for Microarrays
Oligonucleotide properties calculation
Primer design for differential display.
Traditional differential display system
The XXXXXXX was designed to target the mRNA sequences with a good coverage of mRNA species in the sample.
Annealing control primer system
The XXXXXXXXXX was designed to target the mRNA sequences with a good coverage of mRNA species in the sample.
Multiplex differential display system
5'-CTTNNXXXXXXXX-3' (N = A, C, G, or T)
The XXXXXXXX was designed to target the mRNA sequences with a good coverage of mRNA species in the sample.
Hwang et al.  developed an annealing control primer (ACP) system that is comprised of a tripartite structure with a polydeoxyinosine [poly(dI)] linker between the 3' end target core sequence and the 5' end non-target universal sequence. This ACP linker prevents annealing of the 5' end non-target sequence to the template and facilitates primer hybridization at the 3' end to the target sequence at specific temperatures, resulting in a dramatic improvement of annealing specificity. This system was recently adapted for the identification of differentially expressed genes involved in mouse development . The primer design of ACP is shown in Table 2. This system could be easily adapted for mRNA profiling in plants by substituting the 3' end animal-targeting sequences for those targeting plant mRNAs. To evaluate our ability to improve annealing specificity, our laboratory also designed a multiplex DD system, in which the 5' primers were designed as 5'-CTTNN-eight mRNA specific bases – 3' (N = A, C, G, or T) (Table 2). The rationale for this primer structure is that in a PCR reaction one 3' primer is used with a mixture of sixteen 5' primers that share the eight 3' bases but differ in the "N" wobble sites. Under high stringency PCR conditions, each of the sixteen 5' primers binds preferentially to a group of mRNA species that perfectly match the eight 3' bases of the 5' primers, reducing competition for amplification among cDNAs, and consequently increasing the chance of detecting a specific transcript by a given 5' primer . Both the primers in ACP and the primers designed by ourselves are longer than those in the original DD . Generally, the annealing temperature increases with the length of the primer. With the original DD annealing temperature, 40–50°C , the size changes of these modified primers may result in the improvement of PCR efficiency so that DD could produce more strong bands in the DD gels.
The 3' eight mRNA specific bases for the 5' primer set for Multiplex DD .
In cDNA-AFLP, cDNA samples are digested with two different restriction enzymes, adapters are attached to the specific ends of the resulting fragments, and the fragments are amplified using primers homologous to the adaptors with an extension of additional selective nucleotides. Thus, for each selective primer pair only the fragments whose ends match the primer extensions get amplified and these fragments form a pool. Finally, the fragments in each pool are separated by electrophoresis . The primer design for cDNA-AFLP depends on the choice of restriction enzymes and 3'selective sequences. Unfortunately, one pair of enzymes does not in practice produce a fragment for every cDNA molecule that could be amplified and detected by electrophoresis. The fragments generated from a particular cDNA can be too long or too short to be revealed by electrophoresis. One pair of restriction enzymes generally covers up to two-thirds of the transcripts in a species [4, 21, 22]. In cDNA-AFLP performed with plant samples, two selective bases on the 3' end of each primer are required to give a scorable banding pattern, giving a total of 256 (16 × 16) possible primer combinations .
Recently, Wang and Bughrara  found that for Festuca species, restriction enzyme Nsp I coupled with Taq I generated a much higher number of transcript-derived fragments than the commonly used enzyme pair of Eco RI and Taq I. The enzyme Nsp I has two degenerate bases in its recognition sequence (R CATGY). This enzyme can cut the cDNA more frequently than Eco RI, and generate more bands in the cDNA-AFLP gels. An additional advantage of using enzyme pair of Nsp I and Taq I is that the possible selective primer combinations were 128 (8 × 16), only half of those for Eco RI/Taq I (16 × 16). To achieve a 90% coverage of mRNAs in a species, it is necessary to increase the number of enzyme combinations up to 4 . In a cDNA-AFLP system with 4 enzyme combinations and two selective bases on the 3' end of each primer, there are 1,024 (4 × 256) possible primer combinations. PCR analysis of all the 1,024 possible primer combinations would be time-consuming and costly. In addition, a portion of the mRNA species could be sampled by two or more enzyme combinations, resulting in a somewhat wasteful usage of resources. Fortunately, several computer programs have recently been developed to perform in silico simulation of cDNA-AFLP using available sequencing data [21, 23, 24]. With the assumption that the real target genome has roughly the same characteristics as the sequence data available , it is possible to find appropriate enzyme combinations to increase mRNA coverage while decreasing selective primer combinations by reducing redundant coverage of the same mRNA species. This could be achieved by simulating cDNA-AFLP in silico for the available genome sequencing data in A. thaliana , O. sativa [26–28] and Populus trichocarpa  as well as plant EST data in the public domains.
DNA microarrays provide powerful tools for global mRNA profiling. Despite widespread use, recent studies have demonstrated discordance among data produced by different microarray platforms and approaches [30–33]. For example, Tan et al.  reported that from a set of 185 common genes in PANC-1 cells, only four behaved consistently on three major commercial microarray platforms from Affymetrix, Agilent and Amersham. One major reason for this is due to the fact that probes have not generally been designed in the past for specificity with gene-splice variants. It is encouraging that companies are now beginning to make arrays specific to different splice variants [31, 34, 35]. The discordance among different microarray platforms can also be caused by cross-hybridization of highly similar sequences . Possible choices of probe types include spotted cDNA sequences or PCR products, several hundred to thousand base pairs in length, short (25–30 mer) oligonucleotides or longer (60–70 mer) oligonucleotide reporters . Based on theoretical considerations that were confirmed experimentally, it appears that 150-mer is the optimal probe length for expression measurement , and thus PCR primers can be designed to amplify the 150-mer gene-specific probes.
Cross-species comparisons of gene expression are important for identifying functionally related genes, because if a set of genes displays similar expression patterns in several species, the probability that the genes are functionally related, rather than co-expressed by chance, increases. Microarray experiments using probes covering a whole transcriptome are expensive to initiate, and a major part of the cost arises from the synthesis of gene-specific PCR primers or hybridization probes . Andersson et al.  developed a method to reduce the number of primers required to amplify the genes of two different genomes. In this method, regions of high sequence similarity were identified, and from these regions PCR primers shared between the genomes were selected, such that either one or, preferentially, both primers in a given PCR were used for amplification from both genomes. This method could be used to design PCR primers for amplification of microarray probes shared by A. thaliana and P. trichocarpa, or A. thaliana and O. sativa, or P. trichocarpa and O. sativa.
Recently, a series of SRAP primers were designed by Li and Quiros  for the amplification of open reading frames (ORFs). This system is based on two-primer amplification. The 17 or 18-mer primers consist of the following elements: core sequences, which are 13 – 14 bases long, where the first 10 or 11 bases starting at the 5' end are sequences of no specific constitution ("filler" sequences) followed by the sequence CCGG in the forward primer and AATT in the reverse primer. The core is followed by three selective nucleotides at the 3' end. The filler sequences of the forward and reverse primers must be different from each other and can be 10 or 11 bases long. The SRAP system has been successfully utilized to profile DNA polymorphism in turf grass species , tomato (Lycopersicon esculentum L. Mill.) , and squash (Cucurbita moschata) .
SSAP is a multiplex amplified fragment length polymorphism (AFLP)-like technique that displays individual retrotransposon insertions as bands on a sequencing gel. Retrotransposons are mobile genetic elements that accomplish transposition via an RNA intermediate that is reverse transcribed before integration into a new location within the host genome. They are ubiquitous in eukaryotic organisms and constitute a major portion of the nuclear genome (often more than half of the total DNA) in plants . It has been successfully used for profiling of DNA polymorphism in many crops such as barley (Hordeum vulgare) , M. sativa , and sweetpotato (Ipomoea batatas (L.) Lam.) . To adapt conventional SSAP method to high-throughput situations, Tang et al.  developed and optimized a fluorescent multiplex PCR system for simultaneous selective amplification of Ty1-copia retrotransposon-based SSAPs, followed by capillary electrophoresis.
Recently, Robinson et al.  developed a computer program to identify and design PCR primers for amplification of SSR loci based on available DNA sequence information. SSR primers have been designed using publicly available expressed sequence tags (ESTs) in barley [49, 50], almond (Prunus communis Fritsch.) , peach (P. persica (L.) Batsch.) , T. aestivum , and O. sativa . These SSRs are useful as molecular markers because their development is inexpensive, they represent transcribed genes and their putative function can often be deduced by a homology search . SSRs have been the backbone to creating molecular maps for a number of years.
Chung and Staub  developed a set of consensus chloroplast primer pairs for simple sequence repeats (ccSSRs) from N. tabacum chloroplast sequences. All primer pairs produced amplicons after PCR employing chloroplast DNA from members of the Cucurbitaceae (six species) and Solanaceae (four species). Sixteen, 22 and 19 of the initial 23 primer pairs were successively amplified by PCR using template DNA from species of the Apiaceae (two species), Brassicaceae (one species) and Fabaceae (two species), respectively. Twenty of the 23 primer pairs were also functional in three monocot species of the Liliaceae (onion and garlic), and the Poaceae (oat). ccSSR primers were strategically "recombined" and were referred to correctly as recombined consensus chloroplast primers (RCCP) for PCR analysis of cucumber DNA.
Gawel et al.  developed a semi-specific PCR system in which primers were designed to target the semi-conservative sequences of the intron-exon junction. The most informative primers were selected from among the exon targeting (ET) and intron targeting (IT) primers, 12 to 18 bases in length. Also, Holland et al.  developed PCR primer pairs that target exons, introns, promoter regions in Z. mays and introns as well as repeat sequences in Avena sativa. Most recently, Hu and Vick  developed a primer system called target region amplification polymorphism (TRAP). This system uses 2 primers of 18 nucleotides each to generate markers. One of the primers, the fixed primer, is designed from the targeted EST sequence in the database; the second primer, the arbitrary primer, is an arbitrary sequence with either an AT- or GC-rich core to anneal with an intron or exon, respectively. The TRAP technique, taking advantage of the availability of sequence information, should be useful in plant genomics research involved in marker-trait association .
Oligonucleotide sequences of 12 URP primers (Adapted from ).
Optimization of primer structure or probe properties for increased specificity of primer (or probe) hybridization to the target sequence in PCR reactions (or microarray analysis).
Increased genome-wide coverage with minimum primer numbers and reduced sampling redundancy.
Novel primer design for amplification of microarray probes specific to gene-splice variants for more accurate mRNA profiling.
Recent innovations have led to more cost-effective and successful profiling studies and greater ease in subsequent purification of gene fragments. This review is intended not only to help scientists to update their knowledge of primer design for DNA polymorphism and mRNA profiling in higher plants, but also to increase their interests in making technical improvements using plant genomics and bioinformatics approach.
The authors would like to thank Roselee Harmon for assistance with laboratory analyses performed to obtain data related to primer utilization and specificity.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.