Index to this page

Restriction Fragment Length Polymorphisms (RFLPs)

Restriction enzymes cut DNA at precise points producing

If probes encounter a complementary sequence of nucleotides in a test sample of DNA, they bind to it by Watson-Crick base pairing and thus identify it.

Polymorphisms are inherited differences found among the individuals in a population.

RFLPs have provided valuable information in many areas of biology, including:

Case 1: Screening for the sickle-cell gene

Sickle-cell disease is a genetic disorder in which both genes in the patient encode the amino acid valine (Val) in the sixth position of the beta chain (betaS) of the hemoglobin molecule. "Normal" beta chains (betaA) have glutamic acid at this position.

The only difference between the two genes is the substitution of a T for an A in the middle position of codon 6.


When the normal gene (betaA) is digested with the enzyme and the fragments separated by electrophoresis, the probe binds to a short fragment (between the red arrows).

However, the enzyme cannot cut the sickle-cell gene at this site, so the probe attaches to a much larger fragment (between the blue arrows).

The figure (from data provided by S. E. Antonarakis) shows the pedigree of a family whose only son has sickle-cell disease. Both his father and mother were heterozygous (semifilled box and circle respectively) as they had to be to produce an afflicted child (solid box). The electrophoresis patterns for each member of the family are placed directly beneath them. Note that the two homozygous children (1 and 3) have only a single band, but these are more intense because there is twice as much DNA in them.

In this example, a change of a single nucleotide produced the RFLP. This is a very common cause of RFLPs and now such polymorphisms are often referred to as single nucleotide polymorphisms or SNPs. (However, not all RFLPs arise from SNPs. Link to an example of one that didn't.)

How can these tools be used?

By testing the DNA of prospective parents, their genotype can be determined and their odds of producing an afflicted child can be determined. In the case of sickle-cell disease, if both parents are heterozygous for the genes, there is a 1 in 4 chance that they will produce a child with the disease. Amniocentesis and chorionic villus sampling make it possible to apply the same techniques to the DNA of a fetus early in pregnancy. The parents can learn whether the unborn child will be free of the disease or not. They may choose to have an abortion rather than bring an afflicted child into the world.

Three problems:

Case 2: Screening for a RFLP "marker"

If a particular RFLP is usually associated with a particular genetic disease, then the presence or absence of that RFLP can be used to counsel people about their risk of developing or transmitting the disease. The assumption is that the gene they are really interested in is located so close to the RFLP that the presence of the RFLP can serve as a surrogate for the disease gene itself. But people wanting to be tested cannot simply walk in off the street. Because of crossing over, a particular RFLP might be associated with the mutant gene in some people, with its healthy allele in others. Thus it is essential to examine not only the patient but as many members of the patient's family as possible.

The most useful probes for such analysis bind to a unique sequence of DNA; that is, a sequence occurring at only one place in the genome. Often this DNA is of unknown, if any, function. This can actually be helpful as this DNA has been freer to mutate without harm to the owner. The probe will hybridize (bind to) different lengths of digested DNA in different people depending on where the enzyme cutting sites are that each person has inherited. Thus a large variety of alleles (polymorphisms) may be present in the population. Some people will be homozygous and reveal a single band; others (e.g., all the family members shown below) will be heterozygous with each allele producing its band.

The pedigree shows the inheritance of a RFLP marker through three generations in a single family. A total of 8 alleles (numbered to the left of the blots) are present in the family. The RFLPs of each member of the family are placed directly below his (squares) or her (circles) symbol and RFLP numbers.

If, for example, everyone who inherited RFLP 2 also has a certain inherited disorder, and no one lacking RFLP 2 has the disorder, we deduce that the gene for the disease is closely linked to this RFLP. If the parents decide to have another child, prenatal testing could reveal whether that child was apt to come down with the disease.

But note, that crossing over during gamete formation could have moved the RFLP to the healthy allele. So the greater the distance between the RFLP and the gene locus, the lower the probability of an accurate diagnosis.

Link to discussion of genetic linkage.

Case 3: DNA "typing"

Each human cell contains 6 x 109 base pairs of DNA. Some of this represents protein-encoding genes (e.g., for the beta chain of hemoglobin) that are identical in a large proportion of people. But long stretches of DNA do not encode for anything and are free to mutate extensively. It seems certain that if we could read the entire sequence of DNA in each human, we would never find two that were identical (unless the samples were from identical siblings; i.e., derived from a single zygote).

So each person's DNA is as unique as a fingerprint.

This truth has not escaped the law enforcement and legal professions. Analysis of DNA, called DNA typing, is widely used to

The image (courtesy of Lifecodes Corporation) shows the test results in a rape case. Two probes were used: one revealing the bands at the top, the other those at the bottom.

DNA was tested from

One the basis of this test, suspect #2 can clearly be ruled out. None of his bands matches the bands found in the semen.

Is suspect #1 guilty?

We can never be certain. The best we can do is to estimate the probability that another person, picked at random, could provide the same DNA fingerprint.

As a conservative estimate, a given allele (band) might be found in 25% of the people tested. The probability of a random match of two alleles is (0.25)2 or 1 in 16. The probability that 6 alleles match, as in this case, is (0.25)6 or 1 in 4096.

But the suspect was not picked at random, so you may feel that the evidence of guilt is strong.

The more probes you use, the more confident you can be that you have gotten the right man. If, for example, a set of probes revealed 14 bands in a suspect's DNA identical to those in the semen sample, the probability that you have the wrong man drops to less than 1 in 268 million (0.25)14 = 1/268,435,456, which is almost as great as the entire population, males and females, in the United States.

Starting in 1999, law enforcement agencies in both Great Britain and the United States began switching to a new version of RFLP analysis using shorter sequences called STRs ("Short Tandem Repeats").

STRs are repeated sequences of a few (usually four) nucleotides, e.g., TCATTCATTCATTCAT. They often occur in the untranslated parts of known genes (whose sequence can be used for the PCR primers). The exact number of repeats (6, 7, 8, 9, etc.) varies in different people (and, often, in the gene on each chromosome; that is, people are often heterozygous for the marker).

When 13 STR loci — scattered over different chromosomes — are examined, the chance that two people picked at random have the same pattern is less than 1 in 1 trillion. The U.S. Federal Bureau of Investigation (FBI) wants to increase the number of loci examined to 20 further eliminating the possibility of false positives.

Welcome&Next Search

7 January 2019