DNA Sequencing

What is DNA Sequencing?

DNA sequencing is a fundamental laboratory technique used to determine the exact order of nucleotides (or bases) in a DNA molecule. These bases—Adenine (A), Thymine (T), Cytosine (C), and Guanine (G)—form the genetic code that carries the biological instructions for cellular development, functioning, and reproduction.

Importance in Genomics and Medicine

1) The specific sequence of these bases encodes information vital for the expression and regulation of genes, making DNA sequencing a critical tool in genetic and genomic studies. Understanding the sequence helps researchers interpret the structure and function of genes, regulatory elements, and non-coding regions of the genome.

2) Today, there are multiple techniques available for DNA sequencing, each with its advantages, limitations, and applications. The development of faster, cheaper, and more accurate sequencing methods is a rapidly advancing field within genomics research, paving the way for innovations in personalized medicine, disease diagnostics, and biotechnology.

1. First Generation DNA Sequencing (Sanger Sequencing)

This was the earliest method used to sequence DNA. It laid the foundation for future advancements but had many limitations.

Steps Involved:

  • First, chromosomes were separated using a technique called Pulse Field Gel Electrophoresis (PFGE).
  • The DNA was then cut into smaller fragments using restriction enzymes.
  • These fragments were inserted into cloning vectors such as BAC (Bacterial Artificial Chromosome), YAC (Yeast Artificial Chromosome), and PAC (P1 Artificial Chromosome) which could carry large DNA segments.
  • The inserted DNA fragments often overlapped, which allowed researchers to build a continuous sequence.

Sequencing Process – Sanger Method:

  • DNA was copied using special nucleotides that stop the chain at specific bases (A, T, C, or G).
  • Each base was tagged with a fluorescent marker.
  • The resulting DNA fragments were separated by size using capillary gel electrophoresis.
  • A fluorescence detector read the sequence of the bases as the fragments passed through the capillary.
  • The output was shown as a chromatogram – a graph with colored peaks representing different bases.

Output and Efficiency:

  • Each capillary could read about 800–1000 base pairs of DNA.
  • A sequencing machine with 96 capillaries could process around 96,000 base pairs in one run.

Drawbacks:

  • The process was very slow and labour-intensive.
  • It required a lot of manual effort.
  • Costs were very high, especially for large-scale genome projects.

2. Next Generation DNA Sequencing (NGS)

To overcome the limitations of Sanger sequencing, scientists developed Next Generation Sequencing, also known as NGS. It is the most commonly used method today due to its speed, cost-effectiveness, and ability to process millions of DNA fragments at once.

Key Features:

  • NGS allows massively parallel sequencing, which means millions of DNA fragments can be read simultaneously.
  • It does not require cloning or sub-cloning steps, which saves time and money.
  • Provides high accuracy because of deep coverage and multiple reads.

Example – Illumina Sequencing:

  1. DNA is broken into small fragments (~1–2 kb).
  2. Adaptors (short DNA sequences) are attached to the ends of these fragments.
  3. The fragments bind to a glass slide called a flow cell, where short oligonucleotides are already attached.
  4. The DNA bends and forms a bridge, creating a loop.
  5. Bridge PCR amplification copies the DNA repeatedly, forming clusters of identical fragments.
  6. Fluorescently labeled nucleotides (dNTPs) are added one at a time.
  7. A camera captures the fluorescent signals after each base is added.
  8. The sequence is recorded for each DNA cluster base by base.

Advantages:

  • High throughput – millions of sequences at once.
  • Relatively low cost per base.
  • Automated and efficient workflow.

Limitations:

  • It produces short reads (only 75 to 300 base pairs).
  • Additional computational strategies are needed to assemble complete genomes from short reads.

3. Third Generation DNA Sequencing (Nanopore Technology)

This is the most advanced and real-time sequencing method. It sequences single DNA molecules directly, without any copying or fluorescent tags.

How It Works:

  • A DNA helicase enzyme unwinds the double-stranded DNA.
  • A single strand of DNA is passed through a nanopore, which is a tiny protein hole embedded in a synthetic membrane.
  • An ionic current flows through the pore.
  • As each DNA base (A, T, C, G) passes through, it causes a unique change in the current.
  • These changes in current are detected and translated into the DNA sequence.

Advantages:

  • No need for PCR or chemical tagging.
  • Real-time sequencing – results are available immediately.
  • Can be used in remo     te or field locations.
  • Produces very long reads, even up to 1 million base pairs.
  • Low cost, simple sample preparation, and minimal equipment.

Applications of DNA Sequencing:

  1. Genomic Research:
    • Unraveling the complete genetic code of organisms for understanding genetic diversity and evolution.
  2. Medical Diagnostics:
    • Identifying genetic mutations associated with diseases for diagnostic and prognostic purposes.
  3. Personalized Medicine:
    • Tailoring medical treatments based on an individual’s genetic makeup.
  4. Pharmaceutical Development:
    • Accelerating drug discovery and development by understanding genetic targets.
  5. Forensic Analysis:
    • DNA profiling for identification purposes in forensic investigations
Scroll to Top