Last updated: March 4, 2026

NGS Library Prep: Oligo Design & Validation Workflow

Intermediate20-40 minutes

Design and validate oligonucleotides for next-generation sequencing library preparation: adapter sequences, dual-index barcodes, amplicon primers, and hybrid capture probes. This workflow covers oligo requirements for Illumina, Element Biosciences, and MGI platforms.

What You'll Learn

  • YesDesign adapter oligos with correct structure for ligation-based and tagmentation workflows
  • YesSelect and validate unique dual indexes (UDI) to prevent index hopping on patterned flow cells
  • YesDesign amplicon sequencing primers with adapter tails and validate Tm uniformity
  • YesEvaluate capture probe design parameters (GC, Tm, secondary structure) for hybrid capture panels
  • YesUse batch QC tools to screen oligo pools for synthesis-problematic sequences before ordering

1. NGS Oligo Types Overview

NGS library preparation requires several types of oligonucleotides, each with specific design criteria. The choice depends on your sequencing platform, library prep chemistry, and target enrichment strategy.

Oligo TypeLengthLibrary Prep MethodKey Design ParametersValidation Tool
Y-adapters (paired)60-70 ntLigation (TruSeq, NEBNext)Tm, secondary structure, 3' blockingStructure Predictor
Index primers (i7/i5)24-35 ntAll methodsHamming distance ≥3, color balanceBatch QC
Amplicon primers25-35 nt (target + tail)Amplicon-seqTm uniformity, GC 40-60%, specificityTm Calculator
Capture probes120-150 ntHybrid captureGC 30-65%, Tm uniformity, no repeatsGC Analyzer
UMI adapters70-80 nt (incl. N-bases)PCR-free / dedupRandom bases quality, adapter TmOligo Properties
Blocking oligos20-30 ntHybrid captureComplement adapter sequencesTm Calculator

2. Adapter Design & Validation

Sequencing adapters are the most critical oligos in library prep. They must form the correct Y-shaped structure after annealing, ligate efficiently to fragmented DNA, and be compatible with your sequencing platform's flow cell chemistry.

Illumina-Compatible Adapter Structure

P5 flow cellRd1 SP[i5 Index]Rd2 SP
P7 flow cellRd2 SP[i7 Index]Rd1 SP[Insert]

SP = sequencing primer binding site. Flow cell sequences (P5/P7) are fixed by Illumina — do not modify.

ParameterRequirementWhy
Adapter Tm65-72°C (annealed region)Stable duplex during cluster generation
3' endT-overhang (for A-tailing) or bluntMatches insert end preparation
5' modificationPhosphorylation (P5) or noneRequired for ligation on one strand
Secondary structureΔG > -3 kcal/mol at 20°CPrevent self-folding during ligation
PurificationHPLC or PAGECritical — truncated adapters create chimeras

Validate adapter sequences with our Secondary Structure Predictor to check for hairpins and self-dimers at 20°C (ligation temperature), and our Tm Calculator to confirm the annealed duplex region has sufficient Tm for cluster generation.

3. Index/Barcode Selection

Index sequences (barcodes) enable multiplexed sequencing — pooling multiple samples on a single flow cell. Proper index design prevents misassignment and ensures accurate demultiplexing.

Design RuleRequirementRationale
Hamming Distance≥3 between any pairTolerates 1 sequencing error during demultiplexing
Color BalanceAt least 1 A/C and 1 G/T per cycleRequired for 2-channel chemistry (NovaSeq, NextSeq)
GC Content25-65% per indexAvoid extreme bias in index reads
Homopolymers≤3 consecutive same basePrevent phasing errors during index read
Index Length8-10 nt standard, 10 nt for UDILonger indexes = more multiplexing capacity
Unique Dual IndexingEach i7+i5 combination uniquePrevents index hopping on patterned flow cells

Use our Batch Sequence QC tool to validate your index set: upload all indexes and check for GC extremes, homopolymers, and sequence similarity. For Hamming distance verification, compare each pair to ensure ≥3 mismatches.

4. Amplicon Sequencing Primers

Amplicon sequencing uses target-specific primers with adapter tail sequences to selectively amplify regions of interest. The primer consists of a target-specific portion (20-25 nt) plus a 5' adapter overhang (~33 nt).

Amplicon Primer Structure

5'-[Adapter tail ~33 nt]-[Target-specific 20-25 nt]-3'

Total primer length: ~53-58 nt. Design the target-specific portion first (Tm, GC, specificity), then append the platform-specific adapter tail. Only the target-specific Tm matters for PCR — the tail does not bind during initial cycles.

ParameterTarget-Specific RegionFull Oligo
Length20-25 nt53-58 nt
Tm60-65°CN/A (tail irrelevant for PCR)
GC Content40-60%Check full oligo for synthesis
ΔTm (all primers)<2°CCritical for multiplex
Amplicon Size150-300 bpFit within read length
Secondary StructureΔG > -2 kcal/molCheck full oligo with tail

Validate all amplicon primers with our Tm Calculator (target-specific region only) and screen the full oligos with Batch QC for synthesis compatibility.

5. Capture Probe Design

Hybridization capture uses biotinylated oligo probes (120-150 nt) to enrich target regions from a whole-genome library. Probe quality directly determines capture uniformity and on-target rate.

ParameterOptimalAvoidTool
Probe length120-150 nt<80 nt or >200 nt-
GC content30-65%<20% or >75%GC Analyzer
Tm uniformityWithin 5°C across panel>10°C spreadTm Calculator
Tiling density2-3x overlapNo overlap (gaps)-
Repeat maskingMask RepBase/RepeatMaskerProbes in Alu, LINE, SINEBatch QC
Secondary structureΔG > -5 kcal/molStrong hairpinsStructure Predictor
Modification5' biotin (TEG spacer)Internal biotin-

6. Step-by-Step Validation Workflow

1

Identify Required Oligos

List all oligos needed for your library prep: adapters, indexes, primers, probes, or blocking oligos. Match to your sequencing platform.

Use All Tools
2

Validate Tm and GC

Check Tm (nearest-neighbor) and GC content for each oligo. Ensure Tm uniformity within each functional group (e.g., all amplicon primers within 2°C).

Use Tm Calculator
3

Screen Secondary Structures

Check all oligos for hairpins and self-dimers at their working temperature (20°C for adapters, 60°C for PCR primers). Flag ΔG < -3 kcal/mol.

Use Structure Predictor
4

Batch QC for Pools

For large oligo sets (amplicon panels, capture probes): run batch analysis for GC extremes, homopolymers, tandem repeats, and other problematic features.

Use Batch QC
5

Verify Index Compatibility

Check Hamming distances, color balance, and uniqueness for your index set. Verify compatibility with your flow cell chemistry.

Use Batch QC
6

Order and QC

Order oligos with appropriate purification (HPLC for adapters, desalting for primers). Request mass spec verification for critical adapters.

Use Dilution Calculator

Frequently Asked Questions

What oligos do I need for NGS library preparation?
The specific oligos depend on your library prep method. For ligation-based preps (TruSeq, NEBNext): paired adapters with unique dual indexes (UDI). For amplicon sequencing: target-specific primers with adapter tails. For hybrid capture: biotinylated capture probes (120-150 nt). For tagmentation (Nextera): only index primers are needed (adapters inserted enzymatically). All methods require index sequences for multiplexing.
How do I prevent index hopping in multiplexed sequencing?
Index hopping (index switching) causes sample misassignment on patterned flow cells (NovaSeq, NextSeq 2000). Prevention: (1) Use unique dual indexes (UDI) instead of combinatorial indexing; (2) Minimize free adapter in final libraries (bead cleanup); (3) Use UDI sets with maximum Hamming distance (≥3 edits); (4) Include a demultiplexing filter for unexpected index combinations. Our Batch QC tool can verify Hamming distances between your index sets.
What Tm should I target for NGS adapters?
Adapter oligos have different Tm requirements by component: index sequences (i7/i5) should have Tm 65-72°C for stable hybridization during cluster generation; sequencing primer binding sites should match the platform specification (typically 60-65°C); adapter stem regions should be stable at ligation temperature (20-25°C). Use our Tm Calculator with your specific buffer conditions.
How do I design capture probes for target enrichment?
Capture probes (120-150 nt biotinylated oligos) require: (1) Tile target regions with 2-3x overlap for complete coverage; (2) GC content 30-65% per probe (flag extremes); (3) Mask repetitive elements (RepeatMasker); (4) Tm uniformity within 5°C across the probe panel; (5) Avoid strong secondary structures (ΔG > -5 kcal/mol). Vendor panels (IDT xGen, Agilent SureSelect, Twist) handle probe design, but custom panels require manual design validation.
What is the difference between amplicon and hybrid capture sequencing?
Amplicon sequencing uses PCR primers to selectively amplify targets (100-500 bp per amplicon). It is cheaper, faster, and works with low-input DNA, but has PCR bias and limited target size (<500 kb typical). Hybrid capture uses biotinylated probes to pull down target fragments from a whole-genome library. It is more uniform, handles larger target regions (1-60 Mb), but costs more and requires more input DNA (50-200 ng).
How many indexes do I need for my sequencing run?
The number of indexes depends on: flow cell capacity, desired reads per sample, and demultiplexing requirements. For example, a NovaSeq S4 flow cell producing 10 billion reads with 30M reads/sample needs ~333 indexes. Always use at least 4 unique indexes per lane for proper registration on patterned flow cells. Standard UDI sets come in 96 (plate format) or 384 (high-throughput) configurations.

Related Tools

Further Reading