Question 1

What is FASTA format?

Accepted Answer

FASTA format is a text-based format for representing nucleotide or protein sequences. Each entry starts with a ">" symbol followed by a description line (header), and the sequence itself on subsequent lines. Example: >Primer_1
ATCGATCGATCG. The format was originally developed for the FASTA alignment tool and is now the most widely used sequence format in bioinformatics. Our converter handles both single-line and multi-line FASTA sequences.

Question 2

How do I prepare sequences for IDT or Twist orders?

Accepted Answer

IDT requires sequences in plate-map format (CSV or Excel with specific column names: Name, Sequence, Scale, Purification). Twist Bioscience accepts CSV with Name and Sequence columns. Our Vendor Format Adapter tool generates vendor-specific formats directly. However, if you have sequences in FASTA format and need a quick CSV for vendor ordering, this Format Converter is the first step — convert to CSV, then use the Vendor Format Adapter for final formatting.

Question 3

What are IUPAC ambiguity codes?

Accepted Answer

IUPAC codes extend the standard DNA alphabet (A, T, C, G) with ambiguity codes that represent multiple possible bases: R = A or G (purine), Y = C or T (pyrimidine), S = G or C (strong), W = A or T (weak), K = G or T, M = A or C, B = not A, D = not C, H = not G, V = not T, N = any base. Our converter validates against the full IUPAC alphabet and flags non-standard characters.

Question 4

Can I convert multi-line FASTA to single-line?

Accepted Answer

Yes. Multi-line FASTA (where long sequences are wrapped at 60 or 80 characters per line) is common in genomic databases. Our converter automatically joins multi-line sequences into single-line format during conversion. When outputting FASTA, you can choose between single-line (compact) and multi-line (wrapped at 80 characters) format.

Question 5

How does deduplication work?

Accepted Answer

Deduplication identifies and removes sequences that appear more than once in your dataset. Comparison is case-insensitive (ATCG = atcg = AtCg). When duplicates are found, the first occurrence is kept and subsequent copies are removed. The tool reports the number of duplicates found. This is particularly useful for oligo pools where duplicate sequences waste synthesis resources without adding experimental value.

FASTA Converter - Oligonucleotide Sequence Format Converter

Generic sequence conversion before vendor-specific export

Input & Options

Processing Options

ID Modification (Optional)

Results

Why Sequence Format Conversion Matters

How to Use the Format Converter

Frequently Asked Questions

Related Tools

Vendor Format Adapter

Batch Sequence QC

GC Content Analyzer

Check a result or ordering detail

Related reading