Whole Genome Sequencing Frequently Asked Questions

Whole Genome Sequencing

What is whole genome sequencing (WGS)?

Whole genome sequencing (WGS) is the comprehensive reading and analysis of an entire genome, including intronic and exonic regions. It provides a high-resolution, base-by-base view of the entire genetic material, including all chromosomes and mitochondrial or plasmid DNA.

What are the differences between the service packages?

The packages differ in price and turnaround time:

Value is the most economical option to help projects come under budget
Preferred offers a balance of speed and cost, leveraging our optimized processes for quick results without breaking the bank
Express helps meet tight deadlines with industry-leading turnaround times
Lightning provides results at lightning speed for select library options

The package type can be revised after receiving your initial quote.

Note: Service packages may not be available for all services (e.g. EZ services), projects, or institutions. Please inquire if an expedited solution is required for an exempt service.

Do you offer PCR-free whole genome sequencing?

Yes, PCR-free library preparation is available for whole genome sequencing. PCR-free means reduced library bias and gaps resulting in high data quality and optimal variant detection.

How do I request a quote for whole genome sequencing?

For short-read (Illumina) whole genome sequencing, select the Whole Genome Sequencing icon.
For long-read (PacBio and Oxford Nanopore Technologies) whole genome sequencing, select the Long-Read Services icon.

Then fill out the form with your project information and select Next: Review Inquiry at the bottom. Review the details of your inquiry and click Submit Quote Request.

What is the price for whole genome sequencing?

Please submit a quote request to receive accurate pricing information, as the cost depends on the details of the project. You can also contact the team by submitting an inquiry.

For a quick tutorial, watch our step-by-step instructions.

Technical Design

What are the technical details of your whole genome sequencing?

Please download our Technical Specifications Sheet. Visit the following pages for more information about Low-Pass Whole Genome Sequencing and Clinical Whole Genome Sequencing.

What is the difference between resequencing and de novo sequencing and assembly?

Resequencing is typically performed when a reference genome sequence is available. Sequencing reads are aligned back to the reference to determine the location in the genome the specific read best matches. Resequencing is often applied to variant detection (single nucleotide polymorphisms, small insertions/deletions, structural variants, copy number variation) and derivatives thereof — such as tumor vs normal comparison, population genetic analysis, Mendelian disease analysis, and trio sequencing.

De-novo sequencing and assembly are typically applied to organisms where no reference genome is available, or the available reference is of poor quality. Genomes that have not been sequenced before must be assembled via a de novo approach following sequencing. This assembly can then be used for additional analyses and the basis for future resequencing projects.

What sequencing platforms are used for whole genome sequencing?

The answer is highly dependent on many factors, including the goals of your experiment, the organism of interest, and the amount of material available.

Most resequencing applications are well suited for Illumina platforms. Illumina sequencing produces large numbers of short reads (<300 bp) at high quality allowing for robust variant detection. Illumina datasets are widely applied to genomic studies of viruses, bacteria, mammals, and plants with a wide range of input materials including low input DNA, FFPE samples, and circulating cell free DNA.

However, certain analyses are improved by very long sequencing reads generated by the PacBio (up to 25kb) or Oxford Nanopore Technologies (exceeding 4Mb) platforms. These long reads can span large repeat regions, enabling near complete de novo assembly of small to medium sized genomes, improved assembly of large complex genomes (e.g. plants, human and animals), and more sensitive detection of large structural variants. Long-read sequencing is ideal for complex genomes that could be highly repetitive in nature and/or have GC extremes.

For more information, visit our NGS Platforms webpage.

What starting materials are accepted?

The most common starting material is extracted genomic DNA. However, we have extensive experience performing extractions from a wide variety of materials, including cell pellets, fresh frozen tissue, whole blood, and FFPE slides. Read our Sample Submission Guidelines for more information.

How do I properly prepare high molecular weight (HMW) DNA samples for long-read sequencing?

Please refer to our HMW DNA Best Practices for guidelines on extracting and handing these samples.

How much coverage do I need for my experiment?

The answer is highly dependent on many factors, including the goals of your experiment, the organism of interest, and the amount of material available.

Below are some general recommendations for short-read whole genome sequencing using Illumina platforms:

Germline/frequent variant analysis: 20-50x
Somatic/rare variants: 100-1000x
Tumor vs Normal: ≥60x tumor, ≥30x normal
Population studies: 20-50x
De novo assembly: 100-1000x

Below are some general recommendations for long-read whole genome sequencing using PacBio and Oxford Nanopore Technologies platforms:

Gap filling and scaffolding: 10x
Large structural variant detection: 10x
Germline/frequent variant analysis: 20-50x
De novo assembly: 50-100x

Coverage is a multiplier based on the total size of the genome (see below):

Coverage = (read length) x (total number of reads) / (genome size)

Example: 30x = (2x150 bp/read) x (300 x 10⁶ reads) / (3 x 10⁹ bp)

Data Analysis

What bioinformatics services are available for whole genome sequencing projects?

We provide raw data as FASTQ files for all projects. We also have advanced bioinformatics capabilities to provide optional data analysis services, including:

For resequencing of individual human genomes on Illumina platforms -

Standard DNA Analysis Package: mapping, SNP/INDEL detection, structural variant analysis, and copy number variation detection
De-novo assembly
Tumor vs normal variant analysis
Joint-genotyping
Population genotyping
Integration site prediction
Custom analysis

For sequencing of non-human genomes on Illumina platforms –

Standard DNA Analysis Package: mapping, SNP/INDEL detection, and structural variant analysis
CNV discovery
De-novo assembly
Custom analysis

Larger structural variants can be better resolved with low coverage on long-read sequencing such as PacBio and Oxford Nanopore Technologies platforms. Complete to near-complete sequence assemblies can be obtained for most genomes.

General Questions

How do I contact GENEWIZ for a technical consultation about my project?

The GENEWIZ NGS Team is composed of Ph.D. scientists who can help you optimize your project design and provide consultation. You can contact the team by submitting an inquiry.

Does GENEWIZ guarantee turnaround time?

The turnaround time listed within the quote is inclusive of all steps quoted, unless otherwise noted. Timelines are based on first-pass processing. If repeat processing is required, the turnaround may be subject to change and will be proactively communicated to the client.

If the scope of a project changes after project initiation or if sample or project clarification is required after sample receipt, GENEWIZ may reassess the turnaround time based on the subsequent communications and modifications (if applicable).

What extraction methods are used?

All kits and reagents are routinely updated to remain best-in-class. Please proactively contact GENEWIZ if you would like historical versions, which may be available on a case-by-case basis.

Extractions are performed to the best of our ability and are unable to guarantee yield due to multiple variables that may affect sample yield and quality. Costs to cover the work performed will be applied, regardless of the outcome.

What library preparation methods are used?

All kits and reagents are routinely updated to remain best-in-class. Please proactively contact GENEWIZ if you would like historical versions, which may be available on a case-by-case basis.

Does GENEWIZ perform sample QC and what methods are used?

For most of the services, an initial sample QC is included within the fee of the project for all samples that will proceed to the next processing stage (e.g. library preparation or sequencing) unless otherwise noted within the quotation. Resubmissions or additional, optional samples may incur a nominal fee for QC.

Initial sample QC can include:

Assessing RNA integrity by Fragment Analyzer and concentration by Qubit assay
Assessing DNA concentration by Qubit assay and DNA size (for specific projects only) by pulse-field gel electrophoresis
Assessing premade library size and concentration on the Fragment Analyzer and concentration by Qubit assay
Assessing cryopreserved cells by cell counting and a cell viability assay

What sequencing instruments will be used for my project?

The sequencing platform will be listed within the Service Description of the quotation.

Unless specifically noted in the quotation, GENEWIZ reserves the right to choose between equivalent instruments (e.g. Illumina MiSeq vs NovaSeq X Plus, PacBio Sequel IIe vs Revio, Oxford Nanopore GridION vs PromethION, etc.) depending on the target read depth and configuration requested. If a specific instrument is required, please add special comments in your quote/order and notify the GENEWIZ NGS team prior to project initiation.

You can contact the GENEWIZ NGS team at:

US: NGS@azenta.com
EU & UK: NGS.Europe@azenta.com

Does GENEWIZ guarantee data quality and yield?

Illumina-based projects

For samples that pass QC and libraries prepared at GENEWIZ:

Data Quality

NovaSeq 2x150bp: ≥85% of bases ≥Q30

NovaSeq 2x250bp: ≥80% of bases ≥Q30

MiSeq 2x150bp: ≥80% of bases ≥Q30

MiSeq 2x250bp: ≥75% of bases ≥Q30

Data Yield

Within 10% of total data target yield per lane or flowcell, unless otherwise noted

Within 20% of per sample target yield, unless otherwise noted

Quality and yield for samples that do not pass QC and are processed at best effort are not guaranteed. Premade libraries/library pools submitted for sequencing only quality and yield are evaluated on a case-by-case basis.

Multiplexing is performed to the best of our ability to ensure relatively even data distribution amongst samples.

PacBio-based projects

Due to various sample-related factors that may influence long-read sequencing yield and quality, GENEWIZ cannot guarantee overall data output and quality. However, based on extensive experience with long-read workflows, GENEWIZ has established target metrics based on the PacBio system – at least X M CCS reads per Sequel IIe SMRT cell and Y M CCS reads per Revio SMRT cell. If a project does not meet these targets, an investigation will be conducted. If the issue is determined to be unrelated to the sample, a repeat or top-off will be performed as necessary.

What is GENEWIZ’s coverage guarantee?

Due to the possible wide range of performance being influenced by sequence complexity and sample quality, GENEWIZ does not guarantee average coverage or on-target specificity for each sample. We can instead recommend target data output that would increase the chances of obtaining the desired coverage based on the sample and library type utilized.

What is GENEWIZ’s data analysis guarantee?

Data analysis is performed to the best of our ability and results are not guaranteed for samples or libraries not passing QC or do not fit the analysis pipeline’s criteria.

Can you recommend the best data delivery option?

Yes, we can recommend the best data delivery option based on the platform and project details. Options include:

Secure File Transfer Protocol (sFTP) - additional charges may apply
Customer Cloud Account - AWS, Microsoft Azure, Google Cloud
Hard Drive - additional charges may apply

Delivery to most customer cloud solutions is free of charge. If you have any questions, please submit an inquiry.

How will my data be delivered?

By default, results are sent via a secure File Transfer Protocol (sFTP). Please refer to our sFTP Data Download Guide for instructions on how to download your data and troubleshooting tips. Additional charges may be applicable for large data transfers.

Please refer to the delivery email sent from GENEWIZ for detailed information on the transfer and consult with your IT department to ensure compliance with your institution’s policies.

Will my data be secure at GENEWIZ?

We take data security very seriously and we make every attempt to keep your data private and protected. The data transfer we offer is secure; however, should you desire an alternative delivery method, we are happy to work with you.

How long does GENEWIZ store samples?

We hold any remaining samples for up to 3 months after project completion. Clinical samples processed in our Regulatory-Environment or CLIA-licensed laboratory may have longer sample storage timelines. Please contact us if you would like samples retained for a longer period or shipped back to you.

How long does GENEWIZ store data?

GENEWIZ offers raw data storage (i.e. FASTQ for Illumina, BAM for PacBio, POD5 for Oxford Nanopore) for up to 6 months after project completion. Data generated for samples processed in the Regulatory-Environment or CLIA-licensed laboratory may have longer data retention timelines. Please contact us if you would like the data to be retained for a longer period.

Sample Preparation

How should I prepare and send my samples?

View our Sample Submission Guidelines for instructions on preparing and sending samples. Organize your samples in tubes or plates following the order indicated on the sample submission form.

Tubes: Prepare samples in clearly labeled and well-organized 1.5 mL flip-cap microcentrifuge tubes. Please avoid using Parafilm to seal the tubes.

Icon_460x305_Tubes_48Samples

Unless otherwise instructed, GENEWIZ reserves the right to combine multiple vials of the same sample for extraction and/or library preparation.

Plates: For projects with 16 or more samples, prepare samples in clearly labeled, securely sealed 96-well full-skirted PCR v-bottom plates. Arrange the samples vertically by column (i.e. A1, B1, C1, etc.)

Icon_460x305_Plates_48Samples

Ship samples directly to our facility. Use the shipping address listed on the order receipt.

What happens if my samples do not meet your starting material requirements?

Please reach out to us by submitting an inquiry.

Ordering & Processing

How do I request a quote?

For a quick tutorial, watch the video below.

How do I confirm a quote?

For a quick tutorial, watch the video below.

How can I monitor the progress of my project?

The status of your order, including an estimated date of delivery, can be viewed anytime through your GENEWIZ account. Visit the Order Summary page of your project to find the current order status.

How can I send my samples to GENEWIZ?

Option #1: Ship samples directly to our facility. Use the shipping address listed on your order receipt.

Option #2: For double-stranded DNA samples, submit samples into a local GENEWIZ dropbox which are conveniently located throughout the US, Europe and United Kingdom. To locate a dropbox near you, please submit an inquiry. Place your order receipt and samples in a Ziploc bag packaged according to sample submission guidelines specific for your Next Generation Sequencing service. 

Note: Not recommended for RNA and primary sample types.

What are the cutoff times for my dropbox pickup?

During checkout, consult the Order Summary  page to find out the daily cutoff times for your selected dropbox (see example below).

If you miss the dropbox deadline, feel free to ship samples directly to us. 

Whole Genome Sequencing Frequently Asked Questions

Jump to Topic