CAT DNA Sequencing Services

Genomics/Bioinformatics Seminar being offered in September 2024

Sept 27, 2024 - So you want to do a: PacBio RNAseq Experiment

 

The UCSF CAT Core employs both Illumina and PacBio sequencers for the full range of short to long-read applications.

The CAT Core currently has the following sequencers available:

Illumina

PacBio

The CAT's Illumina NovaSeqX operation started in March 2023, and the PacBio Revio system in July 2023. A detailed comparison between the two platforms can be found at the bottom of this page.

Illumina Service Offerings within the CAT

NovaSeq X Plus

The NovaSeq X Plus system is a dual flow cell system with flow cell types of 25B (8 lanes), 10B (8 lanes), and 1.5B (2 lanes). Sequencing kits include a 100bp cycle, 200bp cycle, and 300bp cycle kits.

NovaSeq X Plus Queues
Flow Cell/lanes Reads/lane Cycle kit CAT Queues (R1xBC1xBC2xR2)

25B

(8 lanes)

3.25B 100 cycles Coming Q4 2024
200 cycles Coming Q4 2024
300 cycles 28x10x10x90
151x12x24x151
Custom

10B

(8 lanes)

1.25B 100 cycles 28x10x10x90
51x12x24x51
101x12x24x0
Custom
200 cycles Custom
300 cycles 151x12x24x151
Custom

1.5B

(2 lanes)

0.75B 100 cycles Custom
200 cycles Custom
300 cycles Custom

NovaSeq 6000

The NovaSeq 6000  system is a dual flow cell system with flow cell types of S4 (2 lanes), S2/S1/SP (2 lanes each ). Sequencing kits include 35bp cycle (only available on the S4 flow cell), 50bp cycle (not available on the S4 flow cell), 100bp cycle, 300bp cycle, and 500bp cycle (only available on the SP flow cell) kits.

NovaSeq 6000 Queues
Flow Cell/lanes Reads/lane Cycle kit CAT Queues (R1xBC1xBC2xR2)

S4

(4 lanes)

2.5B 35 cycles Custom
200 cycles 101x12x24x101
300 cycles 151x12x12x151
Custom

S2

(2 lanes)

2.1B 100 cycles Custom
200 cycles Custom
300 cycles Custom

S1

(2 lanes)

800M 100 cycles Custom
200 cycles Custom
300 cycles Custom

SP

(2 lanes)

400M 100 cycles Custom
200 cycles Custom
300 cycles Custom
500 cycles Custom

* Users need to submit libraries that are to be run on one or more lanes, the CAT Core will not pool libraries across different users.

* Those who submit samples for an entire flow cell will be kept together and run sooner.

* Users that require a Custom configuration, or need custom primers will need to request a whole flow cell. Custom configurations need also be noted on the iLab submission form.

PacBio Service Offerings within the CAT

A full run on the PacBio Revio system is 4 SMRT cells. Users are allowed to request 1 SMRT cell at a time, the core will attempt to load 4 SMRT cells at a time, however depending on the queue SMRT cells can be run in any increment of 1 to 4.

Illumina NovaSeq X and PacBio Revio Comparison

Illumina short-read sequencers and PacBio long-read sequencers are two prominent technologies in genomics with distinct characteristics, advantages, and applications. The choice in deciding which to use depends on the application, feel free to contact the CAT core and ask for a consultation. Below is a detailed comparison:

  Illumina PacBio

Technology

Illumina uses a technology called "sequencing by synthesis" (SBS). During SBS, fluorescently labeled nucleotides are incorporated into a growing DNA strand, and the emitted fluorescence is detected to determine the sequence.

PacBio uses single-molecule, Real-Time (SMRT) sequencing. This technology involves observing the synthesis of DNA in real time as a polymerase incorporates fluorescently labeled nucleotides.

Read Lengths

Short reads, typically between 50 to 300 base pairs (bp) per read.

Long reads, typically between 10,000 to 25,000 base pair CCS reads.

Accuracy

High accuracy with error rates generally below 1%. Illumina achieves this through redundant sequencing, where each base is read multiple times.

Initially, individual long reads from PacBio had higher error rates (10-15%). However, advances like circular consensus sequencing (CCS) have significantly improved accuracy, often achieving 99.9% accuracy in HiFi reads. The Revio does not generate non-HiFi reads.

Output

Generates a high volume of data. An Illumina NovaSeq X sequencer can produce billions of reads per run. Outputs can achieve 25B reads and 8Tb of data per flow cell.

Produces lower overall read counts compared to Illumina but with much longer reads, providing valuable contextual information for certain applications. Output can achieve 6-10M reads and up to 100Gb of data per SMRT cell.

Cost

Cost-effective on a per-base basis due to the high throughput, making it suitable for large-scale projects like whole-genome sequencing and RNA sequencing.

More expensive on a per-base basis compared to Illumina, but the cost can be justified for specific applications requiring long reads.

Applications

Ideal for applications requiring high coverage and precision, such as genome-wide association studies (GWAS), exome sequencing, transcriptome analysis, and detecting small variants (SNPs and indels).

Suitable for de novo genome assembly, resolving complex genomic regions, detecting structural variants, isoform sequencing (Iso-Seq), and metagenomics. The Revio can also make 5mC methylation calls in CpGs.

Limitations

Short reads can be problematic for assembling genomes with repetitive regions, structural variants, and complex rearrangements.

Lower throughput compared to Illumina and higher initial error rates in raw reads, though corrected by consensus sequencing.

Summary

  • Illumina short-read sequencers excel in producing high volumes of highly accurate short reads cost-effectively, making them ideal for projects requiring high coverage and detailed variant detection.
  • PacBio long-read sequencers provide much longer reads, which are invaluable for assembling complex genomes, detecting large structural variants, and resolving repetitive regions, albeit at a higher cost, and with CCS reads are also highly accurate.

Referencing the CAT Core and Publications

When submitting manuscripts, please acknowledge the CAT by including the text: “Sequencing was performed at the UCSF CAT, supported by UCSF PBBR, RRP IMIA, and NIH 1S10OD028511-01 grants.” 

 

To aid our facility in future grant submissions, it helps to reference labs and publications that have benefited from our core please notify us of your achievement by emailing catcore [at] ucsf.edu the PMID number, lab, and the piece of equipment used in the paper.