Readfish enables targeted nanopore sequencing of gigabase-sized genomes


Nanopore sequencers can be used to selectively sequence certain DNA molecules in a pool by reversing the voltage across individual nanopores to reject specific sequences, enabling enrichment and depletion to address biological questions.

Previously, we achieved this using dynamic time warping to map the signal to a reference genome, but the method required substantial computational resources and did not scale to gigabase-sized references. Here we overcome this limitation by using graphical processing unit (GPU) base-calling.

We show enrichment of specific chromosomes from the human genome and of low-abundance organisms in mixed populations without a priori knowledge of sample composition.

Finally, we enrich targeted panels comprising 25,600 exons from 10,000 human genes and 717 genes implicated in cancer, identifying PMLRARA fusions in the NB4 cell line in <15 h sequencing.

These methods can be used to efficiently screen any target panel of genes without specialized sample preparation using any computer and a suitable GPU.

Our toolkit, readfish, is available at https://www.github.com/looselab/readfish.

Authors: Alexander Payne, Nadine Holmes, Thomas Clarke, Rory Munro, Bisrat Debebe, Matthew W Loose