Long-read sequencing and genome assembly of natural history collection samples and challenging specimens

Museum collections harbor millions of samples, largely unutilized for long-read sequencing. Here, we use ethanol-preserved samples containing kilobase-sized DNA to show that amplification-free protocols can yield contiguous genome assemblies. Additionally, using a modified amplification-based protoc...

Full description

Bibliographic Details
Main Authors: Bein, Bernhard, Chrysostomakis, Ioannis, Arantes, Larissa S., Brown, Tom, Gerheim, Charlotte, Schell, Tilman, Schneider, Clément, Leushkin, Evgeny, Chen, Zeyuan, Sigwart, Julia, Gonzalez, Vanessa, Wong, Nur Leena W.S., Santos, Fabricio R., Blom, Mozes P.K., Mayer, Frieder, Mazzoni, Camila J., Böhne, Astrid, Winkler, Sylke, Greve, Carola, Hiller, Michael
Format: Article
Language:English
Published: BioMed Central 2025
Online Access:http://psasir.upm.edu.my/id/eprint/120192/
http://psasir.upm.edu.my/id/eprint/120192/1/120192.pdf
Description
Summary:Museum collections harbor millions of samples, largely unutilized for long-read sequencing. Here, we use ethanol-preserved samples containing kilobase-sized DNA to show that amplification-free protocols can yield contiguous genome assemblies. Additionally, using a modified amplification-based protocol, employing an alternative polymerase to overcome PCR bias, we assemble the 3.1 Gb maned sloth genome, surpassing the previous 500 Mb protocol size limit. Our protocol also improves assemblies of other difficult-to-sequence molluscs and arthropods, including millimeter-sized organisms. By highlighting collections as valuable sample resources and facilitating genome assembly of tiny and challenging organisms, our study advances efforts to obtain reference genomes of all eukaryotes.