Large dataset downloading made easy

Sequence databases, such as those hosted by NCBI, are an important resource in many areas of science. Downloading a small number of sequences to local storage is easy with any recent web browser, but downloading tens of thousands of sequences is not as simple.

NCBI Mass Sequence Downloader is an open source program that simplifies downloading large amounts of sequence data from NCBI databases to local storage. It is written in Python (it runs under both Python 2 and Python 3) and uses PyQt5 for the GUI. The program can be run in either graphical or command line mode.
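
To illustrate why large downloads are usually split into batches, here is a minimal Python sketch that builds paged requests for the public NCBI E-utilities EFetch endpoint. It is not taken from the program's source code, and the program may work differently. The function names, the batch size and the WebEnv and query_key values are hypothetical placeholders; in practice the last two would be returned by an earlier ESearch call made with usehistory=y.

```python
from urllib.parse import urlencode

# Public NCBI E-utilities EFetch endpoint.
EFETCH_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"


def batch_offsets(total, batch_size):
    """Yield (retstart, retmax) pairs that together cover `total` records."""
    for start in range(0, total, batch_size):
        yield start, min(batch_size, total - start)


def build_efetch_url(database, web_env, query_key, retstart, retmax):
    """Build one EFetch URL requesting a batch of records in FASTA format."""
    params = {
        "db": database,
        "WebEnv": web_env,        # placeholder: returned by ESearch (usehistory=y)
        "query_key": query_key,   # placeholder: returned by ESearch (usehistory=y)
        "retstart": retstart,     # index of the first record in this batch
        "retmax": retmax,         # number of records in this batch
        "rettype": "fasta",
        "retmode": "text",
    }
    return EFETCH_URL + "?" + urlencode(params)


if __name__ == "__main__":
    # Hypothetical search with 1200 hits, fetched 500 records at a time.
    for start, count in batch_offsets(1200, 500):
        print(build_efetch_url("nuccore", "EXAMPLE_WEBENV", 1, start, count))
```

Each printed URL covers one batch, so an interrupted download can resume from the last completed batch instead of starting over.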

The source code is licensed under the GPLv3, and the program is supported on Linux, Windows and Mac OS X. It is available on GitHub in both the StuntsPT and ElsevierSoftware repositories.

Associated publication

Responsible contact
f.pinamartins@gmail.com

Responsible organization

EDAM Topic

If you use this resource, please acknowledge: BioData.pt – Infraestrutura Portuguesa de Dados Biológicos refª 22231/01/SAICT/2016, funded by Portugal 2020.