• Research
  • People
  • Publications
  • Software & Databases
  • News & Galary
  • Join us

Wang Lab

NCBIminer

June 17, 2016

NCBIminer is freely available, cross-platform and user-friendly software for mining nucleotide sequence data from GenBank. It has several features that enable users to accurately and efficiently download sequences with specific attributes from the GenBank database:

1) it uses a novel search strategy, and can download sequences for distantly related taxonomic groups with high accuracy;

2) it deals with genes, CDS, rRNA, and other GenBank-defined feature types;

3) it can filter sequences by length and similarities with the reference sequence using user-defined parameters;

4) it can download information on DNA sample collections, e.g. voucher specimen, country, latitude and longitude, and collector;

5) it takes advantage of parallelization for a high efficiency workflow. We demonstrate the use and performance of NCBIminer by downloading sequences for the plant family Campanulaceaes. Compared to other methods, NCBIminer harvests more and longer sequences, and is less sensitive to query sequences.

Install:
If you have MATLAB Compiler Runtime (MCR) v8.4 (Or MATLAB R2014b) installed, download this file and run. Ortherwise download MCR version of R2014b (8.4) from http://cn.mathworks.com/products/compiler/mcr/ and install MCR. 

Citation:
Xu, X., Dimitrov, D., Rahbek, C. and Wang, Z. (2015), NCBIminer: sequences harvest from Genbank. Ecography, 38: 426–430. doi:10.1111/ecog.01055

Download NCBIminer_v3.01_win_64 (2017-02-20) for Windows

Download source code

NCBIminer for linux 64 is coming soon...

< top >  

Zhiheng Wang, College of Urban & Environmental Sciences, Peking University, 5 Yiheyuan Road, Beijing 100871, China;

Email: Zhiheng.wang@pku.edu.cn     Last updated 2023/08/28