高山流水 (Gaoshan Liushui) blog http://blog.sciencenet.cn/u/friendpine A nameless wanderer on the road of science, just quietly walking on...

Batch downloading sequences from NCBI

16791 views | 2016-12-30 23:16 | Personal category: Bioinformatics and Computational Biology | System category: Research Notes

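It has been a long time since I last worked with large numbers of sequences, and I found that my old scripts no longer work, so I looked the topic up again on Bing. Below are a few of the simplest methods, as summarized by an expert. From https://edwards.sdsu.edu/research/ncbi-sequence-or-fasta-batch-download-using-entrez/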
Three easy ways to download multiple sequences from NCBI

There are several ways to download multiple sequences from the NCBI databases in a single request.

1) Using the batch Entrez website

http://www.ncbi.nlm.nih.gov/sites/batchentrez

2) Using Perl: (copy into your terminal and press return/enter)

perl -e 'use LWP::Simple;getstore("http://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi?db=nucleotide&rettype=fasta&retmode=text&id=".join(",",qw(6701965 6701969 6702094 6702105 6702160)),"seqs.fasta");'

The one-liner takes the IDs separated by spaces (inside qw(...)) and the name of the FASTA file that will be written (seqs.fasta). If you are fetching from a database other than nucleotide, change the db value in the URL as well.

3) Using your browser: (paste this into the address bar)

http://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi?db=nucleotide&rettype=fasta&retmode=text&id=6701965,6701969,6702094,6702105,6702160
This time the IDs are separated by commas. As before, if you need data from a different database, just change the db value in the URL.
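
The Perl one-liner and the browser URL above both send the same efetch request, so the same thing can be sketched in Python using only the standard library. This is a minimal sketch and not part of the original post: the function names build_efetch_url and download_fasta are hypothetical, and the https scheme is an assumption (the original URLs use http).

```python
import urllib.parse
import urllib.request

# Same efetch endpoint as in the examples above; https is an assumption here.
EFETCH = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"


def build_efetch_url(ids, db="nucleotide"):
    """Return an efetch URL that requests the given IDs as FASTA text.

    ids is any iterable of IDs (strings or integers); db is the Entrez
    database name, which must be changed for non-nucleotide data.
    """
    params = {
        "db": db,
        "rettype": "fasta",
        "retmode": "text",
        # efetch expects the IDs as one comma-separated value
        "id": ",".join(str(i) for i in ids),
    }
    # safe="," keeps the commas literal instead of encoding them as %2C
    return EFETCH + "?" + urllib.parse.urlencode(params, safe=",")


def download_fasta(ids, filename="seqs.fasta", db="nucleotide"):
    """Fetch the sequences in a single request and save them to filename."""
    urllib.request.urlretrieve(build_efetch_url(ids, db), filename)
```

For example, download_fasta("6701965 6701969 6702094 6702105 6702160".split()) should write the same five records to seqs.fasta as the Perl version, and passing db="protein" switches the database.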




https://blog.sciencenet.cn/blog-54276-1024373.html
