不积小流 , 无以成江海 !分享 http://blog.sciencenet.cn/u/xiongchaoliang

博文

HISAT用法详解

已有 5826 次阅读 2015-9-5 19:44 |个人分类:【技术-软件】|系统分类:科研笔记

HISAT Pipeline (http://www.ccb.jhu.edu/software/hisat/manual.shtml)
(1).Indexing a reference genome

$hisat-build  /reference/mouse/mm9.fa  /reference/mouse/mm9

#The command should print many lines of output then quit. When the command

completes, the directory will contain ten new files that all start with

`mm9` and end with `.1.bt2`, `.2.bt2`, `.3.bt2`, `.4.bt2`, `.5.bt2`, `.6.bt2`,

`.rev.1.bt2`, `.rev.2.bt2`, `.rev.5.bt2`, and `.rev.6.bt2`.  These files constitute the index - you're done!

 

(2).Aligning reads

$hisat  -X  /reference/mouse/mm9  -1  /Data/reads_1.fq  -2  /Data/reads_2.fq  -S  reads.sam


(3).Use `samtools view` to convert the SAM file into a BAM file

$samtools  view  -bS  reads.sam  >  reads.bam


(4).Use `samtools sort` to convert the BAM file to a sorted BAM file

$samtools  sort  reads.bam  reads.sorted

# We now have a sorted BAM file called `reads.sorted.bam`. Sorted BAM is a useful

format because the alignments are (a) compressed, which is convenient for

long-term storage, and (b) sorted, which is conveneint for variant discovery.


References:
-------------------------------
1.HISAT: a fast spliced aligner with low memory requirements




https://blog.sciencenet.cn/blog-1509670-918725.html

上一篇:五个著名的variant databases:
下一篇:EditPlus的安装、设置和使用
收藏 IP: 159.226.43.*| 热度|

0

该博文允许注册用户评论 请点击登录 评论 (0 个评论)

数据加载中...
扫一扫,分享此博文

Archiver|手机版|科学网 ( 京ICP备07017567号-12 )

GMT+8, 2024-6-30 05:56

Powered by ScienceNet.cn

Copyright © 2007- 中国科学报社

返回顶部