郑爱萍的个人博客分享 http://blog.sciencenet.cn/u/zhengaiping

博文

如何计算FPKM值

已有 18715 次阅读 2014-10-14 12:09 |系统分类:科研笔记

FPKM, Fragments Kilobase of exon model per millon mapped reads, which can be used to indicate the expression (abundance) characteristics of genes. Now I will describe operation about obtaining interested gene FPKM value.

1.Software Download

1).fastq-dump: convert sra file to fastq file.

 website:http://www.ncbi.nlm.nih.gov/Traces/sra/sra.cgi?view=software 

2).bowtie:an ultrafast and memory efficient tool for aligning sequencing reads to long reference sequences.

 website:http://bowtie-bio.sourceforge.net/bowtie2/index.shtml 

3).cufflinks:assembles transcripts, estimates their abundances, and tests for differential expression and regulation in RNA-Seq samples.

 website:http://cufflinks.cbcb.umd.edu/ 

4).gffread: convert gff3 file to gtf file.

 website:http://cufflinks.cbcb.umd.edu/ (This program is included with cufflinks package)

2. Operation

1) Download genome.fa and genes.gff3 file from genome website; Download sra file from NCBI

2) Format conversion

 $ fastq-dump -I --split-files SRR123456789.sra # convert sra file to fastq file

 $ gffread -E genes.gff3 -o genes.gtf # convert gff3 file to gtf file

3) Index files

 $bowtie2-build genome.fa genome

4) Alignment

 $bowtie2 -x genome -1 SRR123456789_1.fastq -2 SRR123456789_2.fastq -S SRR123456789.sam

 $samtools view -bS SRR123456789.sam > SRR123456789.bam

 $samtools sort SRR123456789.bam SRR123456789

5) FPKM values

 $cufflinks SRR123456789.bam -G genes.gtf -o result

After these operations, we can extract FPKM values from genes.frkm_tracking file based on gene ID.

 

 



https://blog.sciencenet.cn/blog-867017-835568.html

上一篇:和先正达公司正式签订合作协议
下一篇:“973”计划“十三五”农业领域“西部微生物资源利用”研讨会
收藏 IP: 218.6.135.*| 热度|

1 李万峰

该博文允许注册用户评论 请点击登录 评论 (2 个评论)

数据加载中...
扫一扫,分享此博文

Archiver|手机版|科学网 ( 京ICP备07017567号-12 )

GMT+8, 2024-5-1 23:53

Powered by ScienceNet.cn

Copyright © 2007- 中国科学报社

返回顶部