Kevin2015的个人博客分享 http://blog.sciencenet.cn/u/Kevin2015

博文

chip-seq 数据分析下游1--Rcade

已有 3445 次阅读 2016-5-11 14:47 |个人分类:知识点专题|系统分类:科研笔记

CHIP-SEQ后期数据解析

第一部分:chip-seq+rna-seq

这里详细研究一下Rcade: R-based analysis of ChIP-seq And DifferentialExpression data

Rcade is a bioconductor tool that analyses ChIP-seq data by coupling the ChIP-seq resultsto an existing Differential Expression(DE) analysis. Rcade enables inferring direct targets of a transcription factor (TF) - thesetargets should exhibit TF binding activity, and their expression levels shouldchange in response to TF binding to gene associated regulatory elements.

1.      安装相关包

RglbiomaRtRcade

2.      Data地址

library("Rcade")

dir <- file.path(system.file("extdata", package="Rcade"), "STAT1")

dir

3.      DE analysis. 建议使用limm处理差异基因。

DE <- read.csv(file.path(dir,"DE.csv"))

DElookup <-list(GeneID="ENSG", logFC="logFC",B="B","Genes.Location", "Symbol")

4.       Chip-seq analysis

dir(dir, pattern = ".bam")

targets <- read.csv(file.path(dir,"targets.csv"), as.is = TRUE)

targets

5.       Annotation

anno <- read.csv(file.path(dir,"anno.csv"))

anno <- anno[order(anno$chromosome_name),]

colnames(anno) <-c("ENSG","chr","start","end","str")

ChIPannoZones <-defineBins(anno, zone=c(-1500, 1500), geneID="ENSG")

6.       Prior specification

DE.prior = 0.01

prior.mode = "keepChIP"

prior = c("D|C" = 0.05, "D|notC" = 0.005)

7.      Analysis

UseRcadeAnalysis function:

Rcade <-RcadeAnalysis(DE, ChIPannoZones,annoZoneGeneidName="ENSG", ChIPtargets=targets,ChIPfileDir = dir, DE.prior=DE.prior,

prior.mode=prior.mode,prior=prior,DElookup=DElookup)

Rcade

The Rcade object stores information from the analysis. The DE, ChIP andRcade analysis can be then stored in seperate objects.

xDE <-getDE(Rcade)

xChIP <-getChIP(Rcade)

xRcade <- getRcade(Rcade)

8.       Plottingand QC

1. PrincipleComponent Analysis on the counts with the plotPCA function

plotPCA(Rcade)

2. The MMplot shows log-ratios from DE plotted against log-ratios from the ChIP-seq

plotPCA(Rcade)

plotMM(Rcade)

3. PlotBBB() plots log-odds values forChIP-seq, DE and the combined ChIP-seq/DE analysis. Youneed to install the rglpackage.

library(rgl)

plotBBB(Rcade)

Exportresults

exportRcade(Rcade,directory="RcadeOutput",cutoffArg=2000)

moreadvanced export options are available:

exportRcade(Rcade,directory="RcadeOutput", cutoffMode="top", cutoffArg =1000,

justGeneID=FALSE,removeDuplicates="beforeCutoff")

Exportfiles

This function exportsRcade output to disk - speci_cally, it creates the following _les:

_ ChIP.csv

_ ChIPonly.csv

_ DEandChIP.csv

_ DownChIP.csv

_ Down.csv

_ DownNoChIP.csv

_ Nothing.csv

_ UpChIP.csv

_ Up.csv

_ UpNoChIP.csv

Each _le containsgenes appropriate to its hypothesis, sorted by descending B value (i.e. rankedfrom

most interesting toleast interesting). For example, if you wanted the genes that display DE(either up

or down) and also haveChIP signal present, you would look at the top rows of DEandChIP.csv. For

genes that have a ChIPsignal but explicitly show no DE, use ChIPonly.csv.

Rcade is an extremelyuseful package for downstream analysis of ChIP-seq data. It goes beyond peak

or gene lists andenables identi_cation of direct targets of transcription factors. These can beused in

combination withNetwork Biology methods to unravel complexities of biological systems.




https://blog.sciencenet.cn/blog-2609994-976509.html

上一篇:chip-seq 数据前期处理
下一篇:R pipeline for splicing analysis
收藏 IP: 220.163.147.*| 热度|

0

该博文允许注册用户评论 请点击登录 评论 (0 个评论)

数据加载中...

Archiver|手机版|科学网 ( 京ICP备07017567号-12 )

GMT+8, 2024-9-2 06:15

Powered by ScienceNet.cn

Copyright © 2007- 中国科学报社

返回顶部